Trust us bro
from runching@lemmy.world to privacy@lemmy.ml on 16 Aug 01:47
https://lemmy.world/post/34515421

Lol, saying you are “beginning a process designed to delete your data” is a very different thing to actually deleting your data.

#privacy

humble_boatsman@sh.itjust.works on 16 Aug 02:28 next collapse

You guys are still using google??

But seriously, Holy shit I just signed GOT INTO/OPENED UP/JUMPED on to DDG on PC after forever and had to opt out of a bunch of AI shit. We are so fucked.

Ilandar@lemmy.today on 16 Aug 02:39 next collapse

You guys are still signing into search engines?

sunzu2@thebrainbin.org on 16 Aug 03:52 next collapse

🫵

ook@discuss.tchncs.de on 16 Aug 06:27 collapse

You guys are still guys?

Truscape@lemmy.blahaj.zone on 16 Aug 04:38 next collapse

Luckily both LibreWolf and IronFox have integrated “no AI” DDG as the default search

Zachariah@lemmy.world on 16 Aug 05:11 next collapse

-noai

dropped_packet@lemmy.zip on 16 Aug 06:43 next collapse

SearXNG is nice

tarknassus@lemmy.world on 16 Aug 18:49 next collapse

Fortunately DDG are opt out, and short of cookie sessions expiring it seems to stick.

Unlike a certain set of other “search” engines that are slowly changing into AI chat bot outputs with zero opt-out abilities besides using some hacky tricks to avoid it.

NewNewAugustEast@lemmy.zip on 16 Aug 19:52 collapse

That is bullshit. DDG will give you an ai answer, infrequently, and ask if you want more, less, or none. And that is just a result. To continue the AI option of asking follow up questions you have to opt in.

We are fucked for lots of reasons, this isn’t one of them.

Edit: really, people disagree when you can prove this to be true? WTF, is Lemmy now just reddit? Not even going to comment on why you disagree?

humble_boatsman@sh.itjust.works on 17 Aug 01:01 collapse

The fact that the pervasiveness of AI (insinuating its lack of concern for higher-privacy users) has seeped completely through to products we are choosing for their privacy focus is indicative of being fucked. We are also fucked for many other reasons.

I would like to reiterate a comment above that says they actively reduce their use of all types of services for this reason. The greatness of the internet is being squashed by the need both to protect our personal information and to resist corporate enshittification.

I’m not saying I’m gonna stop using DDG or that it is somehow the problem or is even bad. I’m just saying this OP from Google is seeping everywhere.

NewNewAugustEast@lemmy.zip on 17 Aug 01:25 collapse

Your original post makes it sound like duckduckgo is making you do this. It really isn’t. You have to truly opt in to use it. They do ask you, how often do you want to see it, and you can set a preference.

I think they are in a tough spot because users are going to say if they don’t have it, they are behind and a bad search engine for it. Yet if they offer it, people are going to complain about that.

I think they are trying hard for a middle ground.

sunzu2@thebrainbin.org on 16 Aug 02:33 next collapse

They're telling you nothing that's legally binding.

It is a "trust me bro". You shouldn't trust a known stalker!

WhatGodIsMadeOf@feddit.org on 16 Aug 02:37 next collapse

Remove it from view, lol.

Everything on the Internet is permanent. If not by the company then by the NSA. Regardless of where you reside.

kibiz0r@midwest.social on 16 Aug 05:05 next collapse

We looked at the ROI of actually deleting vs “basically mostly virtually indistinguishable from deleting”, and well… I mean, we take your privacy very seriously.

[deleted] on 16 Aug 05:30 next collapse

.

djmikeale@feddit.dk on 16 Aug 06:17 next collapse

As a person working in a field close to data engineering this sounds like they’re actually honest about the process.

Tldr: it’s not possible to “just delete” everything at once, even though we’d love to be able to.

There are so many layers where information is stored, and such insane amounts of data in their data platform, that running a clean-up job to delete a single person’s data across OLTP databases, data lakes, DWHs, backups, etc. would be both expensive and inefficient. Instead, they do it in stages: flip a flag somewhere (is_deleted = true), which removes the data from view initially, and then run periodic clean-up jobs.
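A minimal sketch of that two-stage pattern, using an in-memory SQLite table. The table name `photos`, the `deleted_at` column, and the `grace` period are all assumptions made up for illustration; only the `is_deleted` flag comes from the comment above, and a real platform would repeat something like this across many separate stores.

```python
import sqlite3


def make_db():
    """Create a throwaway in-memory database with a hypothetical photos table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE photos ("
        " id INTEGER PRIMARY KEY,"
        " owner TEXT NOT NULL,"
        " is_deleted INTEGER NOT NULL DEFAULT 0,"
        " deleted_at INTEGER)"
    )
    return conn


def soft_delete(conn, owner, now):
    """Stage 1: flip the flag so the rows disappear from view immediately."""
    conn.execute(
        "UPDATE photos SET is_deleted = 1, deleted_at = ?"
        " WHERE owner = ? AND is_deleted = 0",
        (now, owner),
    )


def visible_ids(conn, owner):
    """What the user-facing product would show: only unflagged rows."""
    rows = conn.execute(
        "SELECT id FROM photos WHERE owner = ? AND is_deleted = 0 ORDER BY id",
        (owner,),
    )
    return [row[0] for row in rows]


def purge(conn, now, grace):
    """Stage 2: periodic job that physically removes rows flagged long enough ago."""
    cur = conn.execute(
        "DELETE FROM photos WHERE is_deleted = 1 AND deleted_at <= ?",
        (now - grace,),
    )
    return cur.rowcount
```

Between `soft_delete` and the first `purge` run that is past the grace period, the rows are hidden but still physically present, which is the gap the "beginning a process" wording describes.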

dropped_packet@lemmy.zip on 16 Aug 06:41 next collapse

Sounds like a great reason not to use their services

djmikeale@feddit.dk on 16 Aug 06:51 next collapse

This is any company, government, or other organisation with 80+ employees. The two other alternatives are

  1. Have all data in Excel with no data governance, robust procedures, or trust in data, as the organisation grows in size
  2. Use only external tools (which in turn are owned by organisations that work like I described in my parent comment)

I’d love to hear if there are other ways of doing this stuff that actually work, but so far I just haven’t experienced any in my career.

dropped_packet@lemmy.zip on 16 Aug 06:53 next collapse

I’m not disputing the technical aspect. But due to these realities I prefer to drastically limit the services I interact with.

djmikeale@feddit.dk on 16 Aug 06:57 collapse

Aha I misunderstood, thanks for clarifying.

Actually, for this specific context, there’s an easy solution: I reckon for LLMs self-hosting would be the way to go, if your hardware supports it. I’ve heard a lot of the smaller models have gotten a lot more powerful over the last year.

dropped_packet@lemmy.zip on 16 Aug 07:02 collapse

Small fine tuned models seem to be where the market as a whole is headed. Even the big players like OpenAI/Google/Meta are doing this as a means to optimize infrastructure. The Qwen3 models have been really interesting to work with.

phoenixz@lemmy.ca on 16 Aug 19:37 collapse

Or, optionally, host it yourself

djmikeale@feddit.dk on 16 Aug 23:34 collapse

Good point!

manuallybreathing@lemmy.ml on 16 Aug 07:23 collapse

I mean this in the most polite way possible, but it seems like you’ve never read a privacy policy before

dropped_packet@lemmy.zip on 16 Aug 07:37 collapse

What makes you say that?

kadup@lemmy.world on 16 Aug 23:57 collapse

A photo I deleted 10 years ago resurfaced on my Google Drive account recently.

I’m sure it was deleted, and it had never appeared before until now.

But sure, they’re being honest!

djmikeale@feddit.dk on 17 Aug 09:31 collapse

en.m.wikipedia.org/wiki/Hanlon's_razor

kadup@lemmy.world on 17 Aug 14:19 collapse

Doesn’t apply, nor matter.

Malice or not, their systems didn’t delete my photo, that’s the point.

sakuragasaki46@feddit.it on 16 Aug 07:40 collapse

Cassandra is a database designed to make data as available as possible at the cost of possible inconsistency

When data is deleted from Cassandra it’s replaced by a marker named ‘tombstone’

However backups, deep backups, and copies made on purpose for governments may exist

Law and advertisers mandate some data not being deletable

bignate31@lemmy.world on 16 Aug 23:12 collapse

You know why they “tombstone”? (By the way, they don’t replace with a tombstone marker but instead add the marker.)

Because if you “accidentally” deleted something and then decided you wanted it back, you’d get really mad if they couldn’t do that. If they immediately deleted it, you couldn’t ever get it back

The copies and deep copies are for a similar reason: if some engineer accidentally deletes a bunch of data, it’s really nice to have a backup so you don’t lose everything.
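For anyone curious what "add a tombstone marker" looks like in practice, here is a toy sketch. It is not Cassandra's actual implementation: `ToyStore`, its `grace` parameter, and the integer timestamps are invented for illustration. It only shows the general idea that a delete appends a marker, reads honour the marker, and a later compaction pass is what physically drops the old value and, once the grace period has passed, the marker itself.

```python
TOMBSTONE = object()  # sentinel meaning "this key was deleted"


class ToyStore:
    """Toy append-only key-value log with tombstones (illustration only)."""

    def __init__(self, grace):
        self.grace = grace  # how long a tombstone must survive before removal
        self.log = []       # list of (key, value, timestamp), never edited in place

    def put(self, key, value, ts):
        self.log.append((key, value, ts))

    def delete(self, key, ts):
        # A delete is just another write: the old value stays in the log.
        self.log.append((key, TOMBSTONE, ts))

    def get(self, key):
        latest = None
        for k, value, ts in self.log:
            if k == key and (latest is None or ts >= latest[1]):
                latest = (value, ts)
        if latest is None or latest[0] is TOMBSTONE:
            return None
        return latest[0]

    def compact(self, now):
        # Keep only the newest entry per key, then drop expired tombstones.
        newest = {}
        for k, value, ts in self.log:
            if k not in newest or ts >= newest[k][1]:
                newest[k] = (value, ts)
        self.log = [
            (k, value, ts)
            for k, (value, ts) in newest.items()
            if not (value is TOMBSTONE and now - ts >= self.grace)
        ]
```

Until `compact` runs, the deleted value is still sitting in the log, even though `get` already reports it as gone.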