Is anyone actually surprised by this?
No, this is just propaganda
This article is what US propaganda looks like folks. Mashable should be ashamed.
Literally all AI companies do this to run their services. Except you can actually download Deepseek and run it completely securely on your own devices. You know who doesn’t allow that security? OpenAI and the other US companies currently being screwed.
every google site has been doing this for years too. every comment we write in youtube and discard before posting, its being recorded. this isnt news at all.
This make the news only because it’s going to chinese servers. Didn’t see anything like that about ChatGPT or the one made by Google.
They should store the data in US servers like OpenAI does. Apparently then Mashable won’t write an article about it.
The criticism thrown at DeepSeek in the past days is just as applicable to American AI models. But when that was brought up it in the past it was “making things political”.
At least I can run DeepSeek locally.
We are now at a time where US blocks China services in order to protect its companies
Just like many US services are banned in China in Order to protect their companies
So, I hope no surprise…
———
Its or their for countries?
I feel like their is more common. I do deliberately say its for companies because companies aren’t people and don’t deserve people pronouns. Countries seem more like a collection of people, so I use their.
If someone knows more about grammar feel free to correct me.
the company states that it may share user information to "comply with applicable law, legal process, or government requests.
Literally every company’s privacy policy here in the US basically just says that too.
Not only does DeepSeek collect “text or audio input, prompt, uploaded files, feedback, chat history, or other content that [the user] provide[s] to our model and Services,” but it also collects information from your device, including “device model, operating system, keystroke patterns or rhythms, IP address, and system language.”
Breaking news, company with chatbot you send messages to uses and stores the messages you send, and also does what practically every other app does for demographic statistics gathering and optimizations.
Companies with AI models like Google, Meta, and OpenAI collect similar troves of information, but their privacy policies do not mention collecting keystrokes. There’s also the added issue that DeepSeek sends your user data straight to Chinese servers.
They didn’t use the word keystrokes, therefore they don’t collect them? Of course they collect keystrokes, how else would you type anything into these apps?
In DeepSeek’s privacy policy, there’s no mention of the security of its servers. There’s nothing about whether data is encrypted, either stored or in transmission, and zero information about safeguards to prevent unauthorized access.
This is the only thing that seems disturbing to me, compared to what we’d like to expect based on the context of what DeepSeek is. Of course, this was proven recently in practice to be terrible policy, so I assume they might shore up their defenses a bit.
All the articles that talk about this as if it’s some big revelation just boil down to “company does exactly what every other big tech company does in America, except in China”
Oh my, just wait until you learn what Facebook and Google do…
Yes, that is how these generative AI imementations work.
Yes, I’m going to be lectured on privacy by people who are still on twitter.
Just host it yourself?
By extension, anything that’s not self hosted means 3rd party actors snooping. American, Chinese, whoever happens to operate that machine.
You can’t just host the 632B model that the app uses lol
If you have the hardware, then yes, you can.
Ah, just acquire such hardware, very simple and anyone can do it without supply chain knowledge or advantage
Sorry but you are just talking assumptions without even having looked at the facts.
Its not cheap, but basically a single toptier gaming desktop with an additional graphics card (or 2) is literally all you need.
I know multiple people who work normal IT jobs that have already started on setting up their own. They plan on running them for their whole family, many users at a time from the same machine.
Here is someone who got it to work on a cluster of mac-minis. Again not cheap, but clearly within dedicated consumer enthusiast reach. https://digialps.com/deepseek-v3-on-m4-mac-blazing-fast-inference-on-apple-silicon/
And this is before even considering how fast open source moves, i am expecting quantized models which can have double speed for negligible quality impact any second now.
Did the American technology giants think they had the monopoly on capturing human input too?
Unrelated but yesterday I saw a post where the person was mocking those concerned by the chinese getting their data, saying things like “why would they care” and some people sarcastically saying they wouldn’t understand the data because “it was in another language”. Were those people right or not?
It’s a chinese company, where else would they store the data?
The balls.
I think its called a data lake, so they don’t “store” it, its rather floating around there 🤪
These lakes are formed when the cloud is saturated and gives us data precipitation.
thanks for the great picture 👍
so here is the current cloud clima forecast:
The saturated clouds will rain into the data lakes that are already overspilling here and there into the ransomstreams already taking all soil in their way with them. During the day there will be security clouds preventing from visible rain only while during the night those same security clouds rain themselves all collected data to their homelake while their homelake security already is corrupted and spills over regulary.
As soon as the fort-cisc-pal-ocstricken-redm-ondams breach it’ll gonna have floods with multi-exabyte waveheights and the ripples of the release will be felt over to far east china and the currents will circulate around the world multiple times causing damage and devastation in their wake around the world and eventually even reach connected orbit.
The floods will have the potential to also wash away and /or drown or choke all the big tech dinosaurs. Only small foss mammals and deep sea amphibics will survive this historic event.
… you kinda asked for it 😉 same as “they” kinda asked for it too. 🤔
Antarctica, clearly.
I’m confused. Isn’t “collecting keystroke data” just an alarmist way to describe text entry?
Not exactly. Timing between key presses can be used to identify people.
I am literally so paranoid I regularly vary my keysteoke rhythms and explore polyrhytmic techniques to create variations. Not even joking.
this. i mean, the session logs for the prompt are kept at least for your user, right?
Yes.