Is anyone actually surprised by this?

  • JOMusic@lemmy.ml
    link
    fedilink
    English
    arrow-up
    37
    arrow-down
    1
    ·
    2 days ago

    This article is what US propaganda looks like folks. Mashable should be ashamed.

    Literally all AI companies do this to run their services. Except you can actually download Deepseek and run it completely securely on your own devices. You know who doesn’t allow that security? OpenAI and the other US companies currently being screwed.

    • zeca@lemmy.eco.br
      link
      fedilink
      arrow-up
      2
      ·
      1 day ago

      every google site has been doing this for years too. every comment we write in youtube and discard before posting, its being recorded. this isnt news at all.

  • Ju135@lemmings.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    22 hours ago

    This make the news only because it’s going to chinese servers. Didn’t see anything like that about ChatGPT or the one made by Google.

  • geneva_convenience@lemmy.ml
    link
    fedilink
    arrow-up
    15
    ·
    1 day ago

    They should store the data in US servers like OpenAI does. Apparently then Mashable won’t write an article about it.

    The criticism thrown at DeepSeek in the past days is just as applicable to American AI models. But when that was brought up it in the past it was “making things political”.

    At least I can run DeepSeek locally.

  • Petter1@lemm.ee
    link
    fedilink
    arrow-up
    5
    ·
    edit-2
    1 day ago

    We are now at a time where US blocks China services in order to protect its companies

    Just like many US services are banned in China in Order to protect their companies

    So, I hope no surprise…

    ———

    Its or their for countries?

    • HumanPerson@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      17 hours ago

      I feel like their is more common. I do deliberately say its for companies because companies aren’t people and don’t deserve people pronouns. Countries seem more like a collection of people, so I use their.

      If someone knows more about grammar feel free to correct me.

  • ArchRecord@lemm.ee
    link
    fedilink
    English
    arrow-up
    32
    ·
    2 days ago

    the company states that it may share user information to "comply with applicable law, legal process, or government requests.

    Literally every company’s privacy policy here in the US basically just says that too.

    Not only does DeepSeek collect “text or audio input, prompt, uploaded files, feedback, chat history, or other content that [the user] provide[s] to our model and Services,” but it also collects information from your device, including “device model, operating system, keystroke patterns or rhythms, IP address, and system language.”

    Breaking news, company with chatbot you send messages to uses and stores the messages you send, and also does what practically every other app does for demographic statistics gathering and optimizations.

    Companies with AI models like Google, Meta, and OpenAI collect similar troves of information, but their privacy policies do not mention collecting keystrokes. There’s also the added issue that DeepSeek sends your user data straight to Chinese servers.

    They didn’t use the word keystrokes, therefore they don’t collect them? Of course they collect keystrokes, how else would you type anything into these apps?

    In DeepSeek’s privacy policy, there’s no mention of the security of its servers. There’s nothing about whether data is encrypted, either stored or in transmission, and zero information about safeguards to prevent unauthorized access.

    This is the only thing that seems disturbing to me, compared to what we’d like to expect based on the context of what DeepSeek is. Of course, this was proven recently in practice to be terrible policy, so I assume they might shore up their defenses a bit.

    All the articles that talk about this as if it’s some big revelation just boil down to “company does exactly what every other big tech company does in America, except in China”

    • quant@leminal.space
      link
      fedilink
      English
      arrow-up
      11
      ·
      2 days ago

      By extension, anything that’s not self hosted means 3rd party actors snooping. American, Chinese, whoever happens to operate that machine.

        • dreadbeef@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          2
          ·
          1 day ago

          Ah, just acquire such hardware, very simple and anyone can do it without supply chain knowledge or advantage

          • webghost0101@sopuli.xyz
            link
            fedilink
            arrow-up
            1
            ·
            edit-2
            1 day ago

            Sorry but you are just talking assumptions without even having looked at the facts.

            Its not cheap, but basically a single toptier gaming desktop with an additional graphics card (or 2) is literally all you need.

            I know multiple people who work normal IT jobs that have already started on setting up their own. They plan on running them for their whole family, many users at a time from the same machine.

            Here is someone who got it to work on a cluster of mac-minis. Again not cheap, but clearly within dedicated consumer enthusiast reach. https://digialps.com/deepseek-v3-on-m4-mac-blazing-fast-inference-on-apple-silicon/

            And this is before even considering how fast open source moves, i am expecting quantized models which can have double speed for negligible quality impact any second now.

  • Zip2@feddit.uk
    link
    fedilink
    arrow-up
    48
    arrow-down
    1
    ·
    2 days ago

    Did the American technology giants think they had the monopoly on capturing human input too?

  • P4ulin_Kbana@lemmy.eco.br
    link
    fedilink
    arrow-up
    2
    ·
    1 day ago

    Unrelated but yesterday I saw a post where the person was mocking those concerned by the chinese getting their data, saying things like “why would they care” and some people sarcastically saying they wouldn’t understand the data because “it was in another language”. Were those people right or not?

    • smb@lemmy.ml
      link
      fedilink
      arrow-up
      4
      ·
      2 days ago

      I think its called a data lake, so they don’t “store” it, its rather floating around there 🤪

      • howrar@lemmy.ca
        link
        fedilink
        arrow-up
        5
        ·
        2 days ago

        These lakes are formed when the cloud is saturated and gives us data precipitation.

        • smb@lemmy.ml
          link
          fedilink
          arrow-up
          1
          ·
          edit-2
          2 days ago

          thanks for the great picture 👍

          so here is the current cloud clima forecast:

          The saturated clouds will rain into the data lakes that are already overspilling here and there into the ransomstreams already taking all soil in their way with them. During the day there will be security clouds preventing from visible rain only while during the night those same security clouds rain themselves all collected data to their homelake while their homelake security already is corrupted and spills over regulary.

          As soon as the fort-cisc-pal-ocstricken-redm-ondams breach it’ll gonna have floods with multi-exabyte waveheights and the ripples of the release will be felt over to far east china and the currents will circulate around the world multiple times causing damage and devastation in their wake around the world and eventually even reach connected orbit.

          The floods will have the potential to also wash away and /or drown or choke all the big tech dinosaurs. Only small foss mammals and deep sea amphibics will survive this historic event.

          … you kinda asked for it 😉 same as “they” kinda asked for it too. 🤔

  • grey_maniac@lemmy.ca
    link
    fedilink
    arrow-up
    40
    arrow-down
    1
    ·
    2 days ago

    I’m confused. Isn’t “collecting keystroke data” just an alarmist way to describe text entry?

    • uis@lemm.ee
      link
      fedilink
      arrow-up
      6
      arrow-down
      2
      ·
      2 days ago

      Not exactly. Timing between key presses can be used to identify people.

      • grey_maniac@lemmy.ca
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        1 day ago

        I am literally so paranoid I regularly vary my keysteoke rhythms and explore polyrhytmic techniques to create variations. Not even joking.