• JackGreenEarth@lemm.ee
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      5
      ·
      1 month ago

      Can you search the screenshots with OCR though? That’s Recall’s main selling point

      • Aux@lemmy.world
        link
        fedilink
        arrow-up
        57
        ·
        1 month ago

        You can start by running sudo apt install tesseract-ocr and then reading its docs.

        • MacN'Cheezus@lemmy.today
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          29 days ago

          It appears to be as simple as tesseract <infile> <outfile>. Possibly could even pipe (or tee) the screenshot straight into that and save both an image and a text file in a single command line.

          So something like this should do the trick:

          gnome-screenshot -f - | tee /Microsoft/yourPrivacy/$(date +%s).png | tesseract - /Microsoft/yourPrivacy/$(date +%s).txt
          

          Skip the database, just use grep to search that directory if you need to find anything. Voilà, homemade Recall.

        • not_amm@lemmy.ml
          link
          fedilink
          English
          arrow-up
          9
          ·
          30 days ago

          I found a small command to run KDE Spectacle (screenshot software) with Tesseract so I can OCR a screenshot if I want to, I only had to install Tesseract and a main language, you could easily do the same with an API and/or a local AI.

        • MacN'Cheezus@lemmy.today
          link
          fedilink
          English
          arrow-up
          3
          ·
          29 days ago

          Llava and Bakllava are two Ollama models than can not only extract text but also describe what’s happening on screen.

          Using tesseract-ocr, as the other guy suggested, is probably simpler and less resource intensive though.