It appears to be as simple as tesseract <infile><outfile>. Possibly could even pipe (or tee) the screenshot straight into that and save both an image and a text file in a single command line.
I found a small command to run KDE Spectacle (screenshot software) with Tesseract so I can OCR a screenshot if I want to, I only had to install Tesseract and a main language, you could easily do the same with an API and/or a local AI.
Can you search the screenshots with OCR though? That’s Recall’s main selling point
You can start by running
sudo apt install tesseract-ocr
and then reading its docs.Fulfills the AI quota 👍
It appears to be as simple as
tesseract <infile> <outfile>
. Possibly could even pipe (or tee) the screenshot straight into that and save both an image and a text file in a single command line.So something like this should do the trick:
Skip the database, just use
grep
to search that directory if you need to find anything. Voilà, homemade Recall.deleted by creator
I can’t imagine it’d be that hard to write some code that does that using an existing AI model.
I found a small command to run KDE Spectacle (screenshot software) with Tesseract so I can OCR a screenshot if I want to, I only had to install Tesseract and a main language, you could easily do the same with an API and/or a local AI.
You’re probably right.
Llava and Bakllava are two Ollama models than can not only extract text but also describe what’s happening on screen.
Using
tesseract-ocr
, as the other guy suggested, is probably simpler and less resource intensive though.This is a shitpost and not a real suggestion.