Passive OCR and other 'AI' tools on the Linux desktop

Artemis_Mystique@lemmy.ml to Linux@lemmy.ml – 98 points –

With modern CPU's supposedly shipping with 'AI cores': How long do you think it will take for a proper opensource, privacy respecting productivity tools(Something like whatever M$ copilot is supposed to be?) to be available?

Personally, i would love to see something like 'Passive' OCR integrated with the display server: the ability to pause any video and just select whatever text(even handwritten) there is naturally like it was a text document without any additional hassle will be really useful
Also useful in circumventing any blocks certain websites put on articles to prevent text from being copied

Or an AI grammar checker running natively for LibreOffice.

What are some AI tools you think should be developed for desktop Linux?

31

You are viewing a single comment

Like... now? Here are my notes about it https://fabien.benetou.fr/Content/SelfHostingArtificialIntelligence

You don't even need to wait for "AI" chips, "just" a high-end GPU will do.

Sure they are very very large models like Mistral or BLOOM you won't be able to run even on a 4090 (highest end gaming card right now) but there usually have lower quality versions that might give usable result.

IMHO though what I realized while testing all that at home is... it's rarely worth it. It's absolutely fun to play with, even interesting to learn about it all, but in terms of time/energy/ecology/costs versus result, so far it's been "meh". A cool experiment, like locally get transcript for my PeerTube server from the audio of my videos, but something that in fine I always end up not relying on.

It also allows me to do cool prototype, like code generation in XR, but again that's something I'd qualify as fun, not as productive.

TL;DR: it's feasible today but IMHO not worth it.

PS: best example would be Immich with it's optional ML, locally or not (as in serving content on a small Pi but doing the ML inference on your desktop)

you don’t even need to wait for “AI” chips, “just” a high-end GPU will do

Not everyone wants or can afford an expensive GPU, meanwhile the new AMD 8000G series looks highly compelling and I was just curious whether there are any roadmaps to productively incorporate their use on the Linux desktop in a similar way to what M$ is pushing with co-pilot(Obviously without the corporate surveillance)

If your Cool AI tool requires me to always connect to the internet and agree to a concerning EULA and privacy policies; i think i can live without it

If I were AMD or Intel I'd absolutely reserve the fancy new buzzword tech for only the highest spec cpus in order to give people a reason to overspend.

I expect only decently high end cpus would have this fancy "AI" tech.

You don't even need to wait for "AI" chips, "just" a high-end GPU will do.

You don't even need that. A decently high-end CPU will also work, just a good bit slower of course.

Mistral can be easily run on a 4090. I think you mean Mixtral.