What is FOSS answer to BingGPT & Google Bard?

Ganesh Venugopal@lemmy.ml to Linux@lemmy.ml – 108 points –

There was a post asking people their opinions about Edge and many people seemed to liked the idea of Edge and seemed to be ok having it on Linux (Blasphemy)

Also, can we all agree how fast Edge went from joke to a threat? I mean, it's good now alright! It was good back then, but it's better now. Money man!!! Money! Personally I hate MS, but I can't help but see the fact that there is no alternative to Bing GPT and many features Bing offers on Linux.

If there is an open source ChatGPT how would it look? Who would bear the costs? How would we solve the server problem? i.e., it would take a ton of server space and bandwidth. Just wondering.

I am pretty sure MS products will improve greatly due to their integration with GPT what do us poor folks on Linux do?

Just want to know the answers, I don't want to discuss (aka can't comment, I need to study), but just curious!

32

KoboldAI has an "KoboldGPT" mode, which is for answering questions. You can try it at https://lite.koboldai.net/ or run it on your own.

Here's an example conversation I just had with it (emoji icons added):


πŸ€– Hello, I am KoboldGPT, your personal AI assistant. What would you like to know?

πŸ§‘ What is the name of the original version of the game Monopoly?

πŸ€– The original version of the game Monopoly is called "The Landlord's Game." It was created by Elizabeth Magie in the early 20th century. However, the most well-known version of Monopoly today is based on a variant called "Monopoly" that was designed by Charles Darrow in 1935.

πŸ§‘ Did "The Landlord's Game" have any purpose besides being a board game?

πŸ€– Yes, "The Landlord's Game" had a specific purpose beyond being a board game. Elizabeth Magie created it as a way to illustrate the negative aspects of concentrating land in private monopolies. She believed that it could be used as a tool for teaching economic concepts such as rent, taxes, and the distribution of wealth.

πŸ§‘ What economic ideology was associated with this game?

πŸ€– Elizabeth Magie was influenced by Henry George's philosophy of Georgism. Georgists believe that land should belong to all people equally, and that taxation should be based on the value of land rather than on income or production.

As others said you can host Koboldai on your own, and if you don't have a powerful GPU, anyone can use powerful llm models via the AI horde

i don't know a single thing about whatever these things your talking about are, but i just want to say, you're the first person i've ever seen suggest that edge isn't a joke.

There are quite a few of them out there. Just a few from my notes (in reverse chronological order)

The self-hosted AI/ML system has been here for a while. Granted, the vast majority of them require downloading pre-trained models before they can be used due to how much it costs to build a system of weights from scratch.

I sometimes wonder if it would be possible to build and train a truly open source model with BOINC or something. The last 30 years of history show that it's entirely feasible to build a massively distributed computing cluster, why not leverage this to build a model? I know how naive that sounds immediately after writing it, mostly because of the difficulty of getting a large enough training data set, which unfortunately has risk written all over it (read: people poisoning the model, ala Microsoft's experiment with Tay on birbsite some years back).

petals.dev It's basically bittorrent for AI. A public swarm for running and training LLMs.

Wao didn't know this one, impressive thx for sharing !

I think it's far more likely that easier solutions to hosting your own LLM/ChatGPT/etc will be pushed, rather than someone (or a group of someones) offering and maintaining it - as you mentioned, that takes a lot of resources. Think about the issues that various Fediverse software has had in terms of keeping up with the load sometimes, and LLMs use way more than most Fediverse applications. Especially if you want it to be as snappy as the current ones already out there.

For images there are foss options that you can run locally, like stable diffusion which is so good that it rivals its proprietary counterparts.
But for text it's a horror, there are some you can try (see gpt4all), but in general chatgpt has no real competition; the foss options are currently very bad, and even the proprietary options from big corporations like bard or llama are pitiful.

but I can’t help but see the fact that there is no alternative to Bing GPT

Regarding this, there are other services that also use GPT for search, here a couple:

https://www.perplexity.ai/

https://www.phind.com/

Maybe something like this? https://huggingface.co/ https://huggingface.co/chat/

The model seems to be based on the Llama model (but not only, there seem to be multiple models). Tho I didn't really look into it on how they get their database. It also has more hallucinations than chatgpt.

Huggingchat is pretty good. And the model is Open source. No need for a phone number in your account and not by a company whose name is hypocritical even.

"What's the date?"

I'm not able to provide you with the current date as I am a text-based AI language model and do not have access to real-time information. However, I can tell you the date based on my knowledge cutoff, which is December 2022. Please note that this information may not be up-to-date or accurate. Is there anything else I can help you with?

You won't get the date.

The date mentioned in the answer may also be an halucination from the bot, depending on the dataset.

And an answer to Bing chat / bard is pretty hard to do if something like this even exists, as the bot has to be able to search the Web.

I heard about LocalAI, but I never tried it.

I've tried it, very practical. It's a drop in replacement for OpenAI APIs, so you can work with other tools that use their models and API as back end.

At work today, I just heard the term, "Federated Machine Learning" - where individual corporations keep their proprietary learning data, but make the models shareable. I wonder if something similar could be done with activitypub?

(Feel free to steal this idea if it's worth stealing)

This doesn't work when an important part of the process is making sure your data is actually good. If the data is proprietary, there is no way to make sure it is usable.

Like, have you even tried a search or even attempted to look up anything? There is a lot. Most of it boils down to stay under free limits of street paying for compute and run your own. It's actually very trivial to run a slightly low effective model on a 4090. Liked you can be up and running inside a few minutes if you already know pip

Such a wildly helpful comment. Keep on perpetuating the type of attitude that this community is known for!