Generative AI boom "could come to a fairly swift end"

floofloof@lemmy.ca to Technology@lemmy.ml – 165 points –
Generative AI boom "could come to a fairly swift end"
dezeen.com
72

Come on now, next you’ll be saying the tech industry consistently overplays its incremental improvements as Earth-shattering paradigm shifts purely for the investment money!

This message posted from the metaverse

Yup. As someone who works in tech, I was baffled by the number of people in my field who started freaking out about it. AI isn't some magic panacea, it's just another tool that needs to be designed for the task at hand. It's cool that ChatGPT can get 80% of the way there in so many fields, but that last 20% is where all the hard work is (see the pareto principle).

3 months ago: Everyone’s going to lose their jobs!

Today: Generative AI’s dead!

More realistically: Generative AI is a tool that will gradually get better over time. It is not universally applicable, but it does have a lot of potential applications. It is not going to take over the world, nor will it just suddenly go away.

IMO it'll be more like internet: society will take years to adapt to it and democratise its use. It took 30 years for Internet to bloom and it is now a primary service in Europe. I'm pretty sure AI will take this road.

That's pretty much been my take from the beginning. My main concerns were and still are:

  • IP law, specifically copyright infringement
  • correctness - ChatGPT makes stuff up
  • detection - esp for school

My main fear was that it would be more useful for scammers and fraudsters than legitimate uses because of the above issues. I still have those concerns.

With any new technology that people say well change the world overnight, take a step back and think it through. For example:

  • self driving cars - we still have taxis, Uber, etc, so it hasn't taken over despite being here for years
  • robotics in manufacturing - it's incredibly expensive to put together and end to end robotic factory, so there are still plenty of manufacturing jobs
  • automated fast food - again, the most I've seen is increased number of kiosks, that's it

And so on. People freak out about new tech, then a couple years later they realize that it's not "finished" and there will be plenty of time to adapt. Unless we recover an alien spaceship or something, that's just not how technology progresses. Eventually generative AI will redically change our society, but it'll take decades, so by the time your job is threatened, you'll be ready to retire.

"If hallucinations aren't fixable, generative AI probably isn't going to make a trillion dollars a year," he said. "And if it probably isn't going to make a trillion dollars a year, it probably isn't going to have the impact people seem to be expecting," he continued. "And if it isn't going to have that impact, maybe we should not be building our world around the premise that it is."

Well he sure proves one does not need an AI to hallucinate...

Clearly nothing can change the status quo if it doesn’t also make trillions

The assertion that our Earth orbits the sun is as audacious as it is perplexing. We face not one, but a myriad of profound, unresolved questions with this idea. From its inability to explain the simplest of earthly phenomena, to the challenges it presents to our longstanding scientific findings, this theory is riddled with cracks!

And, let us be clear, mere optimism for this 'new knowledge' does not guarantee its truth or utility. With the heliocentric model, we risk destabilizing not just the Church's teachings, but also the broader societal fabric that relies on a stable cosmological understanding.

This new theory probably isn't going to bring in a trillion coins a year. And if it probably isn’t going to make a trillion coins a year, it probably isn’t going to have the impact people seem to be expecting. And if it isn’t going to have that impact, maybe we should not be building our world around the premise that it is.

maybe we should not be building our world around the premise that it is

I feel like this is a really important bit. If LLMs turn out to have unsolvable issues that limit the scope of their application, that's fine, every technology has that, but we need to be aware of that. A fallible machine learning model is not dangerous; AI-based grading, plagiarism checking, resume-filtering, coding, etc. without skepticism is dangerous.

LLMs probably have very good applications that could not be automated in the past but we should be very careful of what we assume those things to be.

In the early 1980s, a teacher refused to let me word-process my homework (my penmanship was shit) on the grounds that I shouldn't be able to produce a paper at the touch of a button.

Upper management look at AI end results and imagine a similar scenario: they don't see the human effort behind the dumb-waiter and imagine a clerk can just tell an LLM to make me a sequel to Dumbo without getting very specific and then having a team of reviewers watch hundreds of terrible elephant films to curate the few good ones.

But what is telling is how our corporate bosses responded to the prospect of automated art. Much like the robot pizza company who did not automate the process and pass the savings on to you! (his offerings were typical pizza at typical prices and he kept all the savings for himself) our senior execs imagine ways to replace workers with cheaper automation rather than producing better stuff or cheaper movie tickets for their customers.

So maybe we should growl at them and change the system before they figure out how to actually pay fewer people while keeping more profits.

Companies will always keep all the savings and pass on all the expenses. That's just how they operate. You're not going to be able to change that system short of a revolution.

I can't believe this tech bubble will burst. All the other ones have fared so well.

Because they were far more useful to the average person, than the glorified spam making machine. Also it's not like something like this happened for the first time...

EDIT: forgot to grammar

Feels like a minds war waged by billionaires in this space when you're actually playing with this stuff. All this hype is a joke as are the proprietary junk. Get a decent comp and try offline AI yourself and see what it can do. Try Llama2 70B Q4 GGML. You need a machine with more than 10-12+ cores and at least 64GB of system memory. It really helps to have a Nvidia GPU with 16GB+ but you don't have to have that here. This model can write python snippets like you're searching stack overflow but an order of magnitude faster. If you know basic code elements, branching, and looping, this model can code, and resolve its errors when it gets something wrong by prompting it with the error message. A 30B like WizardLM or Vicuna are almost technically useful, but the 70B is a beast.

That doesn't sound simple by any measure.

What they're trying to say is: very soon these models will run on your smartphone without internet connection and OpenAi will be no more.

Kinda is, sure. The problem is when you become overly reliant on the tech without it being reliable. It's also kinda bad when it causes you to lose skills that you need to maintain it or further it.

I think they have some areas where they're very useful, but beyond those areas they're only OK at best. They don't come close to living up to the hype, which is mostly based on "the next version will be mind blowing!".

They are a new type of app, nothing more. New types of apps can be extremely useful, and make a lot of tasks easier, e.g. spreadsheets. I would say at best generative AI is as game changing as spreadsheets were, but maybe less.

The hype machine wants us to believe they are as revolutionary as the PC itself, or the car. In fact 10 times as revolutionary! I just don't buy it... at least not in the foreseeable future.

There was this point where VR gaming seemed like an inevitable successor to traditional gaming. It was everywhere and improving rapidly. There were core concerns, but most felt that those could be solved with time. The technology had so much potential.

This is how the current AI solutions feel to me right now. There are a small-ish group of people who find them very useful and use them often. There are a large group of people who are currently on the hype bandwagon, talking about all the potential they hold. But currently they have yet to truly hit mainstream use.

With VR, all that hype and potential seems largely dead. The promised advancements haven't seemed like enough to take over from traditional games, the fundamental issues haven't been fixed because they're too hard or too costly to fix.

I'm still unsure if AI will go this same route, or if it will eventually break into more mainstream. I think probably the most likely route is something like how Siri/Alexa worked out. Some people use voice assistants all the time, others basically never do. They never quite fully delivered on the revolution they promised, but they were useful enough to stick around. That's how I feel about the current AI approach.

I think long term we'll get some other approach that will once again kick off the AI hype machine, but the current AI approach is only going to find limited success because it's going to be really, really hard to get it to a place where you can reasonably trust the output.

We don't even know how they arrive at the output they arrive at, and it takes lengthy research just to find out how, say, an LLM picks the next word in an arbitrarily chosen sentence fragment. And that's for the simpler models! (Like GPT-2)

That's pretty crazy when you think about it.

So, I don't think it's fair to suggest they're just "a new type of app". I'm not sure what "revolutionary" really means but the technology behind the generative AI is certainly going to be applied elsewhere.

The paper that invented http had a not "interesting idea" from the researcher who reviewed it.

These AI are a revolution. But like all revolution it will take some time for the society to absorb it.

AI doesn’t seem to do well when it trains on its own data so I do think there’s a possibility it’s a one trick pony. Once there’s too much AI content in the data it’s trained on it will devolve into nonsense.

Ultimately, generative AI are tools, not magic. We're now past the hype phase and are now at the leveling out phase of the S-curve as people realizes that these things are limited.

I think ChatGPT is mostly going to be used as an automated copywriter for emails and resumes and such, whereas diffusion models will find their way into digital artists' workflow.

Life goes on.

Isn't ChatGPT's launch only less than 6 months old or something...

Reminds me of the article saying open ai is doomed because it can only last about thirty years with its current level of expenditure.

OpenAI must evolve into serving something other than generative AI.

The compute bills for OpenAI are crazy. They would need more paying customers to try and at least keep the service somewhat viable.

https://futurism.com/the-byte/chatgpt-costs-openai-every-day

Cost reduction in the field is orders of magnitude potential. Look at llama running on everything down to a raspy pi after 2 months.

There are massive gains to be made - once we have dedicated hardware for transformers, that’s orders of magnitude more.

See your phone being able to playback 24h of video but die after 3h of browsing? Dedicated hardware codec support

Yeah but Llama's quality cannot compete with ChatGPT models (Doesn't matter what model you use, if you want good and FAST results, you require serious compute). We do have commercial dedicated AI chips from NVDA, last time I checked you had to make an order to even get a price. George Hotz who is also working on something similar, by his account from a Lex Fridman podcast mentioned that a personal AI rig would have to be closer to a mainframe's size.

There's nothing I have seen so far that leads me to believe that generative AI gets more efficient with weaker hardware.

The trajectory is such that current L2 70B models are easily beating 3.5 and are approaching GPT4 performance - an A6000 can run them comfortably and this is a few months only after release.

Nah the trajectory is not in favor of proprietary, especially since they will have to dumb down due to alignment more and more

https://www.anyscale.com/blog/llama-2-is-about-as-factually-accurate-as-gpt-4-for-summaries-and-is-30x-cheaper?trk=feed_main-feed-card_feed-article-content

An A6000 ranges between $4500 and $7000 . We are a long long way from reaching efficiency on affordable consumer grade hardware.

A 30B model which will be fine for specialized tasks runs on a 3090 or any modern mac today.

We are months away from being affordable at current trajectory

Which modern Mac are you talking about and how much does that cost? Again, I doubt any of the opensource 30B models can compete even with ChatGPT 3.5. Which is the point I started with earlier.

Seems to me like you are riding this whole efficiency thing on nothing more than hopium.

I think at this point we are arguing belief.

I actually work with this stuff daily and there is a number of 30B models that are exceeding chatGPT for specific tasks such as coding or content generation, especially when enhanced with a lora.

airoboros-33b1gpt4-1.4.SuperHOT-8k for example comfortably outputs > 10 tokens/s on a 3090 and beats GPT-3.5 on writing stories, probably because it’s uncensored. It’s also got 8k context instead of 4.

Several recent LLama 2 based models exceed chatgpt on coding and classification tasks and are approaching GPT4 territory. Google bard has already been clobbered into a pulp.

The speed of advances is stunning.

M- architecture macs can run large LLMs via llama.cpp because of unified memory interface - in fact a recent macbook air with 64GB can comfortably run most models just fine. Even notebook AMD GPUs with shared memory have started running generative AI in the last week.

You can follow along at chat.lmsys.org. Open source LLMs are only a few months but have started encroaching on the proprietary leaders who have years of headstart

recent macbook air with 64GB

How much does this cost?

You will answer any and every question but this.

My points still stand

I doubt someone who can’t google the price of macbook air can afford or even operate anything remotely useful in the LLM space.

3 more...
3 more...
3 more...
3 more...
3 more...
3 more...
3 more...
3 more...
3 more...
3 more...
3 more...
3 more...

I’m curious about the development of artificial intelligence in the future, and I’m looking forward to seeing what GPT-5 can do. If it’s a huge leap forward, then I will agree that the future will be very different from what we have now. But if it’s only a slight improvement, like Llama 1 vs Llama 2, then large language models (LLMs) might face the same challenges as self-driving cars. They are somewhat functional, but not reliable enough to let you sleep on your commute, and they won’t be for a long time.
It might be impossible to eliminate all the hallucinations from LLMs, but if the next versions are incredibly useful, then we will learn to live with them. For example, currently 30% of chips fail on a wafer, but we still produce more CPUs and they are groundbreaking technology. But even GPT4+ will have a significant impact on our future, especially in education. Every kid will have an AI in their phone that is ready to answer all their questions with minimal effort. This will greatly enhance the intelligence of future generations and make education accessible to almost everyone on earth at a similar high level. But this will not make us all lose our jobs in 10 years.

This will greatly enhance the intelligence of future generations and make education accessible to almost everyone on earth at a similar high level.

I don't think that accessibility in AI somehow correlates with the intelligence of the subjects using it. It can actually work in the completely opposite way where people blindly trust it or people get used to using it in a degree that they're unable to do anything without the help from the technology. Like people who are unable to navigate 2 blocks from their house if they don't use google maps navigation even though they do the same route every day.

This will greatly enhance the intelligence of future generations and make education accessible to almost everyone on earth at a similar high level.

You mean at the current ChatGPT level? Because I'm unsure if the future versions will be open source or open access, if not surely it will just raise the disparity in education.

Lol. OpenAI haven't made GPT open source since version 2. With that said, their best interests are currently in keeping access public and their name in the headlines. They need an income source.

Kids as well as everyone already have AI on their phones.

We've had it for quite a while now. Even before chatgpt, what question could you not find an answer for?

6 more...

I wonder, could AI actually "collapse"? As in, once companies and people start leaving the AI hype space, could the external input become small enough so that the AI to AI input takes over to such a degree that all trained models become essentially useless?

I find that unlikely. AI is a subject much like space tech. It may not always be the giant it is now but it's a baseline research countries will be conducting. Even if only as a means to defend themselves.

I'm sure challenges like this are coming but I'd be surprised if it causes these applications to collapse completely. However, it may force the ML companies to pay some fees for the training data they use.

Genuine question: How hard is it to fix A.I. Hallucinations?

Very difficult, it's one of those "it's a feature not a bug" things.

By design, our current LLMs hallucinate everything. The secret sauce these big companies add is getting them to hallucinate correct information.

When the models get it right, it's intelligence, when they get it wrong, it's a hallucination.

In order to fix the problem, someone needs to discover an entirely new architecture, which is entirely conceivable, but the timing is unpredictable, as it requires a fundamentally different approach.

I have a weak and high level grasp of how LLMs work, but what you say in this comment doesn't seem correct. No one is really sure why LLMs sometimes make things up, and a corollary of that is that no one knows how difficult (up to impossible) it might be to fix it.

Let me expand a little bit.

Ultimately the models come down to predicting the next token in a sequence. Tokens for a language model can be words, characters, or more frequently, character combinations. For example, the word "Lemmy" would be "lem" + "my".

So let's give our model the prompt "my favorite website is"

It will then predict the most likely token and add it into the input to build together a cohesive answer. This is where the T in GPT comes in, it will output a vector of probabilities.

"My favorite website is"

"My favorite website is "

"My favorite website is lem"

"My favorite website is lemmy"

"My favorite website is lemmy."

"My favorite website is lemmy.org"

Woah what happened there? That's not (currently) a real website. Finding out exactly why the last token was org, which resulted in hallucinating a fictitious website is basically impossible. The model might not have been trained long enough, the model might have been trained too long, there might be insufficient data in the particular token space, there might be polluted training data, etc. These models are massive and so determine why it's incorrect in this case is tough.

But fundamentally, it made up the first half too, we just like the output. Tomorrow some one might register lemmy.org, and now it's not a hallucination anymore.

LLMs only predict the next token. Sometimes those predictions are correct, sometimes they're incorrect. Larger models trained on a greater number of examples make better predictions, but they are always just predictions. This is why incorrect responses often sound plausable even if logically they don't make sense.

Fixing hallucinations is more about decreasing inaccuracies rather than fixing an actual problem with the model itself.

I'm no expert, so take what I'm about to say with a grain of salt.

Fundamentally, a LLM is just a fancy autocomplete; there's no source of knowledge it's tapping into, it's just guessing words (though it is quite good at it). Correspondingly, even if it did have a pool of knowledge, even that can't be perfect, because the truth is never quite so black and white in many areas.

In other words, hard.

Oohh i really love when im listening to music and click on an article and it starts autoplaying it -_-