BBC will block ChatGPT AI from scraping its content

L4sBot@lemmy.worldmod to Technology@lemmy.world – 502 points –
BBC Will Block ChatGPT AI From Scraping Its Content
deadline.com

BBC will block ChatGPT AI from scraping its content::ChatGPT will be blocked by the BBC from scraping content in a move to protect copyrighted material.

71

You are viewing a single comment

More data fixes that flaw, not less.

It is not "a flaw", it is the way language learning models work. They try to replicate how humans write by guessing based on a language model. It has no knowledge of what is a fact or not, and that is why using LLMs to do research or use them as a search engine is both stupid and dangerous

How would it hallucinate information from an article you gave it. I haven't seen it make up information by summarizing text yet. I have seen it happen when I ask it random questions

It does not hallucinate, it guesses based on the model to make you think the text could be written by a human. Personal experience when I ask into summarize a text. It has errors in it, and sometimes it adds stuff to it. Same if you for instance ask it to make an alphabetic a list of X numbers of items. It may add random items.

I've had it make up things if I ask it for a list of say 5 things but there's only 4 things worth listing. I haven't seen it stray from summarizing something I've fed it though. If its giving text, its been pretty accurate. Only gets funky when you ask it things where information isn't available. Then it goes with what you probably want

Not too long ago, ChatGPT didn't know what year it is. You're telling me it needs more data than it already has to figure out the current year? I like AI for certain things (mostly some programming/scripting stuff) but you definitely don't need it to read the news.

Yes. The LLM doesn't know what year it currently is, it needs to get that info from a service and then answer.

It's a Large Language Model. Not an actual sentient being.

That's a fucking lame excuse. AI is not reliable, and you definitely shouldn't use it to get your news.

It's not an excuse, relax, it's just how it works and I don't see where I'm endorsing it to get your news.

It's not more data, the underlying architecture isn't designed for handling facts

1 more...