Data contamination expert 👌

ElCanut@jlai.lu to Lemmy Shitpost@lemmy.world – 1347 points –

I wonder how much these models are now learning from spam they were used to generate

Time to make a lot of wandering dwarf bots on Reddit that post variations of game phrases all over, so the LLM-based bots just spout "Rock and Stone" and "This is my favourite store on the Citadel"?

Thing is, you could use a bot to do nothing but post pop culture references, and it would be indistinguishable from a garden variety Redditor. Reddit is one of the worst places to train an AI.

Johnson! Why the hell is your report the most unintelligible thing I've read since nineteen ninety eight when the Undertaker threw Mankind off Hell in a Cell, and plummeted sixteen feet through an announcer's table.
