Reddit signs $60M contract allowing AI company to train its models on the social media platform's content

minnix@lemux.minnix.dev to Technology@beehaw.org – 260 points –
reuters.com
106

You are viewing a single comment

Trained on 99% reposts

And the outputs of bots. There has been a shocking increase in auto-generated comments on reddit in the past years and it's turning the training data into a minefield.

Haven't touched reddit socially in 8 months, but every now and then I'll use it to search for opinions or instructions on things. Searched "reddit best domain registrar" recently and landed on a thread where top to bottom, every comment recommending a registrar was from a bot and/or banned account. No real person testimonials, all ads. And as AI implementations improve, that's going to get harder to spot. In the meantime, I'm formatting searches like "best domain registrar lemmy" because reddit is legit that bad rn.