Reddit has reportedly signed over its content to train AI models

return2ozma@lemmy.world to Technology@lemmy.world – 1043 points –
Reddit has reportedly signed over its content to train AI models
mashable.com
202

You are viewing a single comment

And what's to stop instance owners from selling their data?

The only thing stopping them is the fact that anyone who wants the data can just utilize the federation protocol to take any data they want, and there's not a lot anyone can do about it. You can't sell something that's trivial to get for free.

If the question you're really asking is "what's stopping content on Lemmy/Mastodon/etc from being used to train an LLM?" the answer is, nothing.

You can't put a price tag on it. Nothing is stopping anyone from scraping all of the data for free.

mass user exodus to one of the many other identical Instances. Also, data brokers prolly aren't interested in going after each Instance because no one instance has enough data to make it worthwhile. Yet again, the fediverse proves its resistance to enshitification.

Lmao, if it gets as big as Reddit then it's worth scraping. It's not the fediverse making it less worthwhile, just the size.

I wished they had evil lawyers looking after such stuff and sold strictly opt in data to AI corps. Free for FOSS though.

2 more...