Reddit has reportedly signed over its content to train AI models

return2ozma@lemmy.world to Technology@lemmy.world – 1043 points –
mashable.com

This is why they changed their API policy the way they did. They wanted to sell the content rather than let bots scrape it for free.

I don’t think it’s going to be public data alone. I think it’s going to be DMs and chats as well. I wondered why Reddit was suddenly pushing chats so much. Well, it makes sense now.

Yeah. I think there is a kind of power grab under way. Social media companies will try to push the idea that they own the IP rights to the large bodies of text used to train LLMs. That would require producers of LLM software to acquire licensing rights, which will cost many millions. That in turn restricts the free use of LLMs and, more generally, of any AI software that requires training data.

The end result is that as the "means of production" become less based on human work, the "means of generation" and AI will be controlled by the capitalists. If you can turn something into a commodity (like knowledge with patents and IP), you can control it. That leads to a darker timeline.