Robots.txt for LLMs

seasonone@opidea.xyz to Technology@lemmy.world – 21 points –
Robots.txt for LLMs
matt-rickard.com
6

creators of LLMs would like to know [...] that they haven’t been trained on copyrighted data.

I'm not quite sure about that LOL

Hasn't Google recently announced that the whole internet now belongs to them for the purpose of training their next models?

Hasn’t Google recently announced that the whole internet now belongs to them for the purpose of training their next models?

When did this happen? I mean I am aware of their privacy policy but this??

Forget about it. With all these nasty LLM stuff companies take it as granted that they can steal everything and everywhere.