Robots.txt for LLMs

seasonone@opidea.xyz to Technology@lemmy.world – 21 points –
Robots.txt for LLMs
matt-rickard.com
6

You are viewing a single comment

creators of LLMs would like to know [...] that they haven’t been trained on copyrighted data.

I'm not quite sure about that LOL

Hasn't Google recently announced that the whole internet now belongs to them for the purpose of training their next models?

Hasn’t Google recently announced that the whole internet now belongs to them for the purpose of training their next models?

When did this happen? I mean I am aware of their privacy policy but this??