Internet Archive update.

TheOne@lemmy.blahaj.zone to Technology@lemmy.world – 513 points

I'm curious whether this was some kind of DDoS, or whether someone was trying to use it to train AI and was pulling everything to make a local copy.

About the only way to ensure you're not training AI on bots is to use "old internet" when bots were more obvious.

Someone already announced on X that they took the archive down because "NATO bad".

Yeah, but that's the same as someone writing on a bathroom stall at this point.

use "old internet" when bots were more obvious

This is going to become a valued commodity like pre-atomic low background steel, isn't it?

And it's mostly all in one place...

I never realized how valuable that data was

It already is, as far as I know. I've heard before that ChatGPT is trained strictly on data from before 2018 or so, for this reason.

Nice theory, but it was some political bullshit group who did it for some bullshit reason. Odds are it'll turn out to be some Russian-funded shit.

Right, but they've achieved absolutely nothing other than being mildly annoying. Doesn't really seem like it would be worth funding.

That’s not entirely correct: One achievement was me donating money to the IA after this nonsense 😁