I have my own backup of the git repo and I downloaded this to compare and make sure it's not some modified (potentially malicious) copy. The most recent commit on my copy of master was dc94882c9062ab88d3d5de35dcb8731111baaea2
(4 commits behind OP's copy). I can verify:
- that the history up to that commit is identical in both copies
- after that commit, OP's copy only has changes to translation files which are functionally insignificant
So this does look to be a legitimate copy of the source code as it appeared on github!
Clarifications:
- This was just a random check, I do not have any reason to be suspicious of OP personally
- I did not check branches other than
master
(yet?) - I did not (and cannot) check the validity of anything beyond the git repo
- You don't have a reason to trust me more than you trust OP... It would be nice if more people independently checked and verified against their own copies.
I will be seeding this for the foreseeable future.
Exactly this. I can't believe how many comments I've read accusing the AI critics of holding back progress with regressive copyright ideas. No, the regressive ideas are already there, codified as law, holding the rest of us back. Holding AI companies accountable for their copyright violations will force them to either push to reform the copyright system completely, or to change their practices for the better (free software, free datasets, non-commercial uses, real non-profit orgs for the advancement of the technology). Either way we have a lot to gain by forcing them to improve the situation. Giving AI companies a free pass on the copyright system will waste what is probably the best opportunity we have ever had to improve the copyright system.