Is it okay to essentially copy a subreddit to Lemmy?

notexecutive@sh.itjust.works to Asklemmy@lemmy.ml – 1 points –

I want to add a community that is essentially the Lemmy version of an existing subreddit (r/mcmansionhell). Is this allowed?


Fuck yeah it is! Make as many subs as you want baby! This is the Wild West and we have blackjack and hookers! Make a McMansion magazine. Hell, make two!

I've seen it talked about a few times across different platforms (Hacker News, for one), where people have pondered the idea of cloning old posts and keeping the poster's name, but attributing it to a non-existent account. That would act as both a way to populate a community and a way to archive content away from Reddit's control.

I haven't seen any examples of this done yet, not sure if anyone has.

It would be a bit of a project, but the most time-consuming part is already done. /r/datahoarder has a backup of Reddit comment/post history that goes from the beginning of Reddit up to March of 2023 (text only, no media). It's compressed down to about 2TB in size, but it's already in JSON format and anyone can download it. It would just need some work to convert that to a format a fediverse instance could work with and somehow inject it into a new instance.
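To give a rough idea of what that conversion step might look like, here is a minimal sketch in Python. Everything specific in it is an assumption: the input field names (`id`, `author`, `title`, `selftext`, `created_utc`) are the ones commonly seen in Reddit JSON dumps and may not match this archive, the output is a generic ActivityStreams-style `Page` object rather than any instance's confirmed import format, and `example.instance` and the URL layout are made up for illustration. Actually injecting the result into an instance is a separate problem this does not touch.

```python
import json
from datetime import datetime, timezone


def reddit_post_to_page(record: dict, instance: str, community: str) -> dict:
    """Map one archived Reddit post to an ActivityStreams-style "Page" object.

    The input keys and the output URL scheme are assumptions for illustration.
    """
    published = datetime.fromtimestamp(int(record["created_utc"]), tz=timezone.utc)
    return {
        "type": "Page",
        # Hypothetical URL layout; a real instance assigns its own IDs.
        "id": f"https://{instance}/post/{record['id']}",
        # Keeps the original poster's name, pointing at an account that need not exist.
        "attributedTo": f"https://{instance}/u/{record['author']}",
        "audience": f"https://{instance}/c/{community}",
        "name": record["title"],
        "content": record.get("selftext", ""),
        "published": published.isoformat(),
    }


# One made-up line in the shape such a dump might use (one JSON object per line).
line = (
    '{"id": "abc123", "author": "example_user", '
    '"title": "Turrets on a ranch house", "selftext": "Text only.", '
    '"created_utc": 1577836800}'
)
page = reddit_post_to_page(json.loads(line), "example.instance", "mcmansionhell")
print(json.dumps(page, indent=2))
```

The mapping itself is the easy part. Threading comments, handling deleted authors, and getting an instance to accept back-dated content would all need more work.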

Do you mean they have a backup of comment and post history from /r/datahoarder or a backup of literally all reddit?

Literally all of reddit. The entire platform.

Wow. That's really impressive. I really enjoyed that sub.

Also surprising it's only 2TB.

It's JSON so it's only text, and it's compressed, but that's still about 1 trillion characters at 2 bytes each, assuming it's UTF-16. I haven't actually taken a look at the data myself though, I just let my PC contribute to the project.
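The estimate above is just the archive size divided by the bytes per character. A quick sketch of that arithmetic, assuming "2TB" means 2 × 10^12 bytes and treating every character as a fixed width (2 bytes for the UTF-16 guess, 1 byte if it were mostly ASCII stored as UTF-8, which is an alternative assumption and not something stated in the thread):

```python
ARCHIVE_BYTES = 2 * 10**12  # "about 2TB", taken as decimal terabytes (assumption)


def chars_for(total_bytes: int, bytes_per_char: int) -> int:
    """Number of fixed-width characters that fit in total_bytes."""
    return total_bytes // bytes_per_char


utf16_chars = chars_for(ARCHIVE_BYTES, 2)       # the UTF-16 assumption from the comment
utf8_ascii_chars = chars_for(ARCHIVE_BYTES, 1)  # if it were mostly ASCII in UTF-8

print(f"{utf16_chars:,}")       # 1,000,000,000,000
print(f"{utf8_ascii_chars:,}")  # 2,000,000,000,000
```

Both figures count bytes of the compressed archive, so the uncompressed text would be larger still.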

Okay, thanks everyone!

WE'RE IN BABY!!!

So is there a way to view this on kbin or do I always have to view it on sh.itjust.works?

https://kbin.social/m/mcmansionhell@sh.itjust.works

This should work after searching the URL straight from kbin... but searching it right now gives me a 500 Server error. Seems like it's having technical issues.

I've been getting 500s all day, so it's most likely the server not being able to keep up with the Reddit migration.

400s errors are server side. 500s errors are your side.

Other way around. 400s are client errors, 500s are server errors.

You can see the full list of http error codes and descriptions of what they mean here
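For anyone who wants the rule in code form, here is a small Python sketch of the split: 4xx means the client did something wrong, 5xx means the server did. The `error_side` helper is a made-up name for illustration; `HTTPStatus` is from Python's standard library.

```python
from http import HTTPStatus


def error_side(code: int) -> str:
    """Say which side an HTTP status code blames, based on its range."""
    if 400 <= code <= 499:
        return "client"
    if 500 <= code <= 599:
        return "server"
    return "not an error"


for code in (404, 500):
    print(code, HTTPStatus(code).phrase, "->", error_side(code))
```

So the 500 that kbin was returning points at the server, not at whoever was searching.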

Hah, thanks. Am a potato.