Reddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments

Blaze@lemmy.blahaj.zone to Reddit@lemmy.world – 320 points –
theluddite.org

cross-posted from: https://lemmy.ca/post/19946388

An anticapitalist tech blog. Embrace the technology that liberates us. Smash that which does not.

I think I have about 4000 comments on reddit. I stopped using reddit in the summer of last year when they pushed their fucking API changes; I've been on Lemmy since and never looked back. However, I still have the account, because I sometimes had really nice conversations that I'd like to look up once in a while, or threads I wanted to keep for later, basically like bookmarks. I'm also one of those people who sometimes write a lot: walls of text that took real effort. It would be sad to see it all go away. Then again, fuck reddirt and its management.

Is there a tool to back up my comments (or also the corresponding threads)? After that I'll gladly use the tool provided by luddite.

You can request your data from Reddit and they'll send you a CSV file of all your activity. Takes a couple weeks though.

You can request to download your data from reddit, and they'll provide it to you. I did that and made my comments available on github.

I did that and made my comments available on github

How? I've been looking for a way to host my data elsewhere.

I found this website https://www.rareddit.com, but I'm not sure how to do that, and I contacted the author and didn't get a response.

Instructions for downloading data are here:

https://support.reddithelp.com/hc/en-us/articles/360043048352-How-do-I-request-a-copy-of-my-Reddit-data-and-information

Submit form here:

https://www.reddit.com/settings/data-request

Then host the data wherever you like (preferably somewhere it will show up when searched).

Then replace every comment/post with instructions on how to find that data.

Example of redacted post:

https://www.reddit.com/r/paradoxplaza/comments/126ka7a/paradox_wants_to_shut_down_development_studios_in/

Results from search:

https://duckduckgo.com/?q=reddit-u-iceblade02+github

Destination:

https://github.com/Iceblade02/reddit-u-iceblade02?tab=readme-ov-file#reddit-u-iceblade02
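
For the hosting step, here is a rough sketch of turning the comments CSV from the data export into a single Markdown file that you could push to a repo. The column names (`date`, `subreddit`, `permalink`, `body`) and the function name `comments_to_markdown` are assumptions for illustration, not something the thread confirms; check the header row of your own export and adjust.

```python
import csv
import io


def comments_to_markdown(csv_text: str) -> str:
    """Render the text of a comments CSV export as one Markdown document."""
    # newline="" lets the csv module handle line breaks inside quoted comment bodies.
    reader = csv.DictReader(io.StringIO(csv_text, newline=""))
    sections = []
    for row in reader:
        # Assumed column names; rename them to match the header of your export.
        date = row.get("date", "")
        subreddit = row.get("subreddit", "")
        permalink = row.get("permalink", "")
        body = row.get("body", "")
        heading = f"## r/{subreddit}, {date}"
        sections.append(f"{heading}\n\n{body}\n\nOriginal: {permalink}\n")
    return "\n".join(sections)
```

To use it, read your export file into a string (for example `open("comments.csv", newline="", encoding="utf-8").read()`, where the filename is also an assumption), pass it to `comments_to_markdown`, and write the result to something like `README.md` in the repo you want to be found by search.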

The destination part is the issue. That github link works very poorly. The rareddit example is much better.

The rareddit example is much better.

I'll admit rareddit looks nicer and is more convenient for the user - but it doesn't seem like an option, since (as you said) the author isn't responding.

My data is off reddit (most important part) and findable (bonus).