Any tool for easy access of reddit export data?

Crul@lemm.ee to Reddit@lemmy.world – 10 points –

Hi, I requested reddit for my data and I got 16Mb of CSVs... which is a considerable amount. Do anyone know of any tool to process / visualize / search ... the data. I asume the format is the same for everyone, so maybe someone has already built something like that.

EDIT: the problem is not performance, with files <5Mb I can search with notepad++ in miliseconds. What I'm looking for is a user friendly interface (ideally with thumbnail images, links and such).

The problem with searching for "reddit export data visualizer" is that Google shows posts from reddit about visualization of generic data.

Thanks.

10

You are viewing a single comment

Power Query can search line by line without loading a file much bigger than your RAM.

The links you posted are weird:

  • https://pixeldrain.com/u/KfgV7bqn: It offers to download a file with the name Antimutt in r-Excel ultra.paq8o which I have no idea what is for.

  • https://the-eye.eu/redarcs: It says "This Reddit Community Has Been Archived"

The first is the result when I extracted all lines with my nick in them from the csv, stored with the best compression around. The second is where to get the csv - and a lot of communities have been archived there, like it says.

Just to confirm I understand: you are talking about Power Query VS Power Bi for dealing with huge datasets, right?

Because, in my case, with 16Mb, I don't see the need for anything specially powerful. My problem is not performace, but convenience.

Thanks for the input.

Power Query is a component of Excel and Power BI.