cross-posted to: https://lemmy.world/post/2499861
As I said, I made a lossy reformat of the database and a lossless one for 6.0 Gib (6,477,905,920). compared to ~26GIB from Reddit, where fields are almost intentionally anti-compressed to take up more room.
If there is somewhere I can host it, let me know.
also, I couldn’t figure this out, do sqlite databses store any information on the creator or editor of a document?
why it's lossy
It’s missing a large table of base64 urandom technically required to recreate the document fully
here are a few options that I see but never actually use.
Your data don’t seem to be massive compared to the types of data people store on there. So I don’t think it’s gonna be an issue. Plus, if you deposit your data in 1 archivist place + 1 research place, the data may be used by more people. Don’t forget about licenses btw.
EDIT: added https://socialmediaarchive.org/ to the list, just found out about that.