ORKUT [text Only] Dataset, Created From Internet Archive Raw Data

So guys, Im still uploading, about 150GB, about 1.1 billion replies, most from Brazil users (pt-br)

Also give a look at https://github.com/rodrigosf672/orkut-pydataglobal2025 and https://snap.stanford.edu/data/com-Orkut.html

So this one is just raw data, for now, I will later do ML analysis on this, if anyone want to write a paper together about it DM me.

Anyway on HF SalatielJordao/orkut-communities

submitted by /u/Grand-Prize1371
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *