Page 1 of 1

Can someone ask Putt to handover the Voat Site & DB?

Posted: Wed Dec 23, 2020 7:56 pm
by TheDonaldTrump
So that any other enterprising Goat can continue Voat.co under another domain name.

There was a torrent site called Nyaa.si and someone else could resurrect Nyaa.si since he had a backup of the site,database.

Not really a tech guy but I can help wherever need and also I am not asking myself as I'm not necessarily the most popular user(name).

Re: Can someone ask Putt to handover the Voat Site & DB?

Posted: Thu Dec 24, 2020 7:50 pm
by ethan123
Archiveteam is working on scraping Voat ([here](https://archiveteam.org/index.php?title=Voat), [here](https://github.com/archiveteam/voat-grab), and [here](https://tracker.archiveteam.org/voat/#show-all))

(The "warrior" instructions won't work, you'll have to download the Github repo and run the scripts as instructed or use a precompiled Docker image)

Re: Can someone ask Putt to handover the Voat Site & DB?

Posted: Fri Dec 25, 2020 4:22 am
by Germ22
did i read that right, there is already 270 Gb of data being scraped by one of those teams? and once it's done, then what?

Re: Can someone ask Putt to handover the Voat Site & DB?

Posted: Fri Dec 25, 2020 9:36 am
by TheDonaldTrump
ethan123 wrote: Thu Dec 24, 2020 7:50 pm Archiveteam is working on scraping Voat ([here](https://archiveteam.org/index.php?title=Voat), [here](https://github.com/archiveteam/voat-grab), and [here](https://tracker.archiveteam.org/voat/#show-all))

(The "warrior" instructions won't work, you'll have to download the Github repo and run the scripts as instructed or use a precompiled Docker image)
Great I hope they succeed despite the timeouts.

Still have my fingers crossed that Putt leaves behind the DB for us to begin afresh.

Re: Can someone ask Putt to handover the Voat Site & DB?

Posted: Fri Dec 25, 2020 4:51 pm
by ethan123
Germ22 wrote: Fri Dec 25, 2020 4:22 am did i read that right, there is already 270 Gb of data being scraped by one of those teams? and once it's done, then what?
Archives (as a WARC file (https://en.wikipedia.org/wiki/Web_ARChive)) get ingested into archive.org: https://archive.org/details/archiveteam_voat. Then they will be searchable on the Wayback Machine eventually.

Yes, that is (now) 290 GB of compressed data. Uncompressed is likely much more