Hello people. I am the person behind the GitHub repos that were originally on the reddit collab tools list, are now on the sidebar, and have been getting a lot of downloads/views. I was running into an issue with saving Voat posts on this subverse since there are so many coming in daily. I noticed that as a community we fell behind on archiving every post to archive.is. My guess is we were only archiving about 30% of them. The top posts were getting archived, but a few diamond-in-the-rough posts fell through the cracks. Another big issue I ran into when trying to retrieve older posts is that the Voat admins recently disabled pagination past page 19. There is a lot of talk about it on /v/voatdev/ and it may get restored. The API is also not ready for production use, so I was not able to get a key. I am also working with one of the people on /v/voatdev/ to get a full backup of the older posts, so that we can be sure 100% of the data is backed up all over the world and on multiple sites.
The bot will go through pages 1-19 of /new every day on a cron job and make a folder for that day, then push to the git repos once done. Every HTML page will be downloaded with wget and saved under its post ID in that day's posts folder. There is also a file called ids.txt in each day's folder that holds the unique post IDs. Each post will also be automatically archived at archive.is through a POST request.
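Roughly, the daily job boils down to something like this (a simplified sketch, not the exact script; the archive.is submit endpoint here is from memory, so double-check it before reusing):

#!/bin/sh
# Make a dated folder like archives/2016-12-12-18:09:01/posts
DAY=$(date +%F-%T)
mkdir -p "archives/$DAY/posts"
# Collect the unique post IDs from pages 1-19 of /v/pizzagate/new
for page in $(seq 1 19); do
    curl -s "https://voat.co/v/pizzagate/new?page=$page" \
        | grep -oE '/v/pizzagate/[0-9]+' | grep -oE '[0-9]+' \
        >> "archives/$DAY/ids.txt"
done
sort -u -o "archives/$DAY/ids.txt" "archives/$DAY/ids.txt"
# Save each post's HTML under its ID and ask archive.is to snapshot it (POST request)
while read -r id; do
    wget -q -O "archives/$DAY/posts/$id.html" "https://voat.co/v/pizzagate/$id"
    curl -s -d "url=https://voat.co/v/pizzagate/$id" https://archive.is/submit/ > /dev/null
done < "archives/$DAY/ids.txt"
# Push the day's folder to the git repos
git add archives && git commit -m "backup $DAY" && git push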
One thing I discovered last week about http://archive.is/https://voat.co/v/pizzagate/* is that they also have pagination issues. If someone could send an email about this issue to [email protected] I would really appreciate it. Make sure to post below that you sent an email so the person does not receive multiple. We should request to be able to view all of them, and point out that 950-1000 is not enough. The good thing, though, is that posts are archived even when they don't show up in the pagination (I checked with a few older posts). As long as we have all the post IDs we can easily backtrack. I am going to try to create a master-post-ids.txt file in the main folder of the repo that will hold every post ID ever posted here. I brought this up just so you are all aware.
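Since every daily folder already has an ids.txt, building that master file should be a one-liner along these lines (assuming the archives/ layout described above):

cat archives/*/ids.txt | sort -u > master-post-ids.txt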
NOTE: PLEASE STILL USE ARCHIVE.IS, BECAUSE WE NEED TO BACK UP POSTS WITH MULTIPLE SCREENSHOTS, SINCE PEOPLE ADD COMMENTS, DELETE COMMENTS, ETC. THE BOT WON'T BE ABLE TO GET THE NEWEST ACTIVITY, SO PLEASE KEEP ARCHIVING WHEN POSTS GET COMMENTS. ALSO KEEP SAVING POSTS LOCALLY. DO NOT RELY ONLY ON ME AND MY BOT.
Here are the repos: https://github.com/pizzascraper/pizzagate-voat-backup https://gitlab.com/pizzascraper/pizzagate-scraper
TO DO: Need to figure out CSS/JS/IMG assets. Viewing the HTML posts locally currently doesn't load any stylesheets/scripts/images, since the URLs in the HTML files are not absolute, so the pages look pretty plain. This is not critical and can always be fixed later; what is important is preserving the data. If you have an idea on how to fix this, please file an issue or comment here. Also, if you have any suggestions or ideas on how to improve this, please let me know. I really appreciate all the help I can get.
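One stopgap I might try is injecting a <base> tag into each saved page so the browser resolves the relative URLs against the live site. Untested sketch:

sed -i 's|<head>|<head><base href="https://voat.co/">|' archives/*/posts/*.html

That would make the saved pages render properly, but the assets would still load from voat.co rather than locally, so it doesn't help if the site itself goes down.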
Can be cloned:
git clone https://github.com/pizzascraper/pizzagate-voat-backup.git
or
git clone https://gitlab.com/pizzascraper/pizzagate-scraper.git
Non-tech users can download it by going to https://github.com/pizzascraper/pizzagate-voat-backup/archive/master.zip.
Pizzafundthrowaway ago
@gittttttttttttttttt @adam_danischewski @pizza_gator @bikergang_accountant @freetibet @totesgoats908234 @Ivegotredditcancer @ParadiseFaIIs @pembo210
Okay, guys. I've tagged everyone who has expressed an interest in, or an ability to contribute to, this effort. My offer of $200 in bitcoin for delivering a solution by consensus stands. Please work together and propose a solution.
In the meantime, I'm hitting a wall with this bitcoin thing. I've looked at bitcoin.org and CORE. I found the download, but I'm having trouble figuring out how to get my hands on a signed copy of the download for the wallet. I am utterly clueless about cryptography. I followed the guides, but for some reason, my stupid brain can't figure out how to validate the signature of any wallet download. Is there a complete idiot's guide for how to do this? I have googled my ass off, but haven't been able to find a simpleton pleb's guide to get started. Sorry for my ignorance, and thanks in advance for any pointers!
pizza_gator ago
Look into using Electrum as your wallet instead of Core. I think it's more straightforward, plus you don't have to wait for the blockchain to sync.
Pizzafundthrowaway ago
I got Electrum, but buying bitcoin anonymously seems virtually impossible. Every exchange has a ridiculous privacy policy. Is there a way to buy bitcoin anonymously?
pizza_gator ago
localbitcoins.com would be the way to go
bikergang_accountant ago
For the wallet just use Electrum. You don't need a full client like Core.
Pizzafundthrowaway ago
I got Electrum. Now how do I buy bitcoin anonymously? The exchanges I found are asking for too much personal information and demand that I allow them to share my info with just about anyone they want.
bikergang_accountant ago
You can clean coins. You can also use LocalBitcoins. They ask for information, but it's optional.
The exchanges that take your information have the lowest fees, and then you can clean the coins in a casino. You just set your risk to almost zero (that's what the casinos are actually for, and they are cryptographically provably fair).
The other factor is that bitcoin is pretty fucking anonymous. The address the exchange pays you with is recycled over and over, and you can set your Electrum client to behave more anonymously.
Let's put it this way: a VPN puts your IP into a certain geography. The FBI could see that 99% of your traffic is going to that geography and a single IP, they could see traffic increasing and decreasing on your end, and they could see similar patterns out of the VPN to certain sites. Bitcoin, even without cleaning, is orders of magnitude more anonymous than a VPN. You could even pay random amounts into change addresses.
Third option: Dash is like bitcoin but designed from the ground up to be anonymous. You could use something like ShapeShift or even one of those exchanges to go bitcoin -> Dash -> bitcoin and it will be clean. Or USD -> Dash -> bitcoin would save you a transaction. Or you could pay someone in Dash.
Pizzafundthrowaway ago
Zip download link is broken
gittttttttttttttttt ago
Hey, thanks. GitHub suspended the repo. A new repo has been created and a second zip URL, on GitLab, has been added to the post. Everything should work now.
Pizzafundthrowaway ago
@kingkongwaswrong @millenial_falcon
Can we get this thread stickied?
gittttttttttttttttt ago
@kingkongwaswrong Sidebar would probably be better
Pizzafundthrowaway ago
As a token of gratitude for your hard work, I'm going to give you $50 USD worth of bitcoin.
To support further development and completion of the project, I'm offering an additional $50 for you (or for a team to split).
Send me a PM for how you would like to receive payment.
gittttttttttttttttt ago
Going to DM you. Thanks kindly.
soundbyyte ago
Would there be a way for you to send me the folders for each day, and then the older posts once you're able to archive those, just so I can skim through and see if anything external needs to be archived? As you mentioned, a lot of the daily posts do fall through the cracks. Plus, the more people who have the data, the more people we can share it with, and the less likely it is that anyone can shut us down.
gittttttttttttttttt ago
Hey, all external links will be archived automatically. Just did 1600 of them, super fast, from today.
soundbyyte ago
Awesome, that's fantastic to hear!
gittttttttttttttttt ago
A new folder just got added about an hour ago: https://github.com/pizzascraper/pizzagate-voat-backup/tree/master/archives/2016-12-12-18:09:01/posts . I will archive the older posts very soon. Are you familiar with git? You could git pull once a day. You could even do it automatically with a cron job like this:
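# Run crontab -e and add a line like this (adjust the path to wherever you cloned the repo):
0 6 * * * cd /path/to/pizzagate-voat-backup && git pull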
I will try to add an external link archive feature soon. Backing up the data would be great. If you are not familiar with git, I recommend watching a YouTube tutorial or two. All you really need to learn is git clone and git pull.
Thanks for wanting to help!
8_billion_eaters ago
And apparently an upvoat bot, too.
gittttttttttttttttt ago
Lol no I didn't. The post did get a lot of upvotes though.
MAGAphobia ago
Black people could never do this.
VictorSteinerDavion ago
This is awesome work, thank you for the valuable effort!
IWishIWasFoxMulder ago
Everything you've just done is going to make it at least 100 times more difficult to take down this site and this subverse. You are weaponized autism at its finest and you inspire me to be a better autist, so thank you. This is what web and Silicon Valley people mean when they talk about redundancy, in the truest sense. Is there any way for you to back up videos? I'm trying to figure out a way we could back up James Alefantis' short film from Sundance that appears to be on Vimeo at the moment.
gittttttttttttttttt ago
Haha thanks :)
The best way to back up videos, and I encourage you to do so, is to use a command line tool called youtube-dl. There are ways to download many videos by specifying a keyword (you can check the readme for all the options), and it can also easily download from Vimeo, etc. If you have bandwidth and extra space on your machine then this would be a great idea. Then what I would recommend is finding some smaller video sites with less security/bot checking built in and figuring out how to send POST requests, or creating a bot/macro to upload all the videos as mirrors. If this sounds doable to you and you just need a little help to get it going then DM me.
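For example, something like this (off the top of my head, so double-check the readme; the Vimeo URL is just a placeholder):

# Download a single video (works for Vimeo, YouTube and many other sites):
youtube-dl https://vimeo.com/VIDEO_ID
# Search YouTube by keyword and download the first 20 results:
youtube-dl "ytsearch20:your keyword here"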
derram ago
https://ghostbin.com/paste/xreqq
That's the stuff on the frontpage, just use this and go through the links until you have everything: https://gitgud.io/derram/arcitall
Make sure to copy the output somewhere between runs 'cause it gets overwritten.
PuttItOut ago
I might be able to help you with archiving tasks. Shoot me an email at hello @ voat and I'll see what I can help with.
gittttttttttttttttt ago
Thanks kindly. Just sent you a DM.
Votescam ago
I don't mean my comment to suggest that you contributed to the current system -- which I think was very well intended, but somewhat confusing. Thank you for whatever you've done to improve things here.
A few questions about what is meant exactly by "bots": are you saying that there are no humans involved in posting for CTR? They've allegedly got $1 million for their propaganda, but they don't actually want to pay a living person with an actual brain to post for them? Are they as obsessed with money as we think they are?
Why all the gimmickry here with upvotes and downvotes anyway, which you have to earn elsewhere... because someone on the website wanted to encourage people "to post"? People post when the articles are interesting and when they have a valid comment. I don't think you want comments just for the sake of comment. Why would you? Drop the gimmickry and show the names of people upvoting -- and when it comes to downvotes, insist that they be accompanied by a human/intellectual comment. In other words, steal some good ideas on this from other websites. :)
Julian55 ago
Dude, seriously, thank you.
SaneGoatiSwear ago
y'all need to meet @derram yo bro pizzagate's made a backup bot and auto-archiver!
y'all should be voat goat buddies!
then some other goat can draw you guys all working hard at investigating and preserving the data well and condensing it into awesome meme magic to bring the truth to the world!
justiceforever ago
YOU DA REAL MVP
Pizzatemp420 ago
Wow, thank you so much for your work!
Edit: Removed my previous comment because after finally getting home and being able to read the entire post, I realized what the bot's limitations were, and my post became completely irrelevant.
hedy ago
This is amazing work. Highly appreciated.
wecanhelp ago
Thank you so much for your work, this project is a huge relief for the community.
As for the assets: Voat seems to be using relative URLs. So when you archive a given page, could you parse (or grep) the HTML for <script /> and <link rel="stylesheet" /> tags pointing to the static assets requested by the page, and make an up-to-date copy of those assets every day, maintaining the folder structure as found in the src/href attributes? That way, when opening one of the .html files locally, the browser would look up the appropriate local copies of the scripts and stylesheets, and load them.
I'm sure this is overly simplistic, and problems will arise as you go, specifically with assets that are loaded on the fly. But do you see a problem with the initial logic?
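A rough sketch of what I have in mind (untested, and assumes the asset paths are root-relative):

# Collect every root-relative script/style/image path referenced by the
# saved pages, then mirror each one locally with the same folder structure.
grep -ohE '(src|href)="/[^"]+\.(js|css|png|jpg|gif)"' posts/*.html \
    | sed -E 's/^(src|href)="//; s/"$//' | sort -u \
    | while read -r path; do
        mkdir -p ".$(dirname "$path")"
        wget -q -O ".$path" "https://voat.co$path"
    done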
Edit: Have you tried
wget -r -p -nc "https://voat.co/v/pizzagate/new?page="{1..19}
? Theoretically, this should download each page with all of its assets, and prevent wget from overwriting a file that already exists. Now, there seems to be a question as to whether wget will actually skip the superfluous HTTP request when a given file already exists, or carry out the request regardless and simply discard the duplicate. The latter behavior would, of course, result in a lot of unwanted traffic, but if the former is the case then this could be a good starting point.
gittttttttttttttttt ago
Thanks for the pointers. Will give this a go in a little and test it out.
Sonic_fan1 ago
If anyone wants to try this on their end, another interesting one to try is WebHTTrack... it can be set to recursively follow links, it'll handle CSS and all that (at least, it used to... haven't used it for a while), and it can be set to follow links however many levels deep you want. I've used it to fully download a friend's website, and it'll even change links from absolute (http://me.com/img/1.jpg) to relative to the folder structure (foldername/img/1.jpg), and it'll give you a browsable site. But what you have now is awesome! If you don't have to change it, don't. Everyone is right, having any sort of complete archive of all this is the biggest thing, even if someone who looks at it has to wade through a little HTML. Thumbs up
Also, I don't know if it's possible... maybe have the bot just sit and monitor the site for any time something changes, because if the site has everything after page 19 disabled and we have a busy day around here, important stuff might get bumped off by the newer stuff. Maybe have the bot compare the front page of 'New' to the last archived page of 'New' (dates, or maybe thread titles) and if there's any difference, have it just slurp down the newest posts or pages or something.
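Something like this could run every few minutes as a crude change detector (just an idea, untested; run_backup.sh stands in for whatever the scrape script is actually called):

# Fetch the current front page of /new and compare it to the last snapshot;
# if anything changed, kick off a fresh scrape.
curl -s "https://voat.co/v/pizzagate/new" > new_now.html
if ! cmp -s new_now.html new_last.html; then
    ./run_backup.sh
fi
mv new_now.html new_last.html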
And, this would be a great use of that old DLT4 tape drive someone has sitting around (which reminds me, I should get that tower from ma's at some point)... 800gigs on a single tape, as long as I could get Windows to recognize it... nightly or weekly backups of the GitHub. And, I know it's possible to get a tapedrive working under Win10 (I have a Travan 10/20 that works for backup, uses ZDatDump Free... software limits to 12gigs backed up without paying, but it works).
iceboob ago
Nice work, but I think it would be easier to have the bot back up each post to archive.is. They'll take care of the assets and you can build a database of voat posts -> archive.is links. Just my 2 cents.
wecanhelp ago
He's already POSTing the Voat URLs to archive.is for automatic backup as a "side effect", but for redundancy it is important that we don't rely only on archive.is. If it gets compromised and we only have a database table with Voat posts pointing to archive.is links then we're screwed. That's why local (and redundant) copies are key.
CrackerJacks ago
@gittttttttttttttttt Will your new thing automatically back up everything in this sub > https://voat.co/v/pizzagate/1459779 ? Just asking as some of it may be important later.
Edit: Thanks for what you've done.
gittttttttttttttttt ago
No it won't. Will try adding something like that in the future. Keep archiving all the external links.
gittttttttttttttttt ago
Hey, all external links will be backed up now.
Htaed ago
Very nice.
PizzaGate-Is-Real ago
Archive of this Voat post and the GitHub repository landing page for good measure:
http://archive.is/https://voat.co/v/pizzagate/1479493
http://archive.is/5SgDJ
RedGreenAlliance ago
Very diligent, great to see people use specialised skills for the cause. Hat is duly tipped
THE_LIES_OH_THE_LIES ago
Awesome. Thank you so much.
VieBleu ago
Thank you thank you thank you.
One thing - I have a Skillz page, limited though it is, and I add to it all the time. For extreme newbies and others, could you please explain, step by step, exactly how to archive a post? I will put it on the page. Also, I honestly don't know how myself. Thanks.
gittttttttttttttttt ago
1. Copy the URL you want to back up.
2. Go to archive.is.
3. Paste the URL and press "save the page".
4. It is then archived online at archive.is.
5. You can also save it locally by pressing Ctrl+S, or by clicking the screenshot tab, right-clicking the image, and saving the image.
VieBleu ago
Why are we so sure of archive.is? Surely it can be compromised?
gittttttttttttttttt ago
It can be, you are absolutely right. We aren't 100% sure and never can be. That is why this bot is backing up locally too, and there are git distributions all over. Anything you back up to archive.is you should also save locally. We know archive.org is compromised; archive.is has been good to us so far.
disclosuretimes ago
Thank you so much!
gittttttttttttttttt ago
I am glad I can help. Thanks for contributing to the sub.
(note to self - check this person's posts for Ted Gunderson data)
disclosuretimes ago
I'll do some research into the CF after my exams. We are going to reveal this.
jbooba ago
Thank you!
gittttttttttttttttt ago
Sure thing. Thanks for your work and posts. Please keep contributing to the sub and I will do my best coding/tech-wise to help out. We all have different skills, so it's important we utilize them to help save the children!
stunknife ago
This is HIGHLY appreciated. I've also noticed many good threads that weren't archived got buried. This definitely helps in case threads start disappearing on here.
gittttttttttttttttt ago
Yes, agreed. There is so much data it's going to take a LONG TIME for us all to go through it. Preserving it now is crucial. We need to make sure we keep checking the new section so stuff is less likely to slip through.
Normality1 ago
THANK YOU for your work!
gittttttttttttttttt ago
Sure thing. I try to contribute where I can. Thank you! I really appreciate all of you on here doing research/upvoting/downvoting etc. It is hard work.
Mooka_Molaka ago
Wow, Thank You so much for all of your work on this! I wish I knew how to do things like this, or even assist somehow.
I'm going to share something with you & if it's dumb/doesn't apply please let me know & I'll delete it or update it with correct info. So here goes ~
Back in the first week of October 2014, #GamerGate was about 5-6 weeks old. Much of the same gusto, comradeship & determination for research & evidence was flowing through us like we have here.
I can't remember who it was atm, but he had been putting together a massive amount of research & notes etc. on GitHub to make it that much easier to access info, add new stuff etc. It was great! Until Jake Boxer heard about it. Let me first say that there was NO dox(x) information on it, nor ANY type of "attack plans", nor anything encouraging violence. In fact there was absolutely NOTHING that broke any TOS or site rules, but most SJWs will bend over backwards to do some serious white knighting & scream to their social circles & networks about what wonderful people they are as they fuck over anyone who disagrees with them. They will treat us like "human garbage", target us, label us all the usual #RACIST! #SEXIST! #MISOGYNIST! etc., etc. Whatever it takes to be Top Virtue Signaler of the Week!
Ok, sorry. I didn't mean to write so much & go off about those tools. I guess I'm still burnt & disgusted by them! But anyway, the important reason that little history lesson re: #GamerGate matters is that OUR ENTIRE GITHUB WAS DELETED WITHOUT WARNING! By a virtue signaling white knight extraordinaire.
https://haegarr.wordpress.com/2014/10/04/github-deletes-repo-because-he-personally-doesnt-like-it/
I would hate to see something like this happen again, and I worry you might lose all of the hard work you put into this. I don't have a plan or an answer for how to know if it's coming or what to do if it happens; I just saw your post & felt it was important to let you know what has happened before, when certain people aren't happy with your personal opinions & may just trash your work without a care & without warning or a way for you to have a backup made.
I hope I'm wrong & there won't be any issues. But just in case I wanted you & the rest of us who care about #PizzaGate to be aware of the possibility.
💖God Bless You & Thank You 💖 for all of your efforts towards exposing the heinous crimes that are #PizzaGate
Here are a few more write-ups about the entire #GG GitHub being deleted (this isn't my comment permalink from KiA, but I felt that just copypasta-ing each of the links would be like taking credit for their post ~ I hope that's ok ^_^):
https://www.reddit.com/r/KotakuInAction/comments/3fq180/github_history_one_tweet_one_lie_and_a_gamergate/ctr8j22/
http://adland.tv/adnews/gamergate-op-deleted-github-official-reply-why/1331743980
http://gamergate.wikia.com/wiki/Github
https://gitgud.net/gamergate/gamergateop/tree/master/Current-Happenings#-oct-3rd-friday
https://archive.is/uypn2 (Pipedot.org thread)
http://facepunch.com/showthread.php?t=1421478&p=46148933&viewfull=1#post46148933
http://theralphretort.com/github-censors-gamergate/ (yea it's Ralph but it's relevant)
http://i.imgur.com/DeNiCiO.png (Jake Boxer tweets)
http://www.reddit.com/r/KotakuInAction/comments/2i8jzr/the_gamergate_github_was_deleted_due_to_a_github/
https://archive.is/QkyiB
http://www.reddit.com/r/KotakuInAction/comments/2i85z1/update_what_just_happened_to_the_github_repository/
http://www.reddit.com/r/KotakuInAction/comments/2i8wra/github_confirms_that_its_deleted_the_gamergate/
Also there's still the case of an alleged GitHub employee trying to snoop around using the email address of a user who tried to contact them.
https://www.reddit.com/r/KotakuInAction/comments/2i8zwe/after_emailing_github_this_guy_got_his_email/
http://www.twitlonger.com/show/n_1sce3fa
https://archive.is/uaaYR
Another mature GitHub employee: https://archive.is/FDDWi
gittttttttttttttttt ago
Thanks for the heads up. I know all about the SJW problem at GitHub. I am just using GitHub because it gets the most traffic and ranks very well in search engines. I have the repo on several other providers so we don't have a single point of failure with GitHub.
Mooka_Molaka ago
Excellent ^_^
Wellwerefucked ago
It's massively appreciated.