You are viewing a single comment's thread.

view the rest of the comments →

xeemee ago

archive.org obeys robots.txt - there is nothing nefarious about that and they are transparent about it

what does that mean? it means that if i archive, for example, my own web page and then someone else takes over my domain because i no longer wanted it, or i failed to renew it, or it was taken over by the gubberment, and the new owner disallows the archive.org crawler in the robots.txt file, then that page will no longer be available

archive.is does not obey robots.txt, which is why i prefer it

also, archive.org does not get all of it's funding from Alexa - they get it from the public as well, which anyone can easily see

on a side note, if you are running Firefox and want a really simple way to archive web pages, see Save URL to Wayback Machine - it works with both archive.org and archive.is