Regarding the Wayback Machine

12:50pm, 6th December 2007

I fixed a broken link in one of my more wretchedly trollsome old posts with the help of the magical Internet Archive. I noticed two things while creating a local mirror:

  1. You can’t use wget on the Archive, as the links are left pointing to the old, presumably AWOL, site and then resolved with JavaScript. Will there ever be a JavaScript-aware wget? Unlikely.
  2. Sites can disappear from the Archive, even without the original author requesting their removal. If you lose control of your domain, a new owner can buy it and install an Archive-unfriendly robots.txt file, which will wipe all that domain’s history from the Archive. Keep control of your domains, people.

Leave a comment

You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>


Please enter the following words to prove your humanity: