Wayback Machine Downloader

Download any website from the Internet Archive Wayback Machine.

Installation

You need Ruby (>= 1.9.2) installed on your system. Then run:

gem install wayback_machine_downloader

Tip: If you run into permission errors, you might have to add sudo in front of this command.

Basic Usage

Run wayback_machine_downloader with the base URL of the website you want to retrieve as a parameter (e.g., http://example.com):

wayback_machine_downloader http://example.com

How it works

It downloads the latest version of every file present on the Wayback Machine to websites/example.com/. It also re-creates the directory structure and auto-creates index.html pages so the result works seamlessly with Apache and Nginx. All downloaded files are the original ones, not the Wayback Machine's rewritten versions, so URLs and link structure are the same as before.
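As a rough illustration of the directory layout described above, here is a minimal sketch of how an archived file URL could map to a local path. This is not the gem's actual code: the helper name local_path_for, the backup_root default, and the rule that a last path segment without a dot counts as a directory are all assumptions made for this example.

```ruby
require "uri"

# Hypothetical helper (not part of the gem): map an archived file URL to a
# local path under websites/<host>/, adding index.html for directory-like URLs.
def local_path_for(file_url, backup_root = "websites")
  uri = URI.parse(file_url)
  path = uri.path.to_s

  # Assumption: an empty path, a trailing slash, or a last segment without
  # a dot is treated as a directory and gets an index.html page.
  last_segment = path.split("/").last.to_s
  if path.empty? || path.end_with?("/") || !last_segment.include?(".")
    path = File.join(path, "index.html")
  end

  File.join(backup_root, uri.host, path)
end

puts local_path_for("http://example.com/")                # websites/example.com/index.html
puts local_path_for("http://example.com/docs/manual.pdf") # websites/example.com/docs/manual.pdf
```

Giving every directory-like URL its own index.html is what would let a static server such as Apache or Nginx serve the backup without rewrite rules.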

Optional Timestamp

You may want to supply a specific timestamp to lock your backup to an older version of the website. The timestamp can be found inside the URLs of the regular Wayback Machine website (e.g., http://web.archive.org/web/20060716231334/http://example.com). Wayback Machine Downloader will then fetch only file versions on or prior to the specified timestamp:

wayback_machine_downloader http://example.com --timestamp 20060716231334
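
The "on or prior to the timestamp" rule can be pictured with a small sketch. It assumes snapshots arrive as [timestamp, url] pairs with 14-digit timestamps, so plain string comparison orders them correctly. The method name latest_snapshots and the data shape are hypothetical and not taken from the gem.

```ruby
# Hypothetical sketch (not the gem's code): keep, for each file URL, the
# newest snapshot whose timestamp is on or before the given limit.
# Assumption: timestamps are 14-digit strings (YYYYMMDDhhmmss), so string
# comparison matches chronological order.
def latest_snapshots(snapshots, limit = nil)
  selected = {}
  snapshots.each do |timestamp, url|
    next if limit && timestamp > limit # newer than the requested timestamp
    current = selected[url]
    selected[url] = timestamp if current.nil? || timestamp > current
  end
  selected
end

snapshots = [
  ["20050101000000", "http://example.com/a"],
  ["20060716231334", "http://example.com/a"],
  ["20100101000000", "http://example.com/a"],
]
p latest_snapshots(snapshots, "20060716231334")
# {"http://example.com/a"=>"20060716231334"}
```

Without a limit, the same loop keeps the most recent snapshot of each file, which matches the default behavior described under How it works.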

Optional Only URL Filter

You may want to retrieve only files of a certain type (e.g., .pdf, .jpg, .wrd...) or files in a specific directory. To do so, supply the --only flag with a string or a regex to limit what Wayback Machine Downloader will download:

wayback_machine_downloader http://example.com --only \.pdf
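
To make the "string or regex" distinction concrete, here is a minimal sketch of one way such a filter could be applied to file URLs. How the gem itself interprets the --only value is not shown in this README, so the helper matches_only_filter? and its rules (substring match for a String, pattern match for a Regexp, keep everything when no filter is given) are assumptions for illustration only.

```ruby
# Hypothetical sketch (not the gem's code): decide whether a file URL
# passes an --only style filter.
def matches_only_filter?(url, filter)
  case filter
  when Regexp then !(url =~ filter).nil? # regex: pattern must match
  when String then url.include?(filter)  # string: plain substring match
  else true                              # no filter: keep every file
  end
end

urls = [
  "http://example.com/docs/manual.pdf",
  "http://example.com/pdfs/index.html",
]
p urls.select { |u| matches_only_filter?(u, /\.pdf\z/) }
# ["http://example.com/docs/manual.pdf"]
```

Note that most shells consume an unquoted backslash before the program sees it, so quoting the pattern (for example '\.pdf') passes the backslash through unchanged.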

Contributing

Contributions are welcome! Just submit a pull request via GitHub.

To run the tests:

bundle install
bundle exec rake test
