Last active December 20, 2015 11:19
A wget command for mirroring a single domain. It recursively retrieves every image, script, and stylesheet loaded or linked to through HTML and CSS, and renames URLs that end in query strings to *.html files. It can't scrape dynamic requests, such as AJAX. It doesn't necessarily play nicely with Windows and may require some additional URL rewri…
wget -mkpE -e robots=off --no-cookies --no-cache "$1"
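For readers unfamiliar with the bundled short flags, here is a rough sketch of the same call with each option spelled out in its long form. The function name mirror_site and the WGET override variable are assumptions added for illustration and are not part of the gist; --adjust-extension also assumes a reasonably recent wget (older releases called it --html-extension).

```shell
#!/bin/sh
# Hypothetical wrapper (mirror_site is not part of the gist): the same wget
# call as above, with -m, -k, -p, -E and -e written as long options.
mirror_site() {
    if [ "$#" -ne 1 ]; then
        echo "usage: mirror_site URL" >&2
        return 2
    fi
    # WGET is an assumed override hook, e.g. WGET=echo to preview the command
    # without touching the network. It defaults to plain wget.
    "${WGET:-wget}" \
        --mirror \
        --convert-links \
        --page-requisites \
        --adjust-extension \
        --execute robots=off \
        --no-cookies \
        --no-cache \
        "$1"
}
```

Quoting the URL at the call site, as in mirror_site 'http://example.com/?a=1&b=2', keeps the shell from treating ? and & specially. Running it with WGET=echo prints the expanded command instead of fetching anything.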