forked from s-rah/onionscan
Commit
Adding the structure needed to begin configurable web crawling. Currently there are only two options:

base: configures the base URL, so the scanner ignores all other parts of a site and focuses on a specific set of URLs (e.g. /forums).
exclude: tells the scanner to ignore URLs that contain one or more of the given strings. This allows explicitly ignoring uninteresting URLs (e.g. /profile or /settings) and avoiding URLs that might disrupt the scan (e.g. /logout).

This commit also fixes a bug in the web crawler where the depth parameter was overridden by a constantly updating crawl map.
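The base and exclude options described in this commit message can be illustrated with a minimal Go sketch. This is not the code from the commit: the `CrawlConfig` type, its field names, and the `ShouldCrawl` method are hypothetical, and treating base as a path prefix is an assumption about how "focus on a specific set of URLs" is applied.

```go
package main

import (
	"fmt"
	"strings"
)

// CrawlConfig is a hypothetical type for illustration only. Its fields
// mirror the two option names from the commit message.
type CrawlConfig struct {
	Base    string   // only paths under this prefix are crawled (assumed prefix match)
	Exclude []string // paths containing any of these substrings are skipped
}

// ShouldCrawl reports whether a path passes both the base and exclude checks.
func (c CrawlConfig) ShouldCrawl(path string) bool {
	// An empty Base means "no restriction" in this sketch.
	if c.Base != "" && !strings.HasPrefix(path, c.Base) {
		return false
	}
	for _, s := range c.Exclude {
		// Skip empty strings, which would otherwise match every path.
		if s != "" && strings.Contains(path, s) {
			return false
		}
	}
	return true
}

func main() {
	cfg := CrawlConfig{
		Base:    "/forums",
		Exclude: []string{"/logout", "/profile", "/settings"},
	}
	for _, p := range []string{"/forums/thread/1", "/forums/logout", "/about"} {
		fmt.Printf("%-20s crawl=%v\n", p, cfg.ShouldCrawl(p))
	}
}
```

With this configuration, only `/forums/thread/1` passes: `/forums/logout` is rejected by the exclude list and `/about` falls outside the base.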
Showing 4 changed files with 58 additions and 60 deletions.