crawler

Recursively crawl links from a given webpage in a breadth-first (BFS) manner.

An UPPER_LIMIT can be set on the number of links to be crawled.
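For orientation, here is a minimal sketch of what such a BFS crawl loop can look like, using only the Python standard library. The class name LinkParser, the 10-second timeout, and the error handling are illustrative assumptions, not the repository's actual code:

import sys
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href values of all <a> tags on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, upper_limit=None):
    """Visit pages breadth-first; stop once upper_limit links are collected."""
    queue = deque([start_url])
    seen = {start_url}
    while queue:
        url = queue.popleft()
        try:
            with urlopen(url, timeout=10) as resp:
                page = resp.read().decode("utf-8", errors="replace")
        except Exception:
            continue  # skip pages that fail to load
        parser = LinkParser()
        parser.feed(page)
        for href in parser.links:
            if upper_limit is not None and len(seen) >= upper_limit:
                return seen
            link = urljoin(url, href)  # resolve relative URLs
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return seen

The FIFO queue (deque) is what makes the traversal breadth-first, and the seen set both deduplicates links and enforces the upper limit.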

USAGE:

Download the project zip from this page and extract it. Run the link crawler with:

python3 LinkCrawler.py URL UPPER_LIMIT

EXAMPLE:

python3 LinkCrawler.py https://google.com 100

NOTE:

Currently implemented in Python 3 only.

If no UPPER_LIMIT is specified, the program will run indefinitely until stopped manually.

The output is saved to a file named output.txt; its path is displayed when the script runs.
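Taken together, the notes above suggest entry-point logic along these lines. This is only a sketch, assuming the UPPER_LIMIT argument is optional and output.txt is written to the current working directory; crawl() refers to the sketch shown earlier:

import os
import sys

# crawl() is the function from the BFS sketch above.

if __name__ == "__main__":
    if len(sys.argv) < 2:
        sys.exit("usage: python3 LinkCrawler.py URL [UPPER_LIMIT]")
    start_url = sys.argv[1]
    # No UPPER_LIMIT argument means no cap: crawl until stopped manually.
    limit = int(sys.argv[2]) if len(sys.argv) > 2 else None

    links = crawl(start_url, limit)

    out_path = os.path.abspath("output.txt")
    with open(out_path, "w") as f:
        f.write("\n".join(sorted(links)))
    print("Output saved to:", out_path)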
