A simple internal link crawler written in Go.
This bot scrapes the DOM for all <a> tags, follows each URL, and logs a line to a file with the URL and HTTP status code.
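
The scrape, follow, and log loop described above could look roughly like the sketch below. It is not the project's actual code: to stay runnable without extra modules it uses only the Go standard library instead of colly, and it finds links with a deliberately naive regular expression rather than a real HTML parser. The helper names (extractLinks, resolveInternal, crawl), the log file name crawl.log, and the "<status> <url>" line format are all assumptions made for illustration.

```go
package main

import (
	"flag"
	"fmt"
	"io"
	"net/http"
	"net/url"
	"os"
	"regexp"
)

// hrefRe is a naive matcher for href attributes on <a> tags (an assumption
// for this sketch; colly or an HTML parser would be more robust).
var hrefRe = regexp.MustCompile(`(?i)<a\s[^>]*?href\s*=\s*["']([^"']+)["']`)

// extractLinks returns the raw href values found in an HTML document.
func extractLinks(html string) []string {
	var links []string
	for _, m := range hrefRe.FindAllStringSubmatch(html, -1) {
		links = append(links, m[1])
	}
	return links
}

// resolveInternal resolves href against the page it was found on and reports
// whether the result is an http(s) URL on the same host. Fragments are dropped.
func resolveInternal(page, href string) (string, bool) {
	base, err := url.Parse(page)
	if err != nil {
		return "", false
	}
	ref, err := url.Parse(href)
	if err != nil {
		return "", false
	}
	abs := base.ResolveReference(ref)
	if (abs.Scheme != "http" && abs.Scheme != "https") || abs.Hostname() != base.Hostname() {
		return "", false
	}
	abs.Fragment = ""
	return abs.String(), true
}

// crawl visits start and every internal page reachable from it once,
// writing one "<status> <url>" line per page to out.
func crawl(start string, out io.Writer) {
	seen := map[string]bool{start: true}
	queue := []string{start}
	for len(queue) > 0 {
		page := queue[0]
		queue = queue[1:]
		resp, err := http.Get(page)
		if err != nil {
			fmt.Fprintf(out, "ERR %s %v\n", page, err)
			continue
		}
		body, _ := io.ReadAll(resp.Body)
		resp.Body.Close()
		fmt.Fprintf(out, "%d %s\n", resp.StatusCode, page)
		for _, href := range extractLinks(string(body)) {
			if link, ok := resolveInternal(page, href); ok && !seen[link] {
				seen[link] = true
				queue = append(queue, link)
			}
		}
	}
}

func main() {
	domain := flag.String("domain", "", "domain to crawl, without the https:// prefix")
	flag.Parse()
	if *domain == "" {
		fmt.Println("usage: crawler -domain example.com")
		return
	}
	logFile, err := os.Create("crawl.log") // hypothetical log file name
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	crawl("https://"+*domain, logFile)
	logFile.Close()
}
```

The seen map keeps each page from being requested more than once, and the host comparison in resolveInternal is what restricts the crawl to internal links.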
- gocolly/colly scraping library
- Zero-dependency executable (thanks Go!)
- Download the executable for your platform (macOS, Windows, Linux)
- On macOS or Linux, execute the file with the -domain flag to crawl a website (omit the https:// prefix):
./crawler -domain example.com
- On Windows, execute the file with the -domain flag to crawl a website (omit the https:// prefix):
./crawler.exe -domain example.com
- Feel free to raise an issue or submit a pull request.