Crawlly: Web Page Crawler

Crawlly is a straightforward Python tool for crawling and scraping web pages. It uses the requests library to send a GET request to the specified URL and the BeautifulSoup library to parse the HTML content of the response. The script then extracts every anchor (<a>) tag from the page and prints the URL of each link it finds.
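
The core logic amounts to only a few lines. The sketch below illustrates the approach; the crawl function name, the interactive URL prompt, and the timeout value are illustrative assumptions rather than the exact contents of crawlly.py:

    import requests
    from bs4 import BeautifulSoup

    def crawl(url):
        # Fetch the page; raise an exception on a bad HTTP status.
        response = requests.get(url, timeout=10)
        response.raise_for_status()

        # Parse the HTML and print the href of every anchor tag.
        soup = BeautifulSoup(response.text, "html.parser")
        for anchor in soup.find_all("a", href=True):
            print(anchor["href"])

    if __name__ == "__main__":
        crawl(input("Enter a URL to crawl: "))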

Features:

  • Lightweight and efficient web page crawler.
  • Scrapes and extracts URLs from a provided web page.
  • Utilizes the requests library for HTTP requests and BeautifulSoup for HTML parsing.

Installation:

  1. Clone the repository to your local machine using the following command:

    git clone https://github.com/Toothless5143/Crawlly.git && cd Crawlly
  2. Install the required dependencies by executing the following command (a sketch of a minimal requirements.txt appears after this list):

    pip install -r requirements.txt
  3. Launch the tool by running the following command:

    python3 crawlly.py
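
The requirements.txt installed in step 2 presumably lists just the two libraries mentioned above; a minimal version would look like this (the exact package names and the absence of version pins are assumptions):

    requests
    beautifulsoup4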

License: This tool is open source and available under the MIT License.
