# sea-spider

A humble SEO spider and link checker

## Usage

### Initial setup

Install the dependencies:

```bash
pip install -r requirements.txt
```

### Config file setup

1. Rename `config-sample.json` to `config.json`.
2. Set the `origin_domain` value in `config.json` to your site:

`config.json`

```json
"origin_domain": "example.com",
```

### Basic example

To check all links on a given web page, run the spider:

```bash
python seaspider.py
```

## Advanced usage

### Domain restriction

Domain restriction is enabled by default. It prevents the spider from crawling URLs outside the configured `origin_domain`.

`config.json`

```json
"allow_outside_starting_domain": false,
"origin_domain": "example.com",
"operation_mode": "domain_scan"
```

### Increasing crawl depth

The `max_crawl_depth` setting controls how many levels of links the spider crawls recursively: it crawls a page, harvests the page's links, crawls each of those links in turn, and repeats until the maximum depth is reached.

⚠ Warning: the number of unique links in a crawl network grows roughly exponentially with crawl depth, so the computational expense of traversing the entire network climbs steeply as `max_crawl_depth` increases. For example, if each page links to 20 new pages, a depth of 5 can reach on the order of 20^5 = 3.2 million URLs.

`config.json`

```json
"max_crawl_depth": 5,
```