GitHub - karim-el-sayed/crawler-user-agents: Lists syntactic patterns of HTTP user-agents used by robots/crawlers/spiders

This repository contains a list of of HTTP user-agents used by robots, crawlers, and spiders. I regularly maintain this list based on my own logs.

If you are using Ruby, Voight-Kampff and isbot provide libraries for accessing this data.

Other systems for spotting robots, crawlers, and spiders that you may want to consider include isBot (Node.JS), Crawler-Detect (PHP), BrowserDetector (PHP), and browscap (JSON files).

License

The list is under a MIT License. The versions prior to Nov 7, 2016 were under a CC-SA license.

Contributing

I do welcome additions contributed as pull requests.

The pull requests should:

contain a single addition
specify a discriminant relevant syntactic fragment (for example "totobot" and not "Mozilla/5 totobot v20131212.alpha1")
contain the pattern (generic regular expression), the discovery date (year/month/day) and the official url of the robot
result in a valid JSON file (don't forget the comma between items)

Example:

{
  "pattern": "rogerbot",
  "addition_date": "2014/02/28",
  "url": "http://moz.com/help/pro/what-is-rogerbot-"
}

--Martin

Name		Name	Last commit message	Last commit date
Latest commit History 128 Commits
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
crawler-user-agents.json		crawler-user-agents.json
package.json		package.json
test_validation.py		test_validation.py
validate.py		validate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

.travis.yml

.travis.yml

LICENSE

LICENSE

README.md

README.md

crawler-user-agents.json

crawler-user-agents.json

package.json

package.json

test_validation.py

test_validation.py

validate.py

validate.py

Repository files navigation

License

Contributing

About

Releases

Packages

Languages

License

karim-el-sayed/crawler-user-agents

Folders and files

Latest commit

History

Repository files navigation

License

Contributing

About

Resources

License

Stars

Watchers

Forks

Languages