Python 3 script for collecting articles from one of Wikipedia's language-specific subdomains

bornabesic/wikipedia-crawler

Wikipedia Crawler

usage: crawler.py [-h] [-s] time_limit subdomain

positional arguments:
  time_limit     crawling time limit in seconds
  subdomain      crawling subdomain (e.g. en, de, fr)

optional arguments:
  -h, --help     show this help message and exit
  -s, --summary  collect summaries instead of full articles
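
The usage text above can be reproduced with Python's standard `argparse` module. The following is a minimal sketch, not the script's actual source: the `build_parser` helper, the choice of `int` for `time_limit`, and the `base_url` variable are assumptions made for illustration. Only the argument names and help strings come from the usage output.

```python
import argparse


def build_parser():
    # Hypothetical helper; mirrors the usage output shown above.
    parser = argparse.ArgumentParser(prog="crawler.py")
    # Assumption: the time limit is parsed as a whole number of seconds.
    parser.add_argument("time_limit", type=int,
                        help="crawling time limit in seconds")
    parser.add_argument("subdomain",
                        help="crawling subdomain (e.g. en, de, fr)")
    parser.add_argument("-s", "--summary", action="store_true",
                        help="collect summaries instead of full articles")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch
# can be run on its own.
args = build_parser().parse_args(["-s", "60", "de"])

# Assumption: the subdomain selects the language edition to crawl,
# e.g. "de" for https://de.wikipedia.org.
base_url = f"https://{args.subdomain}.wikipedia.org"
print(args.time_limit, args.subdomain, args.summary, base_url)
```

Under these assumptions, an invocation such as `python3 crawler.py -s 60 de` would request summaries from the German-language subdomain for 60 seconds.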
