Skip to content

A spider crawls user information of stargazers and forkers of given repositories, then saves such information into a .csv file with pandas.

License

Notifications You must be signed in to change notification settings

WenjieDu/Spider_on_GitHub_Star_Fork

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Spider Crawling Info of Stargazers and Forkers

This spider crawls user information of stargazers and forkers of given repositories and saves such information into a .csv file with pandas.

Given GitHub repositories in SPECIFIED_REPOS, this spider will crawl all stargazers and forkers of them. If given repositories in FILTERING_REPOS, the spider will filter out stargazers and forkers of them from those of SPECIFIED_REPOS. For sure, any user in EXCLUSION_USERS will also be filtered out. If you want to get the email information of the crawled users, you have to provide valid GitHub credentials in file github_credential.json, which will be used by the spider to sign in GitHub. spider.log is an example log file that can show you how it works.

You can quickly create a usable python environment with an anaconda command conda env create -f conda_env_dependencies.yml. ❗️Note that this file is for MacOS. Some errors may jump out if you use it on other platforms like Linux. But you can still use it for dependency reference.

👏 Click here to view stargazers and forkers of this repo

Stargazers repo roster for @WenjieDu/GitHub_Spider_on_Star_Fork

Forkers repo roster for @WenjieDu/GitHub_Spider_on_Star_Fork

About

A spider crawls user information of stargazers and forkers of given repositories, then saves such information into a .csv file with pandas.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Sponsor this project

 

Packages

No packages published

Languages