New Coronavirus Outbreak Map Crawler Project

English | 简体中文

Content

New Coronavirus Outbreak Map Crawler Project

Project Profile

This project was developed on February 16, 2020, initially as a real-time display of the extent of proliferation in China, and the crawlers were sourced from major Internet companies in real time, using PHP and HTML for the front-end of the exhibition

Support data display via echart
Data Crawl in Mainland China
Overseas regional support (all regions except Korea are consolidated data)
Data crawling of counties and cities in China (Jiangsu area, other areas just need to change the relevant data sources)
Support exporting data for calculation

Run Way

$python time.py

Dependencies

python

pymysql
lxml
selenium(Webdriver[google chrome])
BeautifulSoup

Web Support

Baidu Echart
Apache

Ideas

Crawl the major epidemic display version of the site data, using lxml to structure the site analysis, through the For loop to traverse the fixed format XPATH address

Python crawler->Epidemic website: crawl
Python Crawler->MySql:Crawl through the epidemic site into the database
PHP->MySql:request data
MySql->PHP:Return data

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.idea		.idea
allMap		allMap
css		css
dep		dep
echarts/map		echarts/map
images		images
img		img
js		js
test		test
.DS_Store		.DS_Store
0.png		0.png
1.cpp		1.cpp
5d10389e233c4f3f66fcc3c487973d57.png		5d10389e233c4f3f66fcc3c487973d57.png
LICENSE		LICENSE
README-zh.md		README-zh.md
README.md		README.md
back.png		back.png
bd.py		bd.py
bootstrap.min.css		bootstrap.min.css
ciyun.py		ciyun.py
component.css		component.css
cs.html		cs.html
d.py		d.py
d3c9a6bf00042c43c286a1741349d481.jpg		d3c9a6bf00042c43c286a1741349d481.jpg
debug.log		debug.log
demo.css		demo.css
dict.txt		dict.txt
dsrw.py		dsrw.py
etry.py		etry.py
gen.svg		gen.svg
ghostdriver.log		ghostdriver.log
github.png		github.png
graph.php		graph.php
guangZhou.html		guangZhou.html
hade.php		hade.php
hae.php		hae.php
hare.php		hare.php
hared.php		hared.php
hd.jpg		hd.jpg
huaian.php		huaian.php
index.php		index.php
js.py		js.py
kr.php		kr.php
kr.py		kr.py
krcity.py		krcity.py
krd.php		krd.php
krnews.py		krnews.py
krre.php		krre.php
krred.php		krred.php
krree.php		krree.php
krrenew.py		krrenew.py
les-miserables.gexf		les-miserables.gexf
light.php		light.php
marquee.png		marquee.png
mat500.txt		mat500.txt
normalize.css		normalize.css
npmdepgraph.min10.json		npmdepgraph.min10.json
re.php		re.php
read.php		read.php
red.php		red.php
ree.php		ree.php
refresh.png		refresh.png
reload.png		reload.png
reload.svg		reload.svg
rl.py		rl.py
rmrb.py		rmrb.py
sc.php		sc.php
ten.py		ten.py
test.html		test.html
test.php		test.php
time.py		time.py
virus.sql		virus.sql

License

gabrielpondc/Coronavirus-Outbreak-Map-Crawler

Folders and files

Latest commit

History