Hack open web data with web scraping approaches with Google Sheets, Python, and R at Access 2019

Note: This workshop was given at Access 2019 on October 2, 2019.

Content of this repository

You can find exercises files in Python and R.

Abstract

The Web has become a source of data for daily and scientific research. Although there are many initiatives to facilitate data exchange, most of the Web content are written in plain HTML. This workshop will introduce three approaches (Google Sheets, Python, and R) from simple to advanced to scrape web data in a standard format like CSV, XML, and JSON and how these techniques can be applied to daily work and research.

Slides

You can find my slides used for this workshop in Zenodo.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Python exercise files		Python exercise files
R exercise files		R exercise files
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python exercise files

Python exercise files

R exercise files

R exercise files

README.md

README.md

Repository files navigation

Hack open web data with web scraping approaches with Google Sheets, Python, and R at Access 2019

Content of this repository

Abstract

Slides

About

Releases

Packages

Languages

yooylee/access-2019-web-scraping

Folders and files

Latest commit

History

Repository files navigation

Hack open web data with web scraping approaches with Google Sheets, Python, and R at Access 2019

Content of this repository

Abstract

Slides

About

Resources

Stars

Watchers

Forks

Languages