Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 883 Bytes

README.md

File metadata and controls

12 lines (8 loc) · 883 Bytes

Hack open web data with web scraping approaches with Google Sheets, Python, and R at Access 2019

Note: This workshop was given at Access 2019 on October 2, 2019.

Content of this repository

You can find exercises files in Python and R.

Abstract

The Web has become a source of data for daily and scientific research. Although there are many initiatives to facilitate data exchange, most of the Web content are written in plain HTML. This workshop will introduce three approaches (Google Sheets, Python, and R) from simple to advanced to scrape web data in a standard format like CSV, XML, and JSON and how these techniques can be applied to daily work and research.

Slides

You can find my slides used for this workshop in Zenodo.