Skip to content

ssciwr/jupyter-data-exploration

Repository files navigation

Course description

Jupyter notebooks are a great tool for exploring and interacting with data using the Python programming language and its rich ecosystem of libraries. In this course we will cover basic usage of the Pandas library to download a dataset, explore its contents, clean up missing or invalid data, filter the data according to different criteria, and plot visualizations of the data.

Preparation

You will need a computer with Python, Jupyter and pandas installed.

If you don't already have this, I recommend installing Anaconda (which contains all of this and more):

After installing

  • open Jupyter (e.g. on Windows: Start Menu -> Anaconda3 -> Jupyter Notebook)

  • create a new notebook (the web browser where Jupyter appeared, in top right click on New -> Notebook Python 3)

  • type import pandas and press Shift+Enter (or click Cell -> Run Cells in the menu)

  • if no error message appears, you are ready to start the course!

setup screenshot

During the course

The notebooks created during the course will be automatically updated as they are edited at jupyter-data-exploration-live.

Course materials

External resources