pandas

Materials for teaching the introductory pandas workshop at UC Berkeley's D-Lab.

Set Up

For this workshop we'll be using a Jupyter notebook. (An IPython notebook will be provided for attendees who are not yet on IPython 3.0.0 or above.)

The best learning experience happens when you can edit and run code. So, please have pandas, Matplotlib, and Jupyter or IPython (and the notebooks) installed. There are several options for getting your environment set up.

BCE Summer 2015
Anaconda
A package manager such as pip

Both BCE and the Anaconda distribution will install everything you need for this workshop. If you decide to use pip, you can do the following:

# install pandas and Matplotlib
$ pip install pandas matplotlib

# install Jupyter
$ pip install --upgrade jupyter

Once those are installed, you can get the necessary files for this workshop by doing the following:

# clone the repository
$ git clone https://github.com/dlab-berkeley/introduction-to-pandas.git

# navigate to the repo
$ cd introduction-to-pandas

# start the interactive session
$ jupyter notebook

# alternatively
$ ipython notebook

Outline

For this workshop, we'll go through an example using European unemployment data. We'll load, view, and modify the data as well as calculate some descriptive statistics. The idea is to get a sense of what it would be like to use pandas as part of your workflow.

We plan to cover:

pandas data structures
loading data
subsetting and filtering
calculating summary statistics
dealing with missing values
merging data sets
creating new variables
basic plotting
exporting data

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
introduction-to-pandas-2.4.1.ipynb		introduction-to-pandas-2.4.1.ipynb
introduction-to-pandas.ipynb		introduction-to-pandas.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

introduction-to-pandas-2.4.1.ipynb

introduction-to-pandas-2.4.1.ipynb

introduction-to-pandas.ipynb

introduction-to-pandas.ipynb

Repository files navigation

pandas

Set Up

Outline

About

Releases

Packages

License

sahabA/introduction-to-pandas

Folders and files

Latest commit

History

Repository files navigation

pandas

Set Up

Outline

About

Resources

License

Stars

Watchers

Forks