Skip to content

hsaputra/ConnectIntensive

 
 

Repository files navigation

Getting Started

This repo is a collection of Jupyter Notebooks to accompany the Udacity Connect Intensive Machine Learning Nanodegree. The code is written for Python 2.7, but should be (mostly) compatible with Python 3.x.

Installing Python and Jupyter Notebook

If you haven't already done so, you'll need to download and install Python 2.7. If using Mac OS X, you may want to use Homebrew as a package manager, following these instructions to install Python 2.7 or Python 3. You can also use Anaconda as a package manager. Then, you can follow these instructions to install Jupyter notebook. These instructions explain how to install both Python 2 and Python 3 kernels.

Fork and Clone this Repo

You can follow these instructions to create a fork of the ConnectIntensive repo, and clone it to your local machine. Once you've done so, you can navigate to your local clone of the ConnectIntensive repo and follow these instructions to run the Jupyter Notebook App.

Required Libraries and Packages

The required packages and libraries vary in each of these Jupyter Notebooks. The most commonly used ones are listed below:

Each Lesson Notebook lists its own specific prerequisites along with the objectives.

Lesson Notebooks

Most lesson notebooks have a corresponding solutions notebook with the outputs of each cell shown. For example, the notebook solutions-01.ipynb displays the output and shows the solutions to the exercises from lesson-01.ipynb.

  • lesson-00.ipynb : Hello Jupyter Notebook!
    • A "hello world" notebook to introduce the Jupyter IDE
    • Introduces import statements for commonly-used modules and packages
  • lesson-01.ipynb : An intro to Statistical Analysis using pandas
  • lesson-02.ipynb : Working with the Enron Data Set
  • lesson-03-part-01.ipynb : Building and Evaluating Models with sklearn (part 1)
    • Perform exploratory data analysis on a dataset
    • Tidy a data set so that it will be compatible with the sklearn library
  • lesson-03-part-02.ipynb : Building and Evaluating Models with sklearn (part 2)
  • lesson-04-part-01.ipynb : Bayes NLP Mini-Project
  • lesson-05.ipynb : Classification with Support Vector Machines
  • lesson-06-part-01.ipynb : Clustering Mini-Project
    • Perform k-means clustering on the Enron Data Set.
    • Visualize different clusters that form before and after feature scaling.
    • Plot decision boundaries that arise from k-means clustering using two features.
  • lesson-06-part-02.ipynb : PCA Mini-Project

Additional Resources

I find that learning Python from Jupyter Notebooks is addictive. Here are some other great resources.

About

Jupyter Notebooks for the Connect Intensive MLND Program

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%