sbstn-gbl/lfbd-22-23

GitHub Repository for Course "Learning from Big Data 2022/23" (Modules 2 and 3)

README content

Repository content

.
├── Makefile                  # run `make help` to see make targets
├── README.md                 # this readme file
├── requirements.txt          # virtualenv requirements file
├── lectures                  # lecture notebooks
├── preparation               # course preparation notebooks
└── source                    # sources, e.g., images for notebooks

Please consider the following instructions and the material in this repository carefully. The repository content is designed to make participation in Learning from Big Data as easy and enjoyable for you as possible.

Requirements

  1. Python 3.8
  2. virtualenv

Optional:

  1. node (for plotly)
  2. graphviz (install with brew install graphviz)
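The requirements above can be checked before the first lecture with a short script. This is a minimal sketch, not part of the repository: the function name `check_prerequisites` is hypothetical, the script treats only Python 3.8 as a match because that is the version listed here, and it assumes Graphviz is detectable through its `dot` executable.

```python
import importlib.util
import shutil
import sys
from typing import Dict


def check_prerequisites() -> Dict[str, bool]:
    """Report which tools from the requirements list are available.

    Hypothetical helper for illustration; it is not shipped with the course.
    """
    # virtualenv may be installed as a command or only as a Python module.
    has_virtualenv = (
        shutil.which("virtualenv") is not None
        or importlib.util.find_spec("virtualenv") is not None
    )
    return {
        # Assumption: only an exact 3.8 interpreter counts as a match.
        "python 3.8": sys.version_info[:2] == (3, 8),
        "virtualenv": has_virtualenv,
        "node (optional)": shutil.which("node") is not None,
        # Assumption: Graphviz is detected via its `dot` executable.
        "graphviz (optional)": shutil.which("dot") is not None,
    }


for name, available in check_prerequisites().items():
    print(f"{name}: {'found' if available else 'missing'}")
```

A "missing" entry for an optional tool does not block the setup; it only means that features depending on that tool may not work.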

Please familiarize yourselves with virtualenv (or a similar tool such as conda). Some background information can be found in the virtualenv docs.

In the lectures, we will use Jupyter notebooks to illustrate implementation-related key points. The notebooks will be published in this repository well ahead of the lecture. Please make sure that you can execute the notebooks before joining the class so you can easily follow the coding parts in the lectures.

For the homework assignments, use an IDE of your choice; the choice depends on personal preference. A very popular option is PyCharm (JetBrains offers a free pro license for students). If you are familiar with coding, this should be easy to manage. Other people prefer Spyder, JupyterLab, or Google Colab. Do some research to figure out which IDE best suits your background and preferences.

Setup

Makefile targets

The Makefile included in this repository is purely for convenience (e.g., setting up the virtual environment, launching a notebook server). It should work on Linux and Mac OS X systems.

$ make help
Make targets:
  build          create virtualenv and install packages
  build-lab      `build` + lab extensions
  freeze         persist installed packaged to requirements.txt
  clean          remove *.pyc files and __pycache__ directory
  distclean      remove virtual environment
  run            run jupyter lab
Check the Makefile for more details

Step-by-step instructions

  1. Open a terminal and navigate to the directory where you want to clone the repository
  2. Clone the repository
    $ git clone git@github.com:sbstn-gbl/lfbd-22-23.git
    
  3. Navigate to the repository path, then create the virtual environment and install the required modules with
    $ cd lfbd-22-23 && make build
    
    or make build-lab to include jupyterlab dependencies.
  4. Start a notebook server with
    $ make run
    

If make does not work on your computer, run the steps included in the Makefile manually. You only need to do this setup once.
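As a rough guide to the manual route, the sketch below lists commands that approximate what `make build` and `make run` are described as doing. It is an assumption-laden illustration, not a copy of the Makefile: the environment directory name `venv`, the helper name `manual_setup_commands`, and the exact commands are hypothetical, so check the Makefile for the real steps. The paths assume a Linux or macOS layout (`bin/`).

```python
import sys
from pathlib import Path
from typing import List


def manual_setup_commands(env_dir: str = "venv") -> List[List[str]]:
    """Return commands approximating `make build` followed by `make run`.

    Hypothetical helper: `env_dir` and the commands themselves are
    assumptions; the Makefile in the repository is authoritative.
    """
    bin_dir = Path(env_dir) / "bin"  # Linux/macOS virtualenv layout
    return [
        # Create the virtual environment.
        [sys.executable, "-m", "virtualenv", env_dir],
        # Install the packages listed in requirements.txt into it.
        [str(bin_dir / "pip"), "install", "-r", "requirements.txt"],
        # Start JupyterLab (assumes it is installed by the previous step).
        [str(bin_dir / "jupyter"), "lab"],
    ]


# Print the commands instead of running them, so nothing is changed on disk.
for command in manual_setup_commands():
    print(" ".join(command))
```

Printing rather than executing keeps the sketch safe to run anywhere; the commands can then be entered one by one in a terminal from the repository root.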

GIFs

Clone repository and run make build:

Start Jupyter lab with make run:

Windows

See Canvas.

Course preparation

Please try to work on the following three pre-course assignments before the first lecture of module 2.

Also consider studying the material covered in the following online courses:

Use textbooks or online resources to fill gaps in your skills. The pre-course assignments will prepare you for the materials covered in Learning from Big Data and help you assess how ready you are for this course.

Lecture notebooks

Module 2

Module 3
