Skip to content

peigangzhang/MSBX5420_Spring2020

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

73 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MSBX 5420 - Spring 2020

Unstructured and Distributed Data Modeling and Analysis

Leeds School of Business, University of Colorado Boulder

Contact Information

Instructor:

Office hours:

Course Info

  • Course #: MSBX 5420-003
  • Topic: Unstructured and Distributed Data Modeling and Analysis
  • Room: HUMN 1B90
  • Days: Tuesdays
  • Time: 5:30 pm to 8:00 pm
  • Zoom ID: 486-957-298
    • Meeting ID: 486-957-298
    • Join via web browser: https://cuboulder.zoom.us/j/486957298
    • Join via Zoom app (using meeting ID)
    • Join via One tap mobile: +16699006833,,486957298# or +16465588656,,486957298#
    • Join via telephone: 1-669-900-6833 or 1-646-558-8656

Schedule (subject to change)

Date Topic
Week 1
January 14
Section 1:Course Introduction.
Section 2: Github, Jupyter notebook, Binder, Python and Spark Helloworld
Week 2
January 21
Section 1:Distributed File System - HDFS.
Section 2: HDFS Command Line, Pydoop, Linux, SSH, Run Jupyter on Cluster Server
Week 3
January 28
Section 1:Distributed Computing - Spark RDD 1.
Section 2: Docker and PySpark Programming
Week 4
February 4
Section 1:Distributed Computing - Spark RDD 2.
Section 2: Git, Pycharm and PySpark Programming
Week 5
February 11
Section 1:Spark SQL.
Section 2: DataFrame Programming
Week 6
February 18
Section 1:Spark Batch Processing.
Section 2: Create Cluster and Run Spark Job in Cluster
Week 7
February 25
Spark Programming Patterns
Week 8
March 3
Section 1:Spark Streaming and Kafka.
Section 2: Code Review with GitHub
Week 9
March 10
Section 1:NoSQL and Apache Cassandra.
Section 2: Class Presentation
Week 10
March 17
Class Presentation
March 24 No Class, Spring break
Week 11
March 31
Section 1:Cloud Computing and AWS.
Section 2: Running Spark job in EMR Cluster
Week 12
April 7
Team Project and AWS
Week 13
April 14
Big data machine learning, Exam
Week 14
April 21
Team Project, ElasticSearch and Kibana
Week 15
April 28
Final Project presentation

About

CU Leeds School MSBX5420 Spring 2020

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published