MSBX 5420 - Spring 2020

Unstructured and Distributed Data Modeling and Analysis

Leeds School of Business, University of Colorado Boulder

Contact Information

Instructor:

Office hours:

Course Info

  • Course #: MSBX 5420-003
  • Topic: Unstructured and Distributed Data Modeling and Analysis
  • Room: HUMN 1B90
  • Days: Tuesdays
  • Time: 5:30 pm to 8:00 pm
  • Zoom ID: 486-957-298
    • Meeting ID: 486-957-298
    • Join via web browser: https://cuboulder.zoom.us/j/486957298
    • Join via Zoom app (using meeting ID)
    • Join via One tap mobile: +16699006833,,486957298# or +16465588656,,486957298#
    • Join via telephone: 1-669-900-6833 or 1-646-558-8656

Schedule (subject to change)

| Week | Date | Topic |
|------|------|-------|
| Week 1 | January 14 | Section 1: Course Introduction. Section 2: GitHub, Jupyter Notebook, Binder, Python and Spark Hello World |
| Week 2 | January 21 | Section 1: Distributed File System - HDFS. Section 2: HDFS Command Line, Pydoop, Linux, SSH, Run Jupyter on Cluster Server |
| Week 3 | January 28 | Section 1: Distributed Computing - Spark RDD 1. Section 2: Docker and PySpark Programming |
| Week 4 | February 4 | Section 1: Distributed Computing - Spark RDD 2. Section 2: Git, PyCharm and PySpark Programming |
| Week 5 | February 11 | Section 1: Spark SQL. Section 2: DataFrame Programming |
| Week 6 | February 18 | Section 1: Spark Batch Processing. Section 2: Create Cluster and Run Spark Job in Cluster |
| Week 7 | February 25 | Spark Programming Patterns |
| Week 8 | March 3 | Section 1: Spark Streaming and Kafka. Section 2: Code Review with GitHub |
| Week 9 | March 10 | Section 1: NoSQL and Apache Cassandra. Section 2: Class Presentation |
| Week 10 | March 17 | Class Presentation |
| | March 24 | No class (spring break) |
| Week 11 | March 31 | Section 1: Cloud Computing and AWS. Section 2: Running Spark Job in EMR Cluster |
| Week 12 | April 7 | Team Project and AWS |
| Week 13 | April 14 | Big Data Machine Learning, Exam |
| Week 14 | April 21 | Team Project, Elasticsearch and Kibana |
| Week 15 | April 28 | Final Project Presentation |