Skip to content

Chipdelmal/dataViz_CADi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

dataViz CADi

This repository contains the materials for the "Data Visualization" three days intensive CADi ("Cursos de Actualización en las Disciplinas") course taught to faculty members at "Tecnológico de Monterrey" Institute. It hosts a compendium of materials and activities designed to develop and improve skills to make charts and plots more interactive/appealing, and to show some of the tools we can use to achieve better exposure of our work in scientific, and engineering applications.

For other data-analysis related topics please take a look at the dataPy_CADi repository. Which contains exercises on data wrangling in Python.



Requirements

Although not strictly required, having some knowledge on one of the following programming languages is suggested (as we'll be using them throughout the course):

  • Mathematica: Most of the graphics showcased in the website were developed in this platform due to its flexibility in terms of graphical capabilities (as well as personal preference of the teacher).
  • Python: One of the most popular programming languages. Some of the more versatile data visualization frameworks are compatible with it.
  • R: A popular statistical framework with lots of community support.



There's examples developed in each of the platforms according to the application and availability of frameworks for specific tasks.

Additionaly, it is also suggested to have the atom text editor for the markdown and python examples. For a useful guide on how to install R and Python kernels in Atom follow this link. Some other useful packages for development in atom are:

It is also suggested to install the RStudio IDE for R development and the github desktop app to for the repository.




Given that the course is intended to be useful for several disciplines, the workshop was created with flexibility in mind. As such, modules are fairly independent and can be taken in different order. Alternatively, take a look at the sitemap for the full tree of contents and exercises contained in this repository.

Day 01 (8h)

Goal: To describe the basic principles of data visualization, the types of plots that better describe certain datasets, and perform some common data visualization examples that are common across different fields.

  1. Introduction: Objectives, Scope, My background, Software Installation
  2. Data Visualization Primer: Data visualization workflow
  3. Mathematica/R/Python Primer: Brief introduction to programming languages
  4. Media Formats: Raster-based, Vector-based
  5. Plot Types (first part with exercises): Counts, Scatter, Time Series
  6. Data Handling/Data Sources: Data Formats, Data Handling Frameworks



Day 02 (8h)

Goal: To describe and run through some examples of popular data visualization frameworks in R and Python.

  1. Working with Python and Anaconda: Setting up, Basics, VirtualEnv, Anaconda, Jupyter, Spyder, Atom
  2. Colors: Color Palettes
  3. Plot Types (second part with exercises): Time Series, Transitions, Clustering, Factorial, Multidimensional, Geographic
  4. Good Practices: Suggestions to make data visualization clearer



Day 03 (8h)

Goal: To be able to put together a project website and host some of the examples created throughout the course for better exposure of our work.

  1. Github Introduction: Introduction to github, setting up an account, and our first repository
  2. Markdown + HTML Primer: Introduction to MD and HTML for github and presentations
  3. gh-pages: Github pages, "Docs" folder, "gh-pages" branch
  4. Remark: One of the frameworks to create simple markdown presentations
  5. Revealjs: MathJax-supported javascript HTML presentations framework
  6. ffmpeg: Stop-motion animations, Further video editing




Resources

This is a list of complementary sources and tools that are useful in data visualization applications.

Tools

  • anaconda: DataScience/Package manager platform for python and R
  • atom: Versatile IDE for R, Python, Markdown, Javascript, amongst others
  • ffmpeg: Video Manipulation command line tool (can be used to create "stop-motion" animations)
  • ggplot2: Plotting in R
  • gimp: Free "photoshop" alternative
  • github pages: Github pages
  • irkernel: R kernel for jupyter
  • jekyll: Blog-like templates for github pages (Ruby)
  • jupyter: Jupyter project
  • leaflet: Open-source JavaScript library for interactive maps
  • matplotlib: Python plotting framework
  • mathjax: Use latex in html documents through javascript
  • networkD3: R Network Plotting
  • plotly: Interactive plots (both in R, and Python)
  • python: General-purpose programming language
  • R: Statistical computing programming language
  • RColorBrewer: Color palettes for R
  • remark: Markdown presentations
  • replit: Online python environments project
  • revealjs: Javascript presentations
  • rStudio: R IDE
  • sciweavers: Latex to image converter to embed them into markdown
  • shiny: Interactive web development though R
  • slides: GUI for revealjs
  • spyder: "RStudio"-like IDE for Python
  • tydiverse: Collection of R packages designed for data science.

Online

Books


Alumni

Faculty

Camilo René Duque Becerra • Carlos Daniel Prado Pérez • Donovan manuel Esqueda Merino • Edgar Emmanuel Vallejo Clemente • Faustino Yescas Martinez • Francisco Javier Delgado Cepeda • Guillermo Sandoval Benítez • Ivonne Abud Urbiola • José Luis Gómez Muñoz • Juan Carlos del Valle Sotelo • Lizethe Pérez Fuertes • Luis Miguel Méndez Díaz • María de Lourdes Quezada Batalla • Miguel Rocha Romero • Rafael Benitez Medina • Ramón Marín Solís • Raúl Gómez Castillo • Raúl Martinez Rosado • Salvador Elías Venegas Andraca • Saul Juarez Ordoñez • Sergio Santiago Rentería

Masters and PhD Students

Hugo I. Velasco • Martín Molinero • Myriam Elizabeth • Rodrigo Careaga


Contact: [ sanchez.hmsc@berkeley.edu | chipdelmal@gmail.com ]
My main projects: [ MGDrivE & MoNeT ]
My personal website: [ chipdelmal.github.io ]