Skip to content

grisreyesrios/Introduction-to-Data-Science-Specialization-IBM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 

Repository files navigation

Introduction to Data Science Specialization IBM

This repository contains all the material related with the specialization of IBM hosted in Coursera. The courses offer an introduction into the foundations and tools that are required as first steps into the Data Science world.

General Learning Objectives:

  • Understanding the concept of Data Science and the activities related to this one.
  • Open source tools.
  • The know-how to solve problems.
  • Relational database concepts and the use of SQL to query databases.

The specialization is divided into 4 courses which learning objectives are:

- [x] Course 1: What is Data Science?

  • Motivation to jump into the Data Science world. (Experiences shared by professionals in this area).
  • Skills required for anyone interested in pursuing a career in this field. First steps about the process of mining a given dataset and regression analysis.
  • Importance of story-telling and the importance of an effective final.

- [x] Course 2: Open source tools for data science, Go to Directory

  • Introduction to Jupyter Notebooks.
  • Introduction to Apache Zeppelin Notebooks.
  • Learn about an enterprise-ready data science platform by IBM (IBM Watson), tools that Data Scientists use in industry.

-[ ] Course 3: Data Science Methodology

  • Approach a problem with: 1.The Business Understanding 2.The Analytic Approach 3.Data Requirements 4.Data Collection

  • Understand data such as prepare and clean data.

  • Data modeling.

  • Deployment of the model.

-[ ] Course 4: Databases and SQL for Data Science

  • The basic about databases and SQL statements.
  • Use string patterns and ranges to search data, sort and group data in result sets and how to work with multiple tables in a relational database using join operations.
  • Create tables, load data, query data using SQL and analyze data using Python.

Extra links to get started in Data Science

  1. Towards data science
  2. Kaggle: It is the place to do Data Science Projects
  3. Top 50 Data Science Resouces
  4. Data Camp

Free Data Mining Tools

  1. Google N-Grams
  2. Yelp Dataset
  3. Lending Club Statistics
  4. Wikipedia Database Download

License

This repository is under the MIT License