Introduction to data modeling techniques for Linear Regression models using StatsModels and Scikit Learn.
Updated Jul 8, 2019 - Jupyter Notebook
Data modeling using Cassandra (CQL) for a music industry project (Sparkify). Course project for the Udacity Data Engineering Nanodegree.
[Data Modeling/ ETL] This project aims to create a star schema optimized for song play analysis queries for the music streaming service Sparkify.
Deriving insights from data stored in SQL or NoSQL databases for dashboards.
Investigated a Lyft riders’ data set by performing data wrangling, conducting exploratory data analysis, and building a statistical machine learning model with Python packages to determine the KPIs that guide riders’ cancellation decisions.
This project involved data engineering and data analysis: I designed tables to hold data from 6 CSV files, imported the CSVs into a SQL database using PostgreSQL, and wrote SQL queries to answer the given questions. A bonus analysis included creating charts to analyze employee salary data.
7. SQL Engineering and Data Analysis
A server-side pre-rendered company website built using Next.js.
Modeling the data with Postgres and building an ETL pipeline using Python. I will define fact and dimension tables for a star schema for a particular analytic focus, and write an ETL pipeline that transfers data from files in two local directories into these tables in Postgres using Python and SQL.
Data modeling with Postgres. First project of the Udacity Data Engineering Nanodegree.
ETL and Dimensional Modeling with PostgreSQL
A data science hackathon project to draw the top 10 insights from several datasets comprising 3 lakh (300,000) data records.
ETL (Extract, Transform & Load) pipeline that extracts user activity and song data from JSON files and ingests it into a Postgres database. This project is part of the [Udacity Data Engineering nanodegree](https://www.udacity.com/course/data-engineer-nanodegree--nd027).
Exploring, cleaning, and wrangling SFM Technologies' water consumption dataset using Python and the pandas library in order to obtain clean data. This data is then modeled and visualized in a Power BI dashboard.
Data model to keep track of a collector's extensive retro electronics collection.
This project is part of the Virtual Internship Program from ID/X Partners. It creates a credit score based on a logistic regression model and provides some business insights.
Research project using data modeling, data engineering, and data analysis to create a database of employees hired at Pewlett Hackard in the 1980s and 1990s.
The repository contains code that retrieves data on the latest GitHub repositories using the GitHub API. The retrieved data is then cleaned and transformed before being sent to Google BigQuery for storage and analysis. In BigQuery, exploratory data analysis (EDA) techniques are applied to gain insights into the data.
This is the backend for my MERN Stack App - Blogly.
Developed a machine learning model using the Cleveland Heart Disease dataset to accurately predict heart disease presence in individuals based on 14 medical attributes. Conducted comprehensive data exploration, visualization, model selection, training, hyperparameter tuning, and evaluation. Identified crucial features to aid diagnosis and treatment.