Simple CSV parser for huge volumes of data. It uses the Python library pandas to extract specific columns from a CSV file and quickly write the extracted data to one or more files (each column in a separate file, or all of them in the same output).
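As a rough illustration of the approach described above, the following is a minimal sketch and not the repository's actual code. It assumes pandas is installed; the function names `extract_columns` and `write_columns`, the output file names, and the default chunk size are all hypothetical choices made for the example.

```python
import os

import pandas as pd


def extract_columns(source, columns, chunksize=100_000):
    """Yield DataFrames containing only `columns`, `chunksize` rows at a time."""
    # usecols restricts parsing to the requested columns;
    # chunksize keeps memory bounded for very large files.
    for chunk in pd.read_csv(source, usecols=columns, chunksize=chunksize):
        yield chunk


def write_columns(source, columns, out_dir, separate=False, chunksize=100_000):
    """Write the selected columns to one file, or to one file per column."""
    os.makedirs(out_dir, exist_ok=True)
    written = set()
    for chunk in extract_columns(source, columns, chunksize):
        # One frame per output file: each column alone, or all columns together.
        # The file names below are hypothetical.
        if separate:
            targets = {f"{col}.csv": chunk[[col]] for col in columns}
        else:
            targets = {"selected.csv": chunk}
        for name, frame in targets.items():
            path = os.path.join(out_dir, name)
            first = path not in written
            # Overwrite on the first chunk, then append without repeating the header.
            frame.to_csv(path, mode="w" if first else "a", header=first, index=False)
            written.add(path)
    return sorted(written)
```

Reading in chunks trades a little speed for a bounded memory footprint, which is usually the safer default when the input size is unknown.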
Project 3, OpenClassrooms AI Engineer. This project consists of analyzing and preprocessing data from the Open Food Facts project and presenting the data on an interactive web page.
This repo 📚 contains a `docker-compose.yml` file for running a Jupyter notebook 📓 server in a Docker container 🐳. Easily save your notebooks and conda environments 🐍 to your local machine for simple access and restoration. Say goodbye to dependency issues and installation headaches 🤕 and start using Jupyter now! 🚀
This project applied a sampling/de-duplication technique based on schema knowledge on the HoloClean system. The goal was to reduce the number of training points in order to decrease the memory footprint and, at the same time, improve run time.
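The general idea of de-duplicating on schema-derived attributes before sampling can be sketched with pandas. This is only an illustration under stated assumptions: it is not HoloClean's API or the project's implementation, and the function name `dedup_then_sample`, the `key_columns` parameter, and the sampling fraction are hypothetical.

```python
import pandas as pd


def dedup_then_sample(df, key_columns, frac=0.5, seed=0):
    """Drop rows that repeat on `key_columns`, then keep a random fraction."""
    # Assumption: schema knowledge tells us which columns identify a record,
    # so rows agreeing on all of them are treated as duplicates.
    unique = df.drop_duplicates(subset=key_columns)
    # Sampling the de-duplicated rows shrinks the training set further.
    return unique.sample(frac=frac, random_state=seed)
```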
Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention.