Open source project for data preparation of LLM application builders
-
Updated
May 25, 2024 - Python
Open source project for data preparation of LLM application builders
A passion project focused on analyzing my own readinglists & fanworks hosted on Archive Of Our Own!
A simple tool to consolidate multiple files into a single .txt file. Perfect for feeding your files to AI tools without any fuss.
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Main Repository
Prepping tables for machine learning
The data extraction and processing involved thorough exploration, preprocessing, and visualization of the "Video Game Sales with Ratings" dataset.
A tool to streamline AI image captioning
This project is my personal project about Marketing Campaign using dataset from Big Tech Company provided by Rakamain Academy. I created Clustering Model with Python (Sklearn) to get best model from the dataset that can used for arrange their next marketing strategic planning.
This project predicts MBTI personality types from users' recent 50 posts using NLP and ML techniques.
Analytics for a leading Brazilian E-commerce firm, Olist Store
Multiple models for binary classification and checking the accuracy with each model.
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Diwali-Sales-Analysis-Project
In this project I predict the 2016 MLS season using historical data and Poisson regression. The project includes cleaning, preprocessing and analyzing the dataset, building and evaluating predictive models for match outcomes, forecasting team performance and simulating the league table. It uses Pandas, Numpy, MatPlotLib and StatsModel libraries.
Minha resolução para um teste prático de uma vaga de Analytics Engineer Júnior
This project will focus on data preparation and will follow the steps : data cleaning, handling text and categorical attributes, and feature scaling.
Un cours pour apprendre à construire des interactions homme-données
A simple website to label images for classification locally.
ABAP unit testing framework, prepare in Excel, reuse in abap code
Add a description, image, and links to the data-preparation topic page so that developers can more easily learn about it.
To associate your repository with the data-preparation topic, visit your repo's landing page and select "manage topics."