Simple tool to split COCO annotations into train/test datasets.
-
Updated
Aug 15, 2023 - Python
Simple tool to split COCO annotations into train/test datasets.
Data Visualization, EDA , Model Building and Deployment etc..
Analyzing the HR Criteria of a Company and how they promote their Employees and keep Balance between them using Data Analytics, Data Visualizations, and Machine Learning Models for Classification Purposes.
Roadmap for Data Engineering
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
Material for Data Mining Lab Session (Fall Semester @ NTHU)
Cereja is a bundle of useful functions we don't want to rewrite and .. just pure fun!
Tokenizer is an independent Open Source, Natural Language Processing python library which implements a tokenizer to create token from Both Sentence and Paragraph.
All my Machine Learning Projects from A to Z in (Python & R)
Web Based Exploratory Data Analysis platform. Can be used as a precursor to the Data Preperation/ Preprocessing stage to understand the data through visualizations
Wind Power Forecasting Based on Hybrid CEEMDAN-EWT Deep Learning Method
Image Classification model for detecting and classifying *DIABETIC RETINOPATHY* using retina images
Predicting whether a person who has applied for a loan in a bank would get his/her loan approved or not using Classification Algorithms in Machine Learning, by looking at some common and useful attributes.
Data analysis using supervised learning techniques. The primary goal is to find potential donors for charity based on the features like age, income, etc.
An end to end ML model to predict whether a person has cardiovascular disease or not based on various features.
Predictions on NHANES3 dataset, predicting mortality
All the ML models collectively on standard datasets
Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical variables.
Power BI exercises for courses on DataCamp's Data Analyst in Power BI Career Track
Add a description, image, and links to the datapreprocessing topic page so that developers can more easily learn about it.
To associate your repository with the datapreprocessing topic, visit your repo's landing page and select "manage topics."