datapreprocessing

Here are 343 public repositories matching this topic...

akarazniewicz / cocosplit

Simple tool to split COCO annotations into train/test datasets.

coco deeplearning datapreprocessing

Updated Aug 15, 2023
Python

bharatc9530 / Machine-Learning

Data visualization, EDA, model building, deployment, etc.

data-science machine-learning deep-neural-networks machine-learning-algorithms embeddings artificial-intelligence artificial-neural-networks flask-api model-deployment datapreprocessing

Updated Nov 21, 2022
Jupyter Notebook

Analyzing a company's HR criteria, how it promotes employees, and how it keeps balance among them, using data analytics, data visualizations, and machine learning models for classification.

python statistics xgboost machinelearning boosting datavisualization analytics-vidhya-competition dataanalysis catboost datapreprocessing googlecolab featureextraction lgboost

Updated May 23, 2019
Jupyter Notebook

ErdemOzgen / Data-Engineering-Roadmap

Roadmap for Data Engineering

devops data-science machine-learning development roadmap awesome cloud database deep-learning interview ci-cd awesome-list guidelines datawarehouse datapipeline dataengineering awesome-resources datapreprocessing mlops

Updated May 9, 2024
Java

srishilesh / Machine-learning

python machine-learning linear-regression machine-learning-algorithms coursera python3 artificial-intelligence octave flask-api gnu-octave datapreprocessing accuracy-metrics

Updated Dec 24, 2020
Jupyter Notebook

IngestAI / embedditor

⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.

nlp php laravel ml embeddings markup-language datascience nltk vectorization datapreprocessing vector-search embedding-vectors vector-database llm genai veml

Updated Nov 21, 2023
PHP

omarsar / data_mining_lab

Material for Data Mining Lab Session (Fall Semester @ NTHU)

data datamining datavisualization datapreprocessing

Updated Oct 8, 2018
Jupyter Notebook

cereja-project / cereja

Cereja is a bundle of useful functions we don't want to rewrite and... just pure fun!

python console utilities progress-bar tokenizer python-library python3 colab file-converter data-tools tfidf hacktoberfest array-manipulations progress-view datapreprocessing freq hacktoberfest2020 freqitems

Updated May 3, 2024
Python

sayanmondal2098 / easytoken

Tokenizer is an independent, open-source natural language processing Python library that implements a tokenizer to create tokens from both sentences and paragraphs.

nlp data-science natural-language-processing tokenizer natural-language python-library python3 text-summarization token text-processing nlp-library nlp-machine-learning dataprocessing datapreprocessing

Updated Mar 1, 2024
Python

Joycechidi / MachineLearning

All my machine learning projects from A to Z (in Python and R)

python r naive-bayes regression classification logistic-regression polynomial-regression decision-tree-regression kernel-svm simple-linear-regression random-forest-regression multiple-linear-regression datapreprocessing support-vector-regression--svr evaluating-regression-models-perf regularization-methods k-nearest-neighbors-k-nn support-vector-machine-svm decision-tree-classification random-forest-classification

Updated Aug 15, 2019
Jupyter Notebook

rahulrajpl / wexda

Web-based exploratory data analysis platform. Can be used as a precursor to the data preparation/preprocessing stage to understand the data through visualizations.

data-science machine-learning datascience machinelearning dataanalysis datapreprocessing

Updated Dec 8, 2022
HTML

irenekarijadi / CEEMDAN-EWT-LSTM

Wind Power Forecasting Based on Hybrid CEEMDAN-EWT Deep Learning Method

prediction artificial-intelligence lstm forecasting deeplearning renewable-energy datapreprocessing ceemdan

Updated Sep 28, 2023
Python

adityasurana / APTOS-Blindness-Detection-Kaggle

Image classification model for detecting and classifying diabetic retinopathy using retina images

data-science data deep-learning image-classification transfer-learning pretrained-models vgg16 epoch keras-tensorflow diabetic-retinopathy-detection imagedatagenerator diabetic-retinopathy cnn-classification datapreprocessing diabetic-retinopathy-prediction

Updated May 15, 2022
Jupyter Notebook

sharmaroshan / Loan-Prediction

Predicting whether a person who has applied for a loan in a bank would get his/her loan approved or not using Classification Algorithms in Machine Learning, by looking at some common and useful attributes.

machine-learning eda data-visualization data-analysis beginner datapreprocessing

Updated Mar 31, 2019
Jupyter Notebook

jbp261 / Finding-Donors-for-Charity

Data analysis using supervised learning techniques. The primary goal is to find potential donors for charity based on features such as age and income.