A Doctor for your data
Updated Jan 12, 2024 - Python
A curated, but incomplete, list of data-centric AI resources.
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data (NeurIPS 2022)
Data-SUITE: Data-centric identification of in-distribution incongruous examples (ICML 2022)
TRIAGE: Characterizing and auditing training data for improved regression (NeurIPS 2023)
Enhancing Efficiency in Multidevice Federated Learning through Data Selection
Data clustering using the Expectation-Maximization algorithm. To cite this Original Software Publication: https://www.sciencedirect.com/science/article/pii/S2352711021001771
Implementation of data typology for imbalanced datasets.