Supplementary programmes for DeRDaVa: Deletion-Robust Data Valuation for Machine Learning.
Updated Dec 23, 2023 - Jupyter Notebook
Algorithms for data valuation and benchmarks
Code for the paper 'Interpretable Triplet Importance for Personalized Ranking' (in submission)
The Medium of Exchange of Ecosystem
Simulation environment for data collection dynamics.
The pyDVL slides for pyData Berlin 2024
This is the official repository for "2D-Shapley: A Framework for Fragmented Data Valuation" (ICML 2023).
Code for the submission to the ML Reproducibility Challenge 2022, reproducing "If you like Shapley then you'll love the core"
Papers about training data quality management for ML models.
Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"
PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]
Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)
This is the official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR 2023).
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
💱 A curated list of data valuation (DV) to design your next data marketplace