d2m - Data to model

A machine learning pipeline for trustworthy and green models, enabling responsible AI:

Explainable AI, using SHAP, LIME or both.
Uncertainty estimation, using Bayesian dropout for neural networks.
Carbon emissions tracking and reporting, using CodeCarbon.

d2m lets you easily create and evaluate machine learning models for tabular and time series data, with built-in data profiling and feature engineering.

Usage

Tested on:

Linux
macOS
Windows with WSL 2

Clone/download this repository.
Place your datafiles (csv) in a folder with the name of your dataset (DATASET) inside assets/data/raw/, so the path to the files is assets/data/raw/[DATASET]/.
Update params.yaml with the name of your dataset (DATASET), the target variable, and other configuration parameters.
Build Docker container:

docker build -t d2m -f Dockerfile .

Run the container:

docker run -p 5000:5000 -it -v $(pwd)/assets:/usr/d2m/assets -v $(pwd)/.dvc:/usr/d2m/.dvc d2m

Open the website at localhost:5000 to use the graphical user interface.

Creating models on the command line

Copy params.yaml from the host to the container (find CONTAINER_NAME by running docker ps):

docker cp params.yaml  [CONTAINER_NAME]:/usr/d2m/params.yaml

Inside the interactive session in the container, run:

docker exec [CONTAINER_NAME] dvc repro

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
assets		assets
docs		docs
src		src
test		test
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
dvc.yaml		dvc.yaml
params.yaml		params.yaml
params_default.yaml		params_default.yaml
requirements.txt		requirements.txt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

docs

docs

src

src

test

test

.dockerignore

.dockerignore

.gitignore

.gitignore

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

dvc.yaml

dvc.yaml

params.yaml

params.yaml

params_default.yaml

params_default.yaml

requirements.txt

requirements.txt

run.sh

run.sh

Repository files navigation

d2m - Data to model

Usage

Creating models on the command line

About

Releases

Packages

Languages

License

SINTEF-9012/d2m

Folders and files

Latest commit

History

Repository files navigation

d2m - Data to model

Usage

Creating models on the command line

About

Resources

License

Stars

Watchers

Forks

Languages