
Transformers for Data Scientists in a rush

Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.


This repository contains low-code, easy-to-understand, pre-built pipelines for fast experimentation on NLP tasks using huggingface/transformers pre-trained language models. The pipelines are explained and explored in an accompanying series of Medium posts.

The project was inspired by a LinkedIn post by Thomas Wolf, Hugging Face's CSO, showing an image of a low-code pipeline for fast experimentation with their Transformers library. As I could not find anything like it implemented anywhere, I decided to build it myself.

Index

As of now, we have:

Classification

The classification example uses an email spam classification dataset from Kaggle and optuna for hyperparameter tuning.

You can run it from the classification directory with:

python classification-experiment.py --model-name bert-base-multilingual-cased --metric f1_score --train-data-path train.csv --test-data-path test.csv --max-sequence-length 25 --label-nbr 2

It should yield an f1_score higher than 0.9.
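
For a rough sense of what such a pipeline does under the hood, here is a minimal sketch of BERT fine-tuning with an optuna study over learning rate and batch size. It is illustrative only, not the repository's actual code: it assumes train.csv and test.csv have "text" and integer "label" columns, and it trains for a single epoch per trial.

import optuna
import pandas as pd
import torch
from sklearn.metrics import f1_score
from torch.utils.data import DataLoader, TensorDataset
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL_NAME = "bert-base-multilingual-cased"
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)

def make_dataset(path, max_len=25):
    # Assumes a CSV with "text" and integer "label" columns (an assumption,
    # not the repository's documented format).
    df = pd.read_csv(path)
    enc = tokenizer(list(df["text"]), truncation=True, padding="max_length",
                    max_length=max_len, return_tensors="pt")
    return TensorDataset(enc["input_ids"], enc["attention_mask"],
                         torch.tensor(df["label"].values))

train_ds = make_dataset("train.csv")
test_ds = make_dataset("test.csv")

def objective(trial):
    # Sample this trial's hyperparameters.
    lr = trial.suggest_float("lr", 1e-5, 5e-5, log=True)
    batch_size = trial.suggest_categorical("batch_size", [16, 32])

    model = AutoModelForSequenceClassification.from_pretrained(
        MODEL_NAME, num_labels=2)
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)

    # One epoch of fine-tuning, kept short for the sketch.
    model.train()
    for input_ids, mask, labels in DataLoader(train_ds, batch_size=batch_size,
                                              shuffle=True):
        optimizer.zero_grad()
        model(input_ids, attention_mask=mask, labels=labels).loss.backward()
        optimizer.step()

    # Score the trial on the held-out set with F1.
    model.eval()
    preds, golds = [], []
    with torch.no_grad():
        for input_ids, mask, labels in DataLoader(test_ds, batch_size=64):
            logits = model(input_ids, attention_mask=mask).logits
            preds.extend(logits.argmax(dim=-1).tolist())
            golds.extend(labels.tolist())
    return f1_score(golds, preds)

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=10)
print(study.best_params, study.best_value)

Each trial fine-tunes a fresh model with the sampled hyperparameters and reports its test-set F1; study.best_params then holds the best combination found.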


Made by Pi Esposito
