(Simplified) Solution to Favorita Competition

Sorry, no CPU-only mode. You have to use an nvidia card to train models.

Test environment:

GTX 1070
16 GB RAM + 8 GB Swap
At least 30 GB free disk space

(it can be less if you turn off some of the joblib disk caching)

Docker 17.12.0-ce
Nvidia-docker 2.0

Acknowledgement

Transformer model comes from Yu-Hsiang Huang's implementation. His repo is included in "attention-is-all-you-need-pytorch" folder via git subtree.
LSTNet model is largely inspired from GUOKUN LAI's implementation.
The model structure is inspired by the work of Sean Vasquez and Arthur Suilin.

Docker Usage

First build the image. Example command: docker build -t favorita .

Then spin up a docker container:

docker run --runtime=nvidia --rm -ti \
    -v /mnt/Data/favorita_cache:/home/docker/labs/cache \
    -v /mnt/Data/favorita_data:/home/docker/labs/data \
    -p 6006:6006 favorita bash

It is recommended to manually mount the data and cache folder
port 6006 is for running tensorboard inside the container

Where to put the data

Download and extract the data files from Kaggle into data folder.

We're going to assume you're using the BASH prompt inside the container in the rest of this README.

Model Training

Preprocessing

python prepare_seq_data.py

Train Model

For now there are two types of model ready to be trained:

Transformer (fit_transformer.py)
LSTNet (fit_lstnet.py)

The training scripts use Sacred to manage experiments. It is recommended to set a seed explicitly via CLI:

python fit_transformer.py with seed=93102

You can also use Mongo to save experiment results and hyper-parameters for each run. Please refer to the Sacred documentation for more details.

Prediction for Validation and Testing Dataset

The CSV output will be saved in cache/preds/val/ and cache/preds/test/ respectively.

Tensorboard

Training and validation loss curves, and some of the embeddings are logged in tensorboard format. Launch tensorboad via:

tensorboard --logdir runs

Then visit http://localhost:6006 for the web interface.

TODO (For now you need to figure them out yourself)

Ensembling script: I made some changes to the outputs of model training scripts so they are more readable. But that means ensembling script needs to be updated as well. (For those who want to try: the ground truth for validation set is stored in cache/yval_seq.npy.)
Encoder/Decoder and Encoder/MLP models with LSTM, GRU, QRNN, SRU units: I tried a lot of different stuffs for this competition. But I feel the code could use some refactoring, so they are removed for now.
Tabular data preparation and models: My GBM models is mediocre at best, so not really worth sharing here. But as I mentioned in the blog post. For those store/item combination that were removed by the 56-day nonzero filter, using a GBM model to predict values for them will give you a better score than predicting zeros.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
attention-is-all-you-need		attention-is-all-you-need
cache		cache
data		data
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
bots.py		bots.py
components.py		components.py
dataset.py		dataset.py
fit_lstnet.py		fit_lstnet.py
fit_transformer.py		fit_transformer.py
io_utils.py		io_utils.py
locked_dropout.py		locked_dropout.py
models.py		models.py
prepare_seq_data.py		prepare_seq_data.py
preprocess.py		preprocess.py
transformer		transformer
weight_norm_rnn.py		weight_norm_rnn.py

ceshine/favorita_sales_forecasting

Folders and files

Latest commit

History

Repository files navigation

(Simplified) Solution to Favorita Competition

Acknowledgement

Docker Usage

Where to put the data

Model Training

Preprocessing

Train Model

Prediction for Validation and Testing Dataset

Tensorboard

TODO (For now you need to figure them out yourself)

About

Topics

Resources

Stars

Watchers

Forks

Languages