TabNet Reduced

Most of the code is taken from the original implementation of "TabNet: Attentive Interpretable Tabular Learning" by Sercan O. Arik and Tomas Pfister (paper: https://arxiv.org/abs/1908.07442).

The modified model, reduced TabNet, is defined in model/tabnet_reduced.py. There are two modifications:

- The model now has 1 shared feature transformer and 1 decision-dependent feature transformer (down from 2 and 2, respectively).
- The SparseMax mask for feature selection has been replaced by EntMax 1.5 (using an existing TensorFlow implementation).
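To make the second modification concrete, here is a minimal pure-Python sketch of the two mask activations. It is not the TensorFlow implementation used in this repository: the function name `entmax_bisect` and the bisection approach are assumptions chosen for readability, and the function handles a single 1-D score vector only. Both mappings have the form p_i = max((alpha - 1) * z_i - tau, 0) ** (1 / (alpha - 1)), with tau chosen so that the entries sum to 1; alpha = 2 corresponds to SparseMax and alpha = 1.5 to EntMax 1.5.

```python
def entmax_bisect(scores, alpha=1.5, n_iter=60):
    """Map a list of scores to a sparse probability vector by bisection.

    alpha=2.0 gives SparseMax, alpha=1.5 gives EntMax 1.5.
    Illustrative only; hypothetical helper, not part of this repository.
    """
    scaled = [(alpha - 1.0) * s for s in scores]
    exponent = 1.0 / (alpha - 1.0)

    def mass(tau):
        # Total probability mass obtained with threshold tau.
        return sum(max(s - tau, 0.0) ** exponent for s in scaled)

    # tau lies in [max - 1, max]: at the lower end the largest entry alone
    # contributes 1, at the upper end every entry is clipped to 0.
    lo, hi = max(scaled) - 1.0, max(scaled)
    for _ in range(n_iter):
        mid = 0.5 * (lo + hi)
        if mass(mid) >= 1.0:
            lo = mid
        else:
            hi = mid

    # lo always satisfies mass(lo) >= 1, so the normaliser is never zero.
    probs = [max(s - lo, 0.0) ** exponent for s in scaled]
    total = sum(probs)
    return [p / total for p in probs]


scores = [2.0, 1.0, -1.0]
sparsemax_mask = entmax_bisect(scores, alpha=2.0)
entmax15_mask = entmax_bisect(scores, alpha=1.5)
```

Both masks assign exactly zero weight to some features, which is what lets TabNet use them for feature selection; comparing the two outputs on your own score vectors shows how the choice of alpha changes the mask.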

Together, these modifications improve the performance of TabNet while using fewer parameters, in particular by producing a sharper mask for feature selection.

Training and Evaluation

As in the original repository, this repository contains an example implementation of TabNet on the Forest Covertype dataset (https://archive.ics.uci.edu/ml/datasets/covertype).

To run the whole pipeline in one go, execute run.sh. Alternatively, run the steps manually as follows.

First, run python download_prepare_covertype.py to download and prepare the Forest Covertype dataset. This command creates the train.csv, val.csv, and test.csv files under the data/ directory (the directory is created if it does not exist).
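Before moving on to training, it can be useful to confirm that the preparation step produced all three files. The following is a small sketch for that check; the helper name `missing_splits` is hypothetical and not part of this repository, and only the file names and the data/ directory come from the description above.

```python
from pathlib import Path

# File names produced by download_prepare_covertype.py, as described above.
EXPECTED_SPLITS = ("train.csv", "val.csv", "test.csv")


def missing_splits(data_dir="data"):
    """Return the expected split files that are absent from data_dir."""
    root = Path(data_dir)
    return [name for name in EXPECTED_SPLITS if not (root / name).is_file()]
```

Calling `missing_splits()` from the repository root returns an empty list when all three splits are in place, and otherwise lists the files that still need to be generated.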

To run the training and evaluation pipeline, use python train_classifier.py. TensorBoard logs are written to tflog/.

For simplicity, reduced TabNet and the original TabNet share the same hyperparameters, which are defined in config/covertype.py. To train reduced TabNet, set REDUCED = True; to train the original TabNet, set REDUCED = False.
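As a rough illustration of what the REDUCED flag switches between, the sketch below encodes the two architectures described earlier in this README. Only the flag name REDUCED and the block counts come from this document; the class, function, and string names are hypothetical and do not reflect how train_classifier.py actually selects the model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureTransformerSpec:
    """Hypothetical summary of the differences between the two models."""
    num_shared: int              # feature transformers shared across decision steps
    num_decision_dependent: int  # feature transformers specific to each decision step
    mask_activation: str         # activation that produces the feature-selection mask


def select_architecture(reduced: bool) -> FeatureTransformerSpec:
    """Return the architecture summary implied by the REDUCED flag."""
    if reduced:
        return FeatureTransformerSpec(1, 1, "entmax15")
    return FeatureTransformerSpec(2, 2, "sparsemax")


REDUCED = True  # same flag name as in config/covertype.py
spec = select_architecture(REDUCED)
```

Keeping every other hyperparameter identical between the two settings means that any difference between two runs can be attributed to these three fields.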

Modifications for Other Datasets

To adapt the experiment to other tabular datasets:

1. Replace the train.csv, val.csv, and test.csv files under the data/ directory.
2. Create a new config in config/ by copying config/covertype.py, then update the numerical and categorical features and the hyperparameters for the new dataset.
3. Re-optimize the TabNet hyperparameters for the new dataset in your config.
4. Import the parameters from your new config in train_classifier.py.
5. Select the reduced TabNet architecture by setting REDUCED = True.
6. Change MODEL_NAME in your config to a name of your choice.
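When writing the new config, a first pass at separating numerical from categorical features can be automated. The sketch below uses only the standard library and assumes the new train.csv has a header row; the function name `split_feature_columns` and the example column names are hypothetical. A column is treated as numerical if every value parses as a float, so integer-coded categorical columns will be reported as numerical and need to be moved by hand.

```python
import csv


def split_feature_columns(csv_path, label_column):
    """Partition the non-label columns of a CSV into (numerical, categorical)."""
    with open(csv_path, newline="") as f:
        reader = csv.DictReader(f)
        columns = [c for c in reader.fieldnames if c != label_column]
        is_numeric = {c: True for c in columns}
        for row in reader:
            for c in columns:
                if not is_numeric[c]:
                    continue
                try:
                    float(row[c])
                except (TypeError, ValueError):
                    # Missing or non-numeric value: treat the column as categorical.
                    is_numeric[c] = False
    numerical = [c for c in columns if is_numeric[c]]
    categorical = [c for c in columns if not is_numeric[c]]
    return numerical, categorical
```

For example, `split_feature_columns("data/train.csv", "label")` would return two lists of column names that can be reviewed and then copied into the new config file.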
