Skip to content

dhiraa/tener

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tensorflow-Keras port of https://github.com/fastnlp/TENER.

The motivation behind this port is:

  • To check the claim that this architecture can perrform well in small dataset
  • To build the basic data preprocessing from scratch
  • Implement the Tener architecture in Tensorflow
  • Tryout different position strategy after mimiking the basic metrics

Checklist

  • Dataset preparation and test cases
  • Gin-config
  • Plug and play trainer for datasets and model architecture
  • Vanilla transformer model integration
  • Tener transformer model
    • Embeddings
      • Sinusoidal
      • RelativeSinusoidal
      • Character Embedding
    • Attention
      • MultiHeadNaive
      • MiltiHeadRelative
  • Tuning and Debugging
    • Vanilla Transformer Model
    • Tener Transformer Model

Setup

pip install -r requirements.txt

Module Design

  • Google Gin-Config based configuration
  • Trainer script that reads the config and selects the dataset and model to be used
  • Model class that encapsulates the Keras model layers, loss, metrics and train step
gin config file ---> trainer ---> dataset and model ---> Keras Model 

How to run?:

  • Train
cd tener/
# To use vanilla transformer architecture from Tensorflow Tutorial
python bin/trainer.py --config_file=config/vanilla_transformer.gin
# Tener absed architecture 
python bin/trainer.py --config_file=config/tener.gin
  • Test
cd tener/src/

pytest -s

# some times pytest can be picked from global installation
# breaking the pytest to use your environment related packages
/home/{user_home}/anaconda3/envs/{env}/bin/pytest -s

References:

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published