nlu_tools

NLU tools for sentence semantics matching and classification in tensorflow with simple software architecture(base on Tensorflow Project Template).

So, here's a simple tensorflow template that help you get into your main project faster and just focus on your core (Model, Training, ...etc)

Usage

build data

Run command in ./main directory, but you may can skip this process since there is preprocessed data in ./data/xx

python run.py -c ../configs/smp/cnn.json -s build_data

train model

Run command in ./main directory

python run.py -c ../configs/smp/cnn.json

using tensorboard

 tensorboard  --logdir=./experiments/smp_cnn/summary/

In a Nutshell

In a nutshell here's how to use this template, so for example assume you want to implement CNN model for text classification so you should do the following:

In models folder create a class named CNN that inherit the "base_model" class

    class CNN(BaseModel):
        def __init__(self, config):
            super(CNN, self).__init__(config)
            #call the build_model and init_saver functions.
            self.build_model() 
            self.init_saver()

Override these two functions "build_model" where you implement the cnn model, and "init_saver" where you define a tensorflow saver, then call them in the initalizer.

     def build_model(self):
        # here you build the tensorflow graph of any model you want and also define the loss.
        pass
            
     def init_saver(self):
        # here you initalize the tensorflow saver that will be used in saving the checkpoints.
        self.saver = tf.train.Saver(max_to_keep=self.config.max_to_keep)

In trainers folder create a Classify trainer that inherit from "base_train" class

    class ClassifyTrainer(BaseTrain):
        def __init__(self, sess, model, data, config, logger):
            super(ClassifyTrainer, self).__init__(sess, model, data, config, logger)

Override these two functions "train_step", "train_epoch" where you write the logic of the training process

    def train_epoch(self):
        """
       implement the logic of epoch:
       -loop on the number of iterations in the config and call the train step
       -add any summaries you want using the summary
        """
        pass

    def train_step(self):
        """
       implement the logic of the train step
       - run the tensorflow session
       - return any metrics you need to summarize
       """
        pass

Config module above in a config file for this model(full example file in configs/smp/cnn.json).

{
  "task_config": {
    "name": "models.classify.cnn.CNN",
    "trainer": "trainers.classify_trainer.ClassifyTrainer",
    "data_generator": "data_loader.sent_classify_loader.SMPGenerator"
  },
  ...(other configs)
}

You will find a template file and a simple example in the model and trainer folder that shows you how to try your first model simply.

In Details

Project architecture

Folder structure

├──  base
│   ├── base_model.py   - this file contains the abstract class of the model.
│   └── base_train.py   - this file contains the abstract class of the trainer.
│
│
├── model               - this folder contains any model of your project.
│   └── example_model.py
│
│
├── trainer             - this folder contains trainers of your project.
│   └── example_trainer.py
│   
├──  mains              - here's the main(s) of your project (you may need more than one main).
│    └── example_main.py  - here's an example of main that is responsible for the whole pipeline.

│  
├──  data _loader  
│    └── data_generator.py  - here's the data_generator that is responsible for all data handling.
│ 
└── utils
     ├── logger.py
     └── any_other_utils_you_need

Main Components

Models

Base model

Base model is an abstract class that must be Inherited by any model you create, the idea behind this is that there's much shared stuff between all models. The base model contains:
- Save -This function to save a checkpoint to the desk.
- Load -This function to load a checkpoint from the desk.
- Cur_epoch, Global_step counters -These variables to keep track of the current epoch and global step.
- Init_Saver An abstract function to initialize the saver used for saving and loading the checkpoint, Note: override this function in the model you want to implement.
- Build_model Here's an abstract function to define the model, Note: override this function in the model you want to implement.
Your model

Here's where you implement your model. So you should :
- Create your model class and inherit the base_model class
- override "build_model" where you write the tensorflow model you want
- override "init_save" where you create a tensorflow saver to use it to save and load checkpoint
- call the "build_model" and "init_saver" in the initializer.

Trainer

Base trainer

Base trainer is an abstract class that just wrap the training process.
Your trainer

Here's what you should implement in your trainer.
1. Create your trainer class and inherit the base_trainer class.
2. override these two functions "train_step", "train_epoch" where you implement the training process of each step and each epoch.

Data Loader

This class is responsible for all data handling and processing and provide an easy interface that can be used by the trainer.

Logger

This class is responsible for the tensorboard summary, in your trainer create a dictionary of all tensorflow variables you want to summarize then pass this dictionary to logger.summarize().

Configuration

I use Json as configuration method and then parse it, so write all configs you want then parse it using "utils/config/process_config" and pass this configuration object to all other objects.

Main

Here's where you combine all previous part.

Parse the config file.
Create a tensorflow session.
Create an instance of "Model", "Data_Generator" and "Logger" and parse the config to all of them.
Create an instance of "Trainer" and pass all previous objects to it.
Now you can train your model by calling "Trainer.train()"

Future Work

add new model

Contributing

Any kind of enhancement or contribution is welcomed.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
base		base
baselines		baselines
configs		configs
data		data
data_loader		data_loader
figures		figures
mains		mains
models		models
trainers		trainers
utils		utils
.gitignore		.gitignore
README.md		README.md

zqhZY/nlu_tools

Folders and files

Latest commit

History

Repository files navigation

nlu_tools

Table Of Contents

Usage

build data

train model

using tensorboard

In a Nutshell

In Details

Project architecture

Folder structure

Main Components

Models

Base model

Your model

Trainer

Base trainer

Your trainer

Data Loader

Logger

Configuration

Main

Future Work

Contributing

About

Topics

Resources

Stars

Watchers

Forks

Languages