lucalin17081994/INFOMAIR

Dialog State Agent

Dialog state agent created for the course 1-GS Methods in AI Research (INFOMAIR) 2020-2021 at Utrecht University. Current functionality covers dialog-act classification (classification.py, baseline.py) and the dialogue agent itself.

Run main.py to start the text-based chatbot for the restaurant domain. classification.py contains functions to train and test classifiers.

Packages and imports

python-Levenshtein
NLTK
Numpy
Pandas
SkLearn
random
time
re

Some NLTK resources may have to be downloaded first:

'punkt': use nltk.download('punkt')
'stopwords': use nltk.download('stopwords')
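python-Levenshtein is presumably used to fuzzily match misspelled user input against known keywords (an assumption based on the dependency list); the distance it computes can be sketched in pure Python:

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits (insert, delete,
    substitute) needed to turn string a into string b."""
    # prev holds distances from the prefix a[:i-1] to every prefix of b
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]
```

For example, levenshtein("chinese", "chines") is 1, so a small distance threshold lets the agent accept near-miss spellings of cuisine types.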

dialogue_agent.py

Contains the class that builds the dialogue agent. The agent works with states, as depicted in the "STD.pdf" file. On initialization it trains a classifier from classification.py on the dialog acts from part 1a and loads the restaurant data from the database "restaurant_info.csv".

Basic usage:

    from dialogue_agent import Dialogue_Agent
    # the third parameter selects the machine learning model:
    # "nn" for a neural network, an empty string for logistic regression
    da = Dialogue_Agent("dialog_acts.dat","restaurant_info.csv","nn")
    da.start_dialogue()

Entering 'exit' will leave the dialogue loop and stop the program. The dialogue agent also keeps track of the states it has visited; these can be printed with:

    da.statelog

Extra Configurations:

The agent can be configured after starting the dialogue, using the following utterances:

    "configure formal"    # use formal sentences
    "configure informal"  # use informal sentences (default)
    "configure delay"     # add a 0.5 s delay to each system answer
    "configure no delay"  # remove the delay (default)

States: The agent starts in the initialization state and progresses the conversation and changes states to find a suitable restaurant for the user. Some states include "answer" (to suggest restaurants if it finds any) and "fill_blanks" (used to fill the preference slots).
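The state handling described above can be sketched as a simple transition table; the state names and transitions below are illustrative only, not the agent's exact implementation (see "STD.pdf" for the real diagram):

```python
# Hypothetical, simplified version of the agent's state progression.
TRANSITIONS = {
    "init": "fill_blanks",    # greet, then start collecting preferences
    "fill_blanks": "answer",  # all preference slots filled -> suggest
    "answer": "end",          # user accepts a suggestion
}

def run_states(start: str = "init") -> list:
    """Walk the transition table, recording visited states (a statelog)."""
    statelog = [start]
    state = start
    while state in TRANSITIONS:
        state = TRANSITIONS[state]
        statelog.append(state)
    return statelog
```

In the real agent the next state depends on the classified dialog act and the filled slots, so the table would map (state, act) pairs rather than single states.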

baseline.py

Implementation of two baselines:

  1. classify every utterance as the majority class
  2. classify every utterance based on self-defined keyword rules

Both baselines are scored by accuracy; to get the error rate, compute 1 - accuracy.
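The majority-class baseline amounts to always predicting the most frequent training label; a minimal pure-Python sketch (not the repo's Baseline class):

```python
from collections import Counter

def majority_baseline(train_labels, test_labels):
    """Predict the most frequent training label for every test utterance
    and return (accuracy, error)."""
    majority = Counter(train_labels).most_common(1)[0][0]
    correct = sum(1 for y in test_labels if y == majority)
    accuracy = correct / len(test_labels)
    return accuracy, 1 - accuracy  # error = 1 - accuracy
```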

Example code:

    from baseline import Baseline

    b = Baseline()
    b.open_dataset("dialog_acts.dat")
    b.split_dataset()
    #test baseline 1
    b.get_highest_label()
    b.test_highest_label()
    print(b.score())

To test the keyword rules, simply run the function:

    #test baseline 2
    b.test_keyword_rule()
    print(b.score())

To get the wrongly predicted sentences of the keyword_rule function:

    print(b.get_wrong_predictions())

To classify a user utterance interactively, run the following command:

    b.user_input()

classification.py

Splits and preprocesses the data, trains an LR or NN classifier on the training set, and evaluates it on the test set.

Usage:

    clf = Classification()
    clf.initialize_data("dialog_acts.dat")
    clf.train_lr()  # or clf.train_nn()
    clf.test_clf()  # apply to the test set

To predict a single sentence after the training phase:

    sentence = "Hi, I would like to get a suggestion"
    clf.predict(sentence)

To get wrongly classified sentences, after testing:

    wrong_preds = clf.get_wrong_predictions()
    print(wrong_preds)

Cross-validation: create an sklearn classifier and pass it to the cv function. The second parameter is a boolean indicating whether or not to oversample.

    from sklearn.linear_model import LogisticRegression

    lr = LogisticRegression(random_state=0, max_iter=200, penalty='l2')
    clf.cv(lr, False)

GridSearch:

    from sklearn.neural_network import MLPClassifier

    clf_agent = Classification()
    clf_agent.open_dataset("dialog_acts.dat")
    clf = MLPClassifier()
    clf_agent.prepare_gs()
    params = {'learning_rate': ['constant'],
              'learning_rate_init': [0.01, 0.001, 0.0001],
              'solver': ['adam'],
              'hidden_layer_sizes': [(100, 100, 100)],
              'max_iter': [100]}
    gs = clf_agent.grid_search(clf, params)
    gs.cv_results_

About

Chatbot for domain "Restaurants in the UK", developed for the course "Methods in AI" in 2020 at Utrecht University.
