
# Classification with Embeddings + kNN

We use Faiss (and Elasticsearch) for embedding-based kNN classification on the agent selection task.

## Usage

### 1. Fill the Elasticsearch server with 'training' data for the different agents

Create and fill the index with `python ../elasticsearch/create_index.py ../elasticsearch/config.json --new`. Adapt the config to your choice of agents and the paths to your data.
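Such a config might be shaped roughly as below. The field names here are hypothetical and only illustrate the kind of per-agent path mapping this step describes; check `../elasticsearch/config.json` in the repo for the actual schema:

```json
{
  "index_name": "agent_selection",
  "agents": {
    "weather": "data/train/weather/",
    "news": "data/train/news/"
  }
}
```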

If you want to use Docker, you can start the server with `es_docker.sh` beforehand.

### 2. Pre-Compute Embeddings

Run `python ../faiss/create_embedding.py ../faiss/config.json` to pre-compute and save the embeddings for the training data of the index created above.
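The pre-computation step can be sketched as follows. This is a minimal stand-in for `create_embedding.py`, not the script itself: the `embed` function is a stub (a real run uses the configured Universal-Sentence-Encoder or sentence-transformers model), and the result is saved with numpy rather than as a Faiss index file.

```python
import numpy as np

def embed(texts):
    # Stub embedder: a real pipeline would call the configured
    # sentence-transformers / USE model here. We hash characters into a
    # fixed-size bag-of-characters vector just to get deterministic output.
    vecs = np.zeros((len(texts), 64), dtype=np.float32)
    for i, t in enumerate(texts):
        for ch in t.lower():
            vecs[i, ord(ch) % 64] += 1.0
    # L2-normalize so that inner product equals cosine similarity
    norms = np.linalg.norm(vecs, axis=1, keepdims=True)
    return vecs / np.maximum(norms, 1e-9)

# 'training' utterances per agent, as they might come out of the index
training_data = {
    "weather": ["how warm is it today", "will it rain tomorrow"],
    "news": ["latest headlines please", "what happened in politics"],
}

texts, labels = [], []
for agent, utterances in training_data.items():
    texts.extend(utterances)
    labels.extend([agent] * len(utterances))

embeddings = embed(texts)              # shape: (num_examples, dim)
np.save("embeddings.npy", embeddings)  # persisted for the evaluation step
```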

### 3. Evaluation

Run `python faiss_classifier.py $config.json --eval` to evaluate with the given config; see our config for an example. The agent names have to be identical to those used during index creation. In general, you only need to change the paths from the training to the test folders.

Parameters:

- `sentence_transformer_model`: either a URL for a Universal-Sentence-Encoder model (e.g. https://tfhub.dev/google/universal-sentence-encoder-qa/3) or the name of a sentence-transformers model.
- `k`: the number of retrieved neighbors whose votes enter the weighted voting for the classification.
- `weighting`: how the votes are weighted:
  - `uniform`: each vote is weighted the same (1/k)
  - `score`: each vote is weighted proportionally to the score returned by Elasticsearch
- `class_weighting`: if true, each vote is normalized by the number of training examples for that agent.
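The voting scheme these parameters describe can be sketched in pure numpy, without Faiss or Elasticsearch. `knn_classify` is a hypothetical helper (not part of the repo), and cosine similarity stands in for the Elasticsearch score:

```python
import numpy as np

def knn_classify(query_emb, train_embs, train_labels, k=3,
                 weighting="uniform", class_weighting=False):
    """Classify a query embedding by weighted kNN voting over train_embs."""
    # Cosine similarity between the query and every training embedding.
    sims = train_embs @ query_emb / (
        np.linalg.norm(train_embs, axis=1) * np.linalg.norm(query_emb))
    top = np.argsort(-sims)[:k]  # indices of the k nearest neighbors

    votes = {}
    for idx in top:
        label = train_labels[idx]
        # uniform: every vote counts 1/k; score: proportional to similarity
        w = 1.0 / k if weighting == "uniform" else float(sims[idx])
        if class_weighting:
            # normalize by the number of training examples for this agent
            w /= train_labels.count(label)
        votes[label] = votes.get(label, 0.0) + w
    return max(votes, key=votes.get)

# Toy 2-d "embeddings": two weather examples, one news example.
train = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
labels = ["weather", "weather", "news"]
```

With `weighting="uniform"` each of the k neighbors contributes equally, so the majority agent wins; with `weighting="score"` a single very close neighbor can outvote several distant ones.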