- filter_queries.py - keeps only the important queries from the AOL-user-ct-collection dataset
- diff_training_validation.py - removes queries duplicated between the training and validation sets
- parametrize_queries.py - creates the XML file that the Indri indexing and query engine needs to run queries
- docs_sparse.py - creates a compressed sparse representation of the corpus documents
- queries_sparse.py - creates a compressed sparse representation of the training queries
- embeddings.py - creates word embeddings of the documents
- query_embeddings.py - creates word embeddings of the training queries
- score_sparse.py - uses the sparse representation as input for the score neural network
- rank_embed.py - uses the embeddings as input for the rank neural network
- bm25_baseline.py - runs BM25 on the queries and corpus
- score_model.py - uses the dense representation as input for the score neural network
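
As a rough illustration of what a BM25 baseline computes, here is a minimal sketch of the standard Okapi BM25 scoring formula in plain Python. It is not the code in bm25_baseline.py: the function name `bm25_scores`, the defaults `k1=1.2` and `b=0.75`, and the pre-tokenized list-of-lists input are all assumptions made for this example.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Return one BM25 score per document for a tokenized query.

    `query` is a list of terms and `docs` is a non-empty list of
    token lists. Both the input format and the k1/b defaults are
    assumptions of this sketch, not taken from the repository.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: how many documents contain each term.
    df = Counter(term for d in docs for term in set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed IDF variant that never goes negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturated by k1 and normalized by length via b.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores
```

Sorting the documents by descending score gives the baseline ranking for a query. A document that shares no term with the query scores exactly 0.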
mikvrax/TrecingLab