Add new dataset: GermanGovServiceRetrieval #731

malteos · 2024-05-15T14:15:14Z

GermanGovServiceRetrieval: LHM-Dienstleistungen-QA is a German question answering dataset about government services of the Munich city administration. It associates each question with a textual context that contains the answer.
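To illustrate how question/context pairs like these can be framed as a retrieval task, here is a minimal sketch. It assumes a common retrieval layout with three mappings (a corpus, the queries, and relevance judgments). The helper `build_retrieval_data`, the field names `question` and `context`, and the sample records are all hypothetical; none of them are taken from the dataset or from this PR.

```python
def build_retrieval_data(records):
    """Turn QA records into corpus / queries / relevance mappings.

    Assumes each record is a dict with "question" and "context" keys
    (hypothetical field names, chosen for illustration only).
    """
    corpus = {}          # doc_id -> context text
    queries = {}         # query_id -> question text
    relevant_docs = {}   # query_id -> {doc_id: relevance score}
    context_to_id = {}   # deduplicate contexts shared by several questions

    for i, rec in enumerate(records):
        ctx = rec["context"]
        if ctx not in context_to_id:
            doc_id = f"doc{len(context_to_id)}"
            context_to_id[ctx] = doc_id
            corpus[doc_id] = ctx
        query_id = f"q{i}"
        queries[query_id] = rec["question"]
        # The context that contains the answer is the single relevant document.
        relevant_docs[query_id] = {context_to_id[ctx]: 1}

    return corpus, queries, relevant_docs


# Invented sample records, not real dataset content.
records = [
    {"question": "Where can I renew my passport?", "context": "Passport services text"},
    {"question": "How much does a passport cost?", "context": "Passport services text"},
    {"question": "How do I register a dog?", "context": "Dog registration text"},
]
corpus, queries, relevant_docs = build_retrieval_data(records)
```

Two questions that share a context map to the same document, so the corpus can be smaller than the set of questions.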

Checklist for adding MMTEB dataset

Reason for dataset addition: Domain-specific retrieval dataset for German

malteos · 2024-05-15T14:21:05Z

Do you have time to review? @guenthermi @Muennighoff @PhilipMay

KennethEnevoldsen

I think everything looks good. Feel free to add points. You might also consider using ndcg_5 instead of 10 since the dataset is quite small.

malteos · 2024-05-15T18:22:24Z

> I think everything looks good. Feel free to add points. You might also consider using ndcg_5 instead of 10 since the dataset is quite small.

Good point. I changed the main metric.
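For context on what the cutoff changes, here is a minimal sketch of nDCG at a cutoff k, written from the standard definition rather than from the benchmark's own evaluation code. The function names, the document IDs, and the ranking are hypothetical. One way to read the suggestion above: a smaller cutoff only rewards relevant documents that appear near the top of the ranking.

```python
import math


def dcg_at_k(gains, k):
    """Discounted cumulative gain over the first k positions."""
    return sum(g / math.log2(rank + 2) for rank, g in enumerate(gains[:k]))


def ndcg_at_k(ranked_doc_ids, relevance, k):
    """nDCG@k for one query.

    ranked_doc_ids: document IDs in the order the system returned them.
    relevance: dict mapping relevant doc IDs to a positive gain.
    """
    gains = [relevance.get(doc_id, 0) for doc_id in ranked_doc_ids]
    ideal_gains = sorted(relevance.values(), reverse=True)
    ideal = dcg_at_k(ideal_gains, k)
    if ideal == 0:
        return 0.0
    return dcg_at_k(gains, k) / ideal


# Hypothetical ranking: the single relevant document appears at position 7.
ranking = [f"doc{i}" for i in range(1, 11)]
relevance = {"doc7": 1}

score_at_5 = ndcg_at_k(ranking, relevance, 5)    # relevant doc is outside the top 5
score_at_10 = ndcg_at_k(ranking, relevance, 10)  # relevant doc counts, but discounted
```

In this example the same ranking scores 0 at a cutoff of 5 and 1/3 at a cutoff of 10, since the gain at position 7 is discounted by log2(8) = 3.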

PhilipMay · 2024-05-17T06:57:35Z

> Do you have time to review? @guenthermi @Muennighoff @PhilipMay

Hey @malteos. I have almost no knowledge of MTEB and have never implemented anything here. Sorry.
Maybe @rasdani could have a look?

KennethEnevoldsen

Have enabled auto-merge and updated the points. Let me know if you disagree. Thanks for the addition!

docs/mmteb/points/731.jsonl

malteos and others added 2 commits May 15, 2024 16:13

Add new dataset: GermanGovServiceRetrieval

84c79b9

Merge branch 'main' into germangovservice

eab455d

KennethEnevoldsen approved these changes May 15, 2024

malteos and others added 2 commits May 15, 2024 20:12

Merge branch 'main' into germangovservice

52732e4

points for GermanGovServiceRetrieval; changed main metric

c6c3404

malteos requested a review from KennethEnevoldsen May 15, 2024 18:54

isaac-chung assigned KennethEnevoldsen May 15, 2024

Merge branch 'main' into germangovservice

ee8c723

KennethEnevoldsen approved these changes May 17, 2024

docs/mmteb/points/731.jsonl (outdated, resolved)

KennethEnevoldsen added 2 commits May 17, 2024 10:37

Update docs/mmteb/points/731.jsonl

e4e3bd8

Merge branch 'main' into germangovservice

78a67eb

KennethEnevoldsen enabled auto-merge (squash) May 17, 2024 08:38

KennethEnevoldsen merged commit 66792ef into embeddings-benchmark:main May 17, 2024
7 checks passed

malteos mentioned this pull request May 22, 2024

Adding contributor information #794

Merged
