Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unclear in general how to use classification models requiring sentence pairs #14107

Open
alex2awesome opened this issue Dec 23, 2023 · 0 comments
Assignees

Comments

@alex2awesome
Copy link

Link to the documentation pages (if available)

There are many NLP tasks that classify pairs of text. E.g. NLI, Stance Detection, Question-Similarity.

It's very unclear how to use these models, many of which exist on SparkNLPHub, in a real pipeline.

Here's an example of great documentation: https://sparknlp.org/2021/08/25/classifierdl_electra_questionpair_pipeline_en.html

it's clear how to format q1 and q2.

However, here's an example of bad documentation:
https://sparknlp.org/2023/09/18/albert_xxlarge_v2_snli_mnli_fever_anli_r1_r2_r3_nli_en.html
https://sparknlp.org/2023/11/29/roberta_classifier_large_snli_mnli_fever_anli_r1_r2_r3_nli_en.html

The first has no string mentioned, even. The second just says "PUT YOUR STRING HERE". I have no idea how to format my input.

How could the documentation be improved?

In general, having a uniform way to treat sentence pairs would be great.

In these specific cases, I would love to have an example of real input.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants