GitHub - mojtabasajjadi/FarSSiM: A Farsi (Persian) Semantic Similarity Measurement Dataset (FarSSiM)

Contact Info.

mojtabasadjadi@gmail.com
https://www.linkedin.com/in/smsajjadi/
https://github.com/mojtabasajjadi/FarSSiM

Introduction

FarSSiM is the first STS dataset for the informal Persian language. It consists of about 1123 informal Farsi short text pairs. Each text pair is annotated for relatedness and semantics in meaning and for the entailment relation between the two elements. This dataset is collected by identifying paraphrases between Persian tweets.

File Structure: xlsx file

Fields

tweet 1: the first text
tweet 2: the second text
1st: the first annotator's score
2st: the second annotator's score
3st: the third annotator's score
4st: the fourth annotator's score
average: the mean of 4 annotators' score
standard deviation: the standard deviation of 4 annotators' score
variance: the variance of 4 annotators' score

Statistic

Total pairs: 1123

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
FarSSiM- A Farsi Semantic Similarity Measurement Dataset.xlsx		FarSSiM- A Farsi Semantic Similarity Measurement Dataset.xlsx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FarSSiM- A Farsi Semantic Similarity Measurement Dataset.xlsx

FarSSiM- A Farsi Semantic Similarity Measurement Dataset.xlsx

README.md

README.md

Repository files navigation

Contact Info.

Introduction

Fields

Statistic

About

Releases

Packages

mojtabasajjadi/FarSSiM

Folders and files

Latest commit

History

FarSSiM- A Farsi Semantic Similarity Measurement Dataset.xlsx

FarSSiM- A Farsi Semantic Similarity Measurement Dataset.xlsx

README.md

README.md

Repository files navigation

Contact Info.

Introduction

Fields

Statistic

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages