
pauli31/czech-subjectivity-dataset


Czech Subjectivity Dataset

This is the repository for the newly created Czech Subjectivity Dataset (Subj-CS) and our paper:

Czech Dataset for Cross-lingual Subjectivity Classification

Accepted to the LREC 2022 conference.

Dataset Download:

The Czech Subjectivity Dataset is available for download at https://drive.google.com/file/d/1R0bPPWJ7sdIaCxyPrO_rmTVFNNsd9RaI/view?usp=sharing

The dataset is also available on HuggingFace Datasets.
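For the HuggingFace route, a minimal loading sketch follows. It assumes the `datasets` library is installed and that the dataset identifier mirrors the GitHub repository name. That identifier, the constant `ASSUMED_DATASET_ID`, and the helper `load_subj_cs` are illustrative assumptions, not names taken from this repository, so check the dataset page on the HuggingFace Hub before relying on them.

```python
# Hypothetical identifier: assumed to mirror the GitHub repository name.
# Verify the actual id on the HuggingFace Hub before use.
ASSUMED_DATASET_ID = "pauli31/czech-subjectivity-dataset"


def load_subj_cs(dataset_id: str = ASSUMED_DATASET_ID):
    """Load the dataset from the HuggingFace Hub.

    Requires `pip install datasets` and network access.
    """
    # Imported lazily so that defining this helper does not need the dependency.
    from datasets import load_dataset

    # Without a `split` argument, every available split is returned.
    return load_dataset(dataset_id)
```

Calling `load_subj_cs()` and printing the result lists the available splits and columns, which is safer than assuming their names in advance.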

Usage:

We will add usage and setup soon.

python3 baseline.py...

Setup:

Create a conda environment:

  1. Clone the GitHub repository

    git clone git@github.com:pauli31/czech-subjectivity-dataset.git
    
  2. Set up conda

  3. Set up the data

License:

The dataset and code can be freely used for academic and research purposes. It is strictly prohibited to use the dataset for any commercial purpose.

Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Publication:

If you use our dataset or software for academic research, please cite our paper:

@inproceedings{priban-steinberger-2022-czech,
    title = "{C}zech Dataset for Cross-lingual Subjectivity Classification",
    author = "P{\v{r}}ib{\'a}{\v{n}}, Pavel  and
      Steinberger, Josef",
    booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
    month = jun,
    year = "2022",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://aclanthology.org/2022.lrec-1.148",
    pages = "1381--1391",
}

Contact:

pribanp@kiv.zcu.cz

http://nlp.kiv.zcu.cz
