nicolay-r/Reasoning-for-Sentiment-Analysis-Framework


Reasoning for Sentiment Analysis • twitter

Open In Colab arXiv

Update 22 May 2024: ⚠️ In GoogleColab you might find zero evaluation results on the test part (see issue #8), which is due to the locked availability of labels 🔒. In order to evaluate your model on the test part, please proceed with the official RuSentNE-2023 codalab competition page or at GitHub.

Update 06 March 2024: 🔓 attrdict was the main limitation for launching the code in Python 3.10 and has hence been switched to addict (see Issue #7).

🔥 Update 24/04/2024: We released fine-tuning logs for the prompt-based and THoR-based techniques applied to the competition train data, as well as checkpoints for downloading. More ...

💻 Update 19/04/2024: We opened the quick_cot code repository for launching quick CoT zero-shot-learning / few-shot-learning experiments with LLMs, utilized in these studies. More ...

📊 Update 19/04/2024: We opened a separate 📊 👉 RuSentNE-benchmark repository 👈 📊 for LLM responses, including answers on reasoning steps in THoR CoT for the ChatGPT model series. More ...

Studies and a collection of LLM-based reasoning frameworks for Target Sentiment Analysis. This repository contains the source code for the paper published in the LJoM journal, titled: Large Language Models in Targeted Sentiment Analysis for Russian.

Contents

Installation

We separate the dependencies necessary for zero-shot and fine-tuning experiments:

pip install -r dependencies_zs.txt
pip install -r dependencies_ft.txt

Preparing Data

Simply launch the following script to obtain both the original texts and their translated versions:

python rusentne23_download.py
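
Once downloaded, the splits can be quickly inspected; a minimal sketch with pandas, assuming the script stores the CSV files under data/ (the file names match those used by the translation commands below):

import pandas as pd

# File names follow the translation commands below; the column layout
# is not documented here, so print it rather than assuming it.
for split in ("train_data", "valid_data", "final_data"):
    df = pd.read_csv(f"data/{split}.csv")
    print(split, df.shape, list(df.columns))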

Manual Data Translation

You can launch manual data translation into English (en) via GoogleTrans:

 python rusentne23_translate.py --src "data/train_data.csv" --lang "en" --label
 python rusentne23_translate.py --src "data/valid_data.csv" --lang "en" --label
 python rusentne23_translate.py --src "data/final_data.csv" --lang "en"
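
Per text, this boils down to a single GoogleTrans call; a minimal sketch, assuming the googletrans package (the example sentence is illustrative):

from googletrans import Translator

translator = Translator()

def to_english(text):
    # Translate one string into English via the GoogleTrans API.
    return translator.translate(text, dest="en").text

print(to_english("Контракт был подписан в Москве."))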

Zero-Shot

Open In Colab

This is a common script for launching LLM inference in zero-shot format using manual or predefined prompts:

python zero_shot_infer.py \
    --model "google/flan-t5-base" \
    --src "data/final_data_en.csv" \
    --prompt "rusentne2023_default_en" \
    --device "cpu" \
    --to "csv" \
    --temp 0.1 \
    --output "data/output.csv" \
    --max-length 512 \
    --hf-token "<YOUR_HUGGINGFACE_TOKEN>" \
    --openai-token "<YOUR_OPENAI_TOKEN>" \
    --limit 10000 \
    --limit-prompt 10000 \
    --bf16 \
    --l4b
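
For reference, the zero-shot setup reduces to prompt formatting plus a single generate call; a minimal sketch with transformers, assuming a Flan-T5 checkpoint (the prompt wording here is illustrative, not the predefined rusentne2023_default_en one):

import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

# Illustrative prompt; the repository ships its own predefined prompts.
prompt = "What is the sentiment towards {target} in: '{text}'? Choose from: positive, negative, neutral."
inputs = tokenizer(prompt.format(target="Apple", text="Apple shares surged after the report."),
                   return_tensors="pt")
with torch.no_grad():
    output = model.generate(**inputs, max_length=512, do_sample=True, temperature=0.1)
print(tokenizer.decode(output[0], skip_special_tokens=True))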


Usage Examples

Chat mode

Simply set up the model name and the device you wish to use for launching the model.

python zero_shot_infer.py --model google/flan-t5-base --device cpu

Inference with the predefined prompt

Use the --prompt argument to pass a predefined prompt name or a textual prompt that involves the {text} placeholder.

python zero_shot_infer.py --model google/flan-t5-small \
    --device cpu --src data/final_data_en.csv --prompt 'rusentrel2023_default_en'
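
A textual prompt is, in effect, treated as a format string; for example (the prompt wording is illustrative):

# {text} is substituted with each input text before inference.
prompt = "Classify the sentiment of the following sentence: {text}"
print(prompt.format(text="The service was excellent."))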

OpenAI models

Use the --model parameter prefixed with openai:, followed by the model name, as follows:

python zero_shot_infer.py --model "openai:gpt-3.5-turbo-1106" \
    --src "data/final_data_en.csv" --prompt "rusentrel2023_default_en_short" \
    --max-length 75 --limit 5
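
When the openai: prefix is used, inference presumably goes through the OpenAI chat API instead of a local model; a minimal sketch of the equivalent call, assuming the official openai Python client (the prompt is illustrative):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo-1106",
    max_tokens=75,
    messages=[{
        "role": "user",
        # Illustrative prompt; the repository defines its own predefined prompts.
        "content": "What is the sentiment towards Apple in: 'Apple shares surged.'? positive, negative or neutral?",
    }],
)
print(response.choices[0].message.content)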

Zero-Shot Chain-of-Thought

This functionality is out of the scope of this repository.

We release a tiny framework, dubbed quick_cot, for applying CoT schemas, with an API similar to the one in the Zero-Shot section, based on schemas written in JSON notation.

πŸ“ πŸ‘‰ thor-zero-shot-cot-english-shema.json πŸ‘ˆ

💻 👉 Tiny CoT-framework (quick_cot) 👈
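
The actual schema format is the one in the linked JSON file; purely as a hypothetical illustration, a three-hop THoR-style schema chains prompts so that each step consumes the previous answer:

{
  "_comment": "Hypothetical structure for illustration only; see thor-zero-shot-cot-english-shema.json for the actual format.",
  "steps": [
    {"prompt": "Given the sentence '{text}', which specific aspect of '{target}' is mentioned?", "out": "aspect"},
    {"prompt": "Given the aspect '{aspect}', what is the underlying opinion towards '{target}' in '{text}'?", "out": "opinion"},
    {"prompt": "Based on the opinion '{opinion}', what is the sentiment towards '{target}': positive, negative or neutral?", "out": "label"}
  ]
}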

Fine-tuned Flan-T5

👉 Prompt-Fine-Tuning Logs

👉 THoR-Fine-Tuning Logs

Model          prompt                THoR
FlanT5-base    -                     [Google Drive Link]
FlanT5-large   -                     [Google Drive Link]
FlanT5-xl      [Google Drive Link]   [Google Drive Link]
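
To reuse a downloaded checkpoint outside the training script, it can presumably be restored into the matching Flan-T5 architecture; a minimal sketch, assuming the checkpoint is a PyTorch state dict (the file name is illustrative):

import torch
from transformers import T5ForConditionalGeneration

# The base architecture must match the downloaded checkpoint (base/large/xl).
model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-base")
state = torch.load("flan-t5-base-thor.pt", map_location="cpu")  # illustrative file name
model.load_state_dict(state.get("model", state))  # accept either a wrapped or a raw state dict
model.eval()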

Three-Hop Chain-of-Thought (THoR)

Open In Colab

python thor_finetune.py -r "thor" -d "rusentne2023" \
    -li <PRETRAINED_STATE_INDEX> \
    -bs <BATCH_SIZE> \
    -es <EPOCH_SIZE> \
    -f "./config/config.yaml"

Parameters list

  • -c, --cuda_index: index of the GPU to use for computation (default: 0).
  • -d, --data_name: name of the dataset (rusentne2023).
  • -r, --reasoning: specifies the reasoning mode (engine): single prompt or multi-step thor mode.
  • -li, --load_iter: loads the state at the given index from the same data_name resource (default: -1, i.e. not applicable).
  • -es, --epoch_size: number of training epochs (default: 1).
  • -bs, --batch_size: size of the batch (default: None).
  • -t, --temperature: temperature (default: gen_config.temperature).
  • -z, --zero_shot: runs zero-shot inference with the chosen engine on the test dataset to form answers; see the usage example below.
  • -f, --config: specifies the location of the config.yaml file.

Configure more parameters in the config.yaml file.
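
For instance, to load a previously saved state and run zero-shot inference on the test set with the documented flags (argument values are illustrative):

python thor_finetune.py -r "thor" -d "rusentne2023" -li 1 -z -f "./config/config.yaml"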

Answers

Results of the zero-shot models obtained during the experiments fall outside the scope of this repository. We provide a separate RuSentNE-benchmark repository for LLM responses, including answers on reasoning steps in THoR CoT for the ChatGPT model series.

References

You can cite this work as follows:

@misc{rusnachenko2024large,
      title={Large Language Models in Targeted Sentiment Analysis}, 
      author={Nicolay Rusnachenko and Anton Golubev and Natalia Loukachevitch},
      year={2024},
      eprint={2404.12342},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}