Loss Re-scaling VQA: Revisiting the Language Prior Problem from a Class-imbalance View

This repository is built upon the code. In this paper, we propose a simple loss re-weighting method for tackling the language prior problem in VQA. This repo implements methods on VQA v1 and v2, VQA-CP v1 and v2. I believe everything is simplified in this code and you can easily add your own module!

The LXMERT version can be found at another repo - https://github.com/guoyang9/LXMERT-VQACP.

Almost all flags can be set by yourself at utils/config.py!

This repo also re-implements CSS and LMH, please feel free to take a try.

Repo-download

git clone --recursive https://github.com/guoyang9/class-imbalance-VQA.git

Prerequisites

* python==3.7.7
* nltk==3.4
* bcolz==1.2.1
* tqdm==4.31.1
* numpy==1.18.4
* pytorch==1.4.0
* tensorboardX==2.1
* torchvision==0.6.0

Dataset

First of all, make all the data in the right position according to the utils/config.py!

Please download the VQA-CP datasets in the original paper.
The image features can be found at the UpDn repo.
The pre-trained Glove features can be accessed via GLOVE.

Pre-processing

process questions and dump dictionary:
```
python tools/create_dictionary.py
```
process answers and question types:
```
python tools/compute_softscore.py
```

convert image features to h5:

python tools/detection_features_converter.py

Model Training

python main.py --loss-fn Plain --name test-VQA --gpu 0

Note that the loss re-scaling works w/. or w/o. the answer mask pre-training module.

The script for fine-tuning and re-implement our results:

set use_mask = True and use_miu = False in config.py:

python main.py --loss-fn Plain --name test-VQA --gpu 0

fine-tune with the loss re-scaling approach, set both flags with True:

python main.py --loss-fn Plain --fine-tune --name test-VQA --name-new fine_tune --gpu 0

Model Evaluation

python main.py --loss-fn Plain --name test-VQA --eval-only

Citation

Please cite the following article if you find this code helpful:

@article{rescale-vqa,
  title={Loss Re-scaling VQA: Revisiting the Language Prior Problem from a Class-imbalance View},
  author={Guo, Yangyang and Nie, Liqiang and Cheng, Zhiyong and Tian, Qi and Zhang, Min},
  journal={IEEE TIP},
  year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
logs		logs
modules		modules
tools		tools
utils		utils
vqa_eval		vqa_eval
.gitignore		.gitignore
README.md		README.md
acc_per_type.py		acc_per_type.py
main.py		main.py
train-css.py		train-css.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

logs

logs

modules

modules

tools

tools

utils

utils

vqa_eval

vqa_eval

.gitignore

.gitignore

README.md

README.md

acc_per_type.py

acc_per_type.py

main.py

main.py

train-css.py

train-css.py

train.py

train.py

Repository files navigation

Loss Re-scaling VQA: Revisiting the Language Prior Problem from a Class-imbalance View

Repo-download

Prerequisites

Dataset

Pre-processing

Model Training

Model Evaluation

Citation

About

Releases

Packages

Languages

guoyang9/class-imbalance-VQA

Folders and files

Latest commit

History

Repository files navigation

Loss Re-scaling VQA: Revisiting the Language Prior Problem from a Class-imbalance View

Repo-download

Prerequisites

Dataset

Pre-processing

Model Training

Model Evaluation

Citation

About

Topics

Resources

Stars

Watchers

Forks

Languages