The Wisdom of Hindsight Makes Language Models Better Instruction Followers

Intro

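This is an implementation of the paper "The Wisdom of Hindsight Makes Language Models Better Instruction Followers" (arXiv:2302.05206).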

Citation

If you use this code in your own work, please cite our paper:

@article{zhang2023wisdom,
  title={The Wisdom of Hindsight Makes Language Models Better Instruction Followers},
  author={Zhang, Tianjun and Liu, Fangchen and Wong, Justin and Abbeel, Pieter and Gonzalez, Joseph E},
  journal={arXiv preprint arXiv:2302.05206},
  year={2023}
}

Installation

Install BigBench

# If you plan to create a new task, clone your fork of BIG-bench instead
git clone https://github.com/google/BIG-bench.git
cd BIG-bench
python setup.py sdist
pip install -e .

Set BIG_BENCH_DIR in utils.py to the installation path of BigBench (the BIG-bench directory cloned above).
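A wrong path here tends to surface only later, once a task fails to load. The snippet below is a minimal sanity check, not part of this repository: the helper name looks_like_bigbench_checkout is hypothetical, and it assumes that BIG_BENCH_DIR should point at the root of the cloned BIG-bench repository, which contains a bigbench/benchmark_tasks directory.

```python
from pathlib import Path


def looks_like_bigbench_checkout(big_bench_dir: str) -> bool:
    """Return True if the path contains bigbench/benchmark_tasks.

    Hypothetical helper: it assumes BIG_BENCH_DIR is the root of the
    cloned BIG-bench repository, not a subdirectory of it.
    """
    tasks_dir = Path(big_bench_dir).expanduser() / "bigbench" / "benchmark_tasks"
    return tasks_dir.is_dir()


if __name__ == "__main__":
    import sys

    # Usage: python check_bigbench.py /path/to/BIG-bench
    candidate = sys.argv[1] if len(sys.argv) > 1 else "."
    status = "ok" if looks_like_bigbench_checkout(candidate) else "not a BIG-bench checkout"
    print(f"{candidate}: {status}")
```

Run it with the same path you put in utils.py before starting a long training job.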

# Install other dependencies
conda env create -f conda_env.yml
conda activate hir

Train FLAN-T5 on BigBench Tasks

Set MODEL_TYPE in utils.py to the desired model (e.g. google/flan-t5-xl).

Set TASK to the desired BigBench task (e.g. logical_deduction_five_objects). Then run iterative sampling and training to get the results:

bash run.sh
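To make "iterative sampling and training" concrete, here is a toy sketch of the control flow only. It is not the code that run.sh launches: the function iterate_sampling_and_training and the sample and train callables are hypothetical stand-ins, and no model is involved.

```python
from typing import Callable, List, Tuple

# (prompt, model output) pair; the real data format is not shown in this README.
Example = Tuple[str, str]


def iterate_sampling_and_training(
    sample: Callable[[int], List[Example]],
    train: Callable[[List[Example]], None],
    num_rounds: int,
) -> List[int]:
    """Alternate a sampling phase and a training phase for num_rounds rounds.

    Returns the number of examples collected in each round.
    """
    sizes = []
    for round_idx in range(num_rounds):
        batch = sample(round_idx)  # sampling phase: query the current model
        train(batch)               # training phase: update on the fresh samples
        sizes.append(len(batch))
    return sizes


if __name__ == "__main__":
    seen: List[Example] = []

    def toy_sample(round_idx: int) -> List[Example]:
        # Stand-in for generating outputs from a language model.
        return [(f"prompt-{round_idx}-{i}", f"output-{i}") for i in range(2)]

    def toy_train(batch: List[Example]) -> None:
        # Stand-in for a fine-tuning step; here it only records the data.
        seen.extend(batch)

    print(iterate_sampling_and_training(toy_sample, toy_train, num_rounds=3))
```

The point of the sketch is that each round trains on data sampled from the model produced by the previous round, so the two phases cannot be run once and then decoupled.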

Note: when running multiple experiments on one node, assign each run a different port number by changing random_port in run.sh.
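One way to pick a port that is not already in use is to ask the operating system for one. This is a small sketch under that assumption: the helper find_free_port is hypothetical and not part of this repository, and how run.sh consumes random_port is not shown here.

```python
import socket


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for a currently unused TCP port and return its number.

    Binding to port 0 lets the kernel choose a free port. The socket is
    closed before returning, so another process could in principle grab
    the port before the experiment starts.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        return sock.getsockname()[1]


if __name__ == "__main__":
    # Print a candidate value to use for random_port in run.sh.
    print(find_free_port())
```

Because of the small race window noted in the docstring, launch the run soon after choosing the port, and pick a new one if the launch reports that the address is already in use.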
