DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization

This repository contains the authors' implementation of the ACL 2022 paper "DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization."

Requirements

  • Install PyTorch from the official website.
  • Install dependencies via pip install -r requirements.txt.
  • The teacher model should be available locally, e.g., download it manually from the Hugging Face model hub (see the sketch below this list).
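
For example, the ainize/bart-base-cnn teacher used in the sample command below can be fetched ahead of time with git (a minimal sketch, assuming git-lfs is installed; the Hugging Face CLI works as well):

    git lfs install
    git clone https://huggingface.co/ainize/bart-base-cnn

You can then pass the resulting local path (e.g., ./bart-base-cnn) to --model_name_or_path to avoid downloading at training time.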

Sample Command

  • The following command trains an 8-8-8 (8-bit quantized), 3-1 (3 encoder layers, 1 decoder layer) model on the CNN/DailyMail dataset. You may use accelerate for distributed training; see the example after the command.
    python3 run_summarization_no_trainer.py \
      --model_name_or_path ainize/bart-base-cnn \
      --dataset_name cnn_dailymail \
      --dataset_config_name 3.0.0 \
      --pred_distill \
      --intermediate_distill \
      --num_train_epochs 20 \
      --weight_bits 8 \
      --do_train \
      --do_test \
      --distill_encoder 3 \
      --distill_decoder 1 \
      --learning_rate 3e-5 
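
  • To train on multiple GPUs, the same script can be launched through accelerate instead of plain python3. A minimal sketch, keeping the flags from the command above (run accelerate config once beforehand to describe your hardware):
    accelerate config
    accelerate launch run_summarization_no_trainer.py \
      --model_name_or_path ainize/bart-base-cnn \
      --dataset_name cnn_dailymail \
      --dataset_config_name 3.0.0 \
      --pred_distill \
      --intermediate_distill \
      --num_train_epochs 20 \
      --weight_bits 8 \
      --do_train \
      --do_test \
      --distill_encoder 3 \
      --distill_decoder 1 \
      --learning_rate 3e-5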

Citation

You may cite our work using

@inproceedings{li2022dqbart,
  title={DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization},
  author={Li, Zheng and Wang, Zijian and Tan, Ming and Nallapati, Ramesh and Bhatia, Parminder and Arnold, Andrew and Xiang, Bing and Roth, Dan},
  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
  pages={203--211},
  year={2022}
}

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.
