Skip to content

phellonchen/awesome-visual-dialog

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation

awesome-Visual Dialog

Recent Advances in Visual Dialog Maintained by Feilong Chen. Last update on 2022/08/19.

Table of Contents

Image-based Visual Dialog

Visual Dialog

  1. Visual Dialog, CVPR 2017, [code]

  2. Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model, NIPS 2017, [code]

  3. Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning, CVPR 2018

  4. Image-Question-Answer Synergistic Network for Visual Dialog, CVPR 2019

  5. Reasoning Visual Dialogs with Structural and Partial Observations, CVPR, 2019, [code]

  6. Recursive Visual Attention in Visual Dialog, CVPR 2019, [code]

  7. Dual Visual Attention Network for Visual Dialog, IJCAI 2019

  8. Making History Matter: History-Advantage Sequence Training for Visual Dialog, ICCV 2019

  9. Granular Multimodal Attention Networks for Visual Dialog, ICCV Workshop 2019

  10. Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog, ACL 2019

  11. Dual Attention Networks for Visual Reference Resolution in Visual Dialog, EMNLP 20219, []code

  12. DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog, AAAI 2020, [code]

  13. Modality-Balanced Models for Visual Dialogue, AAAI 2020

  14. DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue, AAAI 2020, [code]

  15. Two Causal Principles for Improving Visual Dialog, CVPR 2020, [code]

  16. DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue, IJCAI 2020, [code]

  17. KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue, ACM MM 2020

  18. Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline, ECCV 2020, [code]

  19. Visual Dialog: Light-weight Transformer for Many Inputs, ECCV 2020, [code]

  20. Multi-View Attention Network for Visual Dialog, ACL 2020, [code]

  21. History for Visual Dialog: Do we really need it?, ACL 2020, [code]

  22. VD-BERT: A Unified Vision and Dialog Transformer with BERT, EMNLP 2020, [code]

  23. GoG: Graph-over-Graph Network for Visual Dialog, ACL Findings 2021

  24. Multimodal Incremental Transformer for Visual Dialogue Generation, ACL Findings 2021

  25. Learning to Ground Visual Objects for Visual Dialog, EMNLP Findings 2021

  26. VU-BERT: A Unified framework for Visual Dialog, ICASSP 2022

  27. Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning, ICASSP 2022

  28. UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog, CVPR 2022

  29. Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog, ACM MM 2022

GuessWhat

GuessWhat?! Visual object discovery through multi-modal dialogue, CVPR 2017, [code]

GuessWhich

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning, ICCV 2017, [code]

Video-based Visual Dialog

Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog, AAAI 2020, [code]

Other Resources