It Takes Two to Tango: Towards Theory of AI's Mind

Authors

Arjun Chandrasekaran, Deshraj Yadav, Prithvijit Chattopadhyay, Viraj Prabhu, Devi Parikh

Abstract

Theory of Mind is the ability to attribute mental states (beliefs, intents, knowledge, perspectives, etc.) to others and recognize that these mental states may differ from one's own. Theory of Mind is critical to effective communication and to teams demonstrating higher collective performance. To effectively leverage the progress in Artificial Intelligence (AI) to make our lives more productive, it is important for humans and AI to work well together in a team. Traditionally, there has been much emphasis on research to make AI more accurate, and (to a lesser extent) on having it better understand human intentions, tendencies, beliefs, and contexts. The latter involves making AI more human-like and having it develop a theory of our minds.

In this work, we argue that for human-AI teams to be effective, humans must also develop a theory of AI's mind -- get to know its strengths, weaknesses, beliefs, and quirks.

We instantiate these ideas within the domain of Visual Question Answering (VQA). We find that using just a few examples (50), lay people can be trained to better predict the responses and oncoming failures of a complex VQA model. Surprisingly, we find that having access to the model's internal states -- its confidence in its top-k predictions, and explicit or implicit attention maps that highlight the regions in the image (and words in the question) the model is looking at (and listening to) while answering a question about an image -- does not help people better predict its behavior.

arXiv Link

https://arxiv.org/abs/1704.00717

Citing this work

If you find this work useful in your research, please consider citing:

@article{toaim,
  author = {Arjun Chandrasekaran and Deshraj Yadav and Prithvijit Chattopadhyay and Viraj Prabhu and Devi Parikh},
  title = {{I}t {T}akes {T}wo to {T}ango: {T}owards {T}heory of {AI}'s {M}ind},
  journal = {arXiv preprint arXiv:1704.00717},
  year = {2017}
}

License

MIT
