Skip to content

deshraj/TOAIM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

It Takes Two to Tango: Towards Theory of AI's Mind

Authors

Arjun Chandrashekharan, Deshraj Yadav, Prithvijit Chattopadhyay, Viraj Prabhu, Devi Parikh

Abstract

Theory of Mind is the ability to attribute mental states (beliefs, intents, knowledge, perspectives, etc.) to others and recognize that these mental states may differ from one's own. Theory of Mind is critical to effective communication and to teams demonstrating higher collective performance. To effectively leverage the progress in Artificial Intelligence (AI) to make our lives more productive, it is important for humans and AI to work well together in a team. Traditionally, there has been much emphasis on research to make AI more accurate, and (to a lesser extent) on having it better understand human intentions, tendencies, beliefs, and contexts. The latter involves making AI more human-like and having it develop a theory of our minds.

In this work, we argue that for human-AI teams to be effective, humans must also develop a theory of AI's mind -- get to know its strengths, weaknesses, beliefs, and quirks.

We instantiate these ideas within the domain of Visual Question Answering (VQA). We find that using just a few examples (50), lay people can be trained to better predict responses and oncoming failures of a complex VQA model. Surprisingly, we find that having access to the model's internal states -- its confidence in its top-k predictions, explicit or implicit attention maps which highlight regions in the image (and words in the question) the model is looking at (and listening to) while answering a question about an image -- do not help people better predict its behavior.

Arxiv Link

https://arxiv.org/abs/1704.00717

Citing this work

If you find this work useful in your research, please consider citing:

@article{toaim,
  author = {Arjun Chandrasekaran and Deshraj Yadav and Prithvijit Chattopadhyay and Viraj Prabhu and Devi Parikh},
  title = {{I}t {T}akes {T}wo to {T}ango: {T}owards {T}heory of {AI}'s {M}ind},
  journal = {arXiv preprint arXiv:1704.00717},
  year = {2017}
}

License

MIT