Is Now a Good Time? Learning when agent can talk to drivers using INAGT dataset and multisensor fusion

Overview

This project examines sensor fusion techniques for modeling opportunities for proactive speech-based in-car interfaces. We leverage the Is Now a Good Time (INAGT) dataset, which consists of automotive, physiological, and visual data collected from drivers who self-annotated responses to the question “Is now a good time?,” indicating the opportunity to receive non-driving information during a 50-minute drive. We augment this original driver-annotated data with third-party annotations of perceived safety, in order to explore potential driver overconfidence. We show that fusing automotive, physiological, and visual data allows us to predict driver labels of availability, achieving an 0.874 F1-score by extracting statistically relevant features and training with our proposed deep neural network, PazNet. Using the same data and network, we achieve an 0.891 F1-score for predicting third-party labeled safe moments. We train these models to avoid false positives—determinations that it is a good time to interrupt when it is not—since false positives may cause driver distraction or service deactivation by the driver. Our analyses show that conservative models still leave many moments for interaction and show that most inopportune moments are short. This work lays a foundation for using sensor fusion models to predict when proactive speech systems should engage with drivers.

Publication

Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT 2021) & The ACM international joint conference on Pervasive and Ubiquitous Computing (UbiComp 2021)

Learning when agents can talk to drivers using the inagt dataset and multisensor fusion

Dataset (https://github.com/FAR-Lab/INAGT-data)

1915 samples across 46 drivers during a 28.5 km route
Yes / No annotations from drivers if it is a good or bad time to interact
3rd part rating of Safe / Unsafe moments (from MTurk raters)
Road video, driver face video, driver side video, and driver over shoulder video
Automotive data (CAN) from 2015 Toyota Prius
Physiological data from driver
Pre-comuted road scene understanding (via I3D Inception V1)
Pre-computed Font body pose (via OpenPose)
Pre-computed facial landmarks (via OpenPose)
Pre-computed side body pose (via OpenPose)

Sample data from participants 40 and 76

Requesting Data

The dataset is available for research use. Interesed parties will need to have IRB approval to use the data. Please fill out our dataset request form to get in contact with us about downloading the full dataset.

Related Research

Rob Semmens, Nikolas Martelaro, Pushyami Kaveti, Simon Stent, and Wendy Ju. 2019. Is Now A Good Time? An Empirical Study of Vehicle-Driver Communication Timing. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19). Association for Computing Machinery, New York, NY, USA, Paper 637, 1–12. DOI:https://doi.org/10.1145/3290605.3300867

Tong Wu, Nikolas Martelaro, Simon Stent, Jorge Ortiz, and Wendy Ju. 2021. Learning When Agents Can Talk to Drivers Using the INAGT Dataset and Multisensor Fusion. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 5, 3, Article 133 (Sept 2021), 28 pages. DOI:https://doi.org/10.1145/3478125

Recommended Citation

@article{wu_martelaro_2021, author = {Wu, Tong and Martelaro, Nikolas and Stent, Simon and Ortiz, Jorge and Ju, Wendy}, title = {Learning When Agents Can Talk to Drivers Using the INAGT Dataset and Multisensor Fusion}, year = {2021}, issue_date = {Sept 2021}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, volume = {5}, number = {3}, url = {https://doi.org/10.1145/3478125}, doi = {10.1145/3478125}, journal = {Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.}, month = sep, articleno = {133}, numpages = {28}, keywords = {multi-modal learning, dataset, interaction timing, deep convolutional network, vehicle} }

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
I3D-Feature-Extractor		I3D-Feature-Extractor
emotion		emotion
forecasting		forecasting
gluon		gluon
vision		vision
README.md		README.md
video_clip.py		video_clip.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I3D-Feature-Extractor

I3D-Feature-Extractor

emotion

emotion

forecasting

forecasting

gluon

gluon

vision

vision

README.md

README.md

video_clip.py

video_clip.py

Repository files navigation

Is Now a Good Time? Learning when agent can talk to drivers using INAGT dataset and multisensor fusion

Overview

Publication

Dataset (https://github.com/FAR-Lab/INAGT-data)

Requesting Data

Related Research

Recommended Citation

About

Releases

Packages

Languages

twmoveon/INAGT-code

Folders and files

Latest commit

History

Repository files navigation

Is Now a Good Time? Learning when agent can talk to drivers using INAGT dataset and multisensor fusion

Overview

Publication

Dataset (https://github.com/FAR-Lab/INAGT-data)

Requesting Data

Related Research

Recommended Citation

About

Topics

Resources

Stars

Watchers

Forks

Languages