Multimodal agents for cooperative interaction

Strout, Joseph J., author; Beveridge, Ross, advisor; Ortega, Francisco, committee member; Daunhauer, Lisa, committee member

Multimodal agents for cooperative interaction

Files

Strout_colostate_0053N_16283.pdf (911.64 KB)

Date

2020

Authors

Strout, Joseph J., author

Beveridge, Ross, advisor

Ortega, Francisco, committee member

Daunhauer, Lisa, committee member

Abstract

Embodied virtual agents offer the potential to interact with a computer in a more natural manner, similar to how we interact with other people. To reach this potential requires multimodal interaction, including both speech and gesture. This project builds on earlier work at Colorado State University and Brandeis University on just such a multimodal system, referred to as Diana. I designed and developed a new software architecture to directly address some of the difficulties of the earlier system, particularly with regard to asynchronous communication, e.g., interrupting the agent after it has begun to act. Various other enhancements were made to the agent systems, including the model itself, as well as speech recognition, speech synthesis, motor control, and gaze control. Further refactoring and new code were developed to achieve software engineering goals that are not outwardly visible, but no less important: decoupling, testability, improved networking, and independence from a particular agent model. This work, combined with the effort of others in the lab, has produced a "version 2'' Diana system that is well positioned to serve the lab's research needs in the future. In addition, in order to pursue new research opportunities related to developmental and intervention science, a "Faelyn Fox'' agent was developed. This is a different model, with a simplified cognitive architecture, and a system for defining an experimental protocol (for example, a toy-sorting task) based on Unity's visual state machine editor. This version too lays a solid foundation for future research.

Subject

artificial intelligence

gesture

speech

communication

agents

multimodal

URI

https://hdl.handle.net/10217/219514

Collections

2020-
Theses and Dissertations

Full item page

Multimodal agents for cooperative interaction

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Abstract

Description

Rights Access

Subject

Citation

URI

Associated Publications

Collections