Automatic question detection from prosodic speech analysis

Hirsch, Rachel, author; Draper, Bruce, advisor; Whitley, Darrell, advisor; Kirby, Michael, committee member

Automatic question detection from prosodic speech analysis

Files

Hirsch_colostate_0053N_15618.pdf (927.19 KB)

Date

2019

Authors

Hirsch, Rachel, author

Draper, Bruce, advisor

Whitley, Darrell, advisor

Kirby, Michael, committee member

Abstract

Human-agent spoken communication has become ubiquitous over the last decade, with assistants such as Siri and Alexa being used more every day. An AI agent needs to understand exactly what the user says to it and respond accurately. To correctly respond, the agent has to know whether it is being given a command or asked a question. In Standard American English (SAE), both word choice and intonation of the speaker are necessary to discern the true sentiment of an utterance. Much Natural Language Processing (NLP) research has been done into automatically determining these sentence types using word choice alone. However, intonation is ultimately the key to understanding the sentiment of a spoken sentence. This thesis uses a series of attributes to characterize vocal prosody of utterances to train classifiers to detect questions. The dataset used to train these classifiers is a series of hearings by the Supreme Court of the United States (SCOTUS). Prosody-trained classifier results are compared against a text-based classifier, using Google Speech-to-Text transcriptions of the same dataset.

Subject

lexicon

natural language processing

sentiment detection

machine learning

human-computer interaction

prosody

URI

https://hdl.handle.net/10217/197389
https://doi.org/10.25675/3.023427

Collections

2000-2019
Theses and Dissertations

Full item page

Automatic question detection from prosodic speech analysis

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Abstract

Description

Rights Access

Subject

Citation

URI

Associated Publications

Collections