Repository logo
 

A questionnaire integration system based on question classification and short text semantic textual similarity

dc.contributor.authorQiu, Yu, author
dc.contributor.authorPallickara, Sangmi Lee, advisor
dc.contributor.authorPallickara, Shrideep, committee member
dc.contributor.authorLi, Kaigang, committee member
dc.date.accessioned2019-01-07T17:19:09Z
dc.date.available2019-01-07T17:19:09Z
dc.date.issued2018
dc.description.abstractSemantic integration from heterogeneous sources involves a series of NLP tasks. Existing re- search has focused mainly on measuring two paired sentences. However, to find possible identical texts between two datasets, the sentences are not paired. To avoid pair-wise comparison, this thesis proposed a semantic similarity measuring system equipped with a precategorization module. It applies a hybrid question classification module, which subdivides all texts to coarse categories. The sentences are then paired from these subcategories. The core task is to detect identical texts between two sentences, which relates to the semantic textual similarity task in the NLP field. We built a short text semantic textual similarity measuring module. It combined conventional NLP techniques, including both semantic and syntactic features, with a Recurrent Convolutional Neural Network to accomplish an ensemble model. We also conducted a set of empirical evaluations. The results show that our system possesses a degree of generalization ability, and it performs well on heterogeneous sources.
dc.format.mediumborn digital
dc.format.mediummasters theses
dc.identifierQiu_colostate_0053N_15125.pdf
dc.identifier.urihttps://hdl.handle.net/10217/193103
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Libraries
dc.relation.ispartof2000-2019
dc.rightsCopyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.titleA questionnaire integration system based on question classification and short text semantic textual similarity
dc.typeText
dcterms.rights.dplaThis Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.disciplineComputer Science
thesis.degree.grantorColorado State University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science (M.S.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Qiu_colostate_0053N_15125.pdf
Size:
7.75 MB
Format:
Adobe Portable Document Format