Adapting RGB pose estimation to new domains

Mulay, Gururaj, author; Draper, Bruce, advisor; Beveridge, J. Ross, advisor; Maciejewsky, Anthony, committee member

dc.contributor.author	Mulay, Gururaj, author
dc.contributor.author	Draper, Bruce, advisor
dc.contributor.author	Beveridge, J. Ross, advisor
dc.contributor.author	Maciejewsky, Anthony, committee member
dc.date.accessioned	2019-06-14T17:06:55Z
dc.date.available	2019-06-14T17:06:55Z
dc.date.issued	2019
dc.description	Zip file contains supplementary videos.
dc.description.abstract	Many multi-modal human-computer interaction (HCI) systems interact with users in real time by estimating the user's pose. Generally, they estimate human poses using depth sensors such as the Microsoft Kinect. For multi-modal HCI interfaces to gain traction in the real world, however, it would be better for pose estimation to be based on data from RGB cameras, which are more common and less expensive than depth sensors. This has motivated research into pose estimation from RGB images. Convolutional Neural Networks (CNNs) represent the state of the art in this literature; see, for example, [1–6]. These systems estimate 2D human poses from RGB images. A problem with current CNN-based pose estimators is that they require large amounts of labeled data for training. If the goal is to train an RGB pose estimator for a new domain, the cost of collecting and, more importantly, labeling data can be prohibitive. A common solution is to train on publicly available pose data sets, but then the trained system is not tailored to the domain. We propose using RGB+D sensors to collect domain-specific data in the lab, and then training the RGB pose estimator using skeletons automatically extracted from the RGB+D data. This paper presents a case study of adapting the RMPE pose estimation network [4] to the domain of the DARPA Communicating with Computers (CWC) program [7], as represented by the EGGNOG data set [8]. We chose RMPE because it predicts both joint locations and Part Affinity Fields (PAFs) in real time. Our adaptation of RMPE, trained on automatically labeled data, outperforms the original RMPE on the EGGNOG data set.
dc.format.medium	born digital
dc.format.medium	masters theses
dc.format.medium	ZIP
dc.format.medium	MPEG
dc.identifier	Mulay_colostate_0053N_15457.pdf
dc.identifier.uri	https://hdl.handle.net/10217/195409
dc.language	English
dc.language.iso	eng
dc.publisher	Colorado State University. Libraries
dc.relation.ispartof	2000-2019
dc.rights	Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subject	CWC
dc.subject	human pose estimation
dc.subject	RMPE
dc.subject	HCI
dc.subject	convolutional neural networks
dc.subject	Microsoft Kinect
dc.title	Adapting RGB pose estimation to new domains
dc.type	Text
dcterms.rights.dpla	This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Colorado State University
thesis.degree.level	Masters
thesis.degree.name	Master of Science (M.S.)

Files

Original bundle

Name: Mulay_colostate_0053N_15457.pdf
Size: 1.65 MB
Format: Adobe Portable Document Format

Name: supplemental.zip
Size: 10.98 MB
Format: Zip File

Collections

2000-2019
Theses and Dissertations