Embedding based clustering of time series data using dynamic time warping
dc.contributor.author | Mendis, R. A. C. Laksheen, author | |
dc.contributor.author | Pallickara, Sangmi Lee, advisor | |
dc.contributor.author | Pallickara, Shrideep, committee member | |
dc.contributor.author | Hayne, Stephen, committee member | |
dc.date.accessioned | 2022-05-30T10:21:13Z | |
dc.date.available | 2022-05-30T10:21:13Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Voluminous time-series observational data impose challenges pertaining to storage and analytics. Identifying patterns in such climate time-series data is critical for many geospatial applications. Over the recent years, clustering has become a key computational technique for identifying patterns/clusters. However, data with complex structures and high dimensions could lead to uninformative clusters and hinder the quality of clustering. In this research, we use the state-of-the-art autoencoders with LSTMs, Bidirectional LSTMs and GRUs to learn highly non-linear mapping functions by training the networks with subsequences of timeseries to perform data reconstruction. Next, we extract the trained encoders to generate embeddings which are lightweight. These embeddings are more space efficient than the original time series data and require less computational power and resources for further processing. In the final step of clustering, instead of using common distance-based metrics like Euclidean distance, we use DTW, an algorithm for computing similarity between time series by ignoring variations in speed, to calculate similarity between the embeddings during the application of k- Means algorithm. Based on Silhouette score, this method generates clusters which are better than other reduction techniques. | |
dc.format.medium | born digital | |
dc.format.medium | masters theses | |
dc.identifier | Mendis_colostate_0053N_17047.pdf | |
dc.identifier.uri | https://hdl.handle.net/10217/235176 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | 2020- | |
dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
dc.title | Embedding based clustering of time series data using dynamic time warping | |
dc.type | Text | |
dcterms.rights.dpla | This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | Colorado State University | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science (M.S.) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Mendis_colostate_0053N_17047.pdf
- Size:
- 1.59 MB
- Format:
- Adobe Portable Document Format