Sparse binary transformers for multivariate time series modeling

Gorbett, Matt, author; Shirazi, Hossein, author; Ray, Indrakshi, author; ACM, publisher

Sparse binary transformers for multivariate time series modeling

dc.contributor.author	Gorbett, Matt, author
dc.contributor.author	Shirazi, Hossein, author
dc.contributor.author	Ray, Indrakshi, author
dc.contributor.author	ACM, publisher
dc.date.accessioned	2024-11-11T19:34:33Z
dc.date.available	2024-11-11T19:34:33Z
dc.date.issued	2023-08-04
dc.description.abstract	Compressed Neural Networks have the potential to enable deep learning across new applications and smaller computational environments. However, understanding the range of learning tasks in which such models can succeed is not well studied. In this work, we apply sparse and binary-weighted Transformers to multivariate time series problems, showing that the lightweight models achieve accuracy comparable to that of dense floating-point Transformers of the same structure. Our model achieves favorable results across three time series learning tasks: classification, anomaly detection, and single-step forecasting. Additionally, to reduce the computational complexity of the attention mechanism, we apply two modifications, which show little to no decline in model performance: 1) in the classification task, we apply a fixed mask to the query, key, and value activations, and 2) for forecasting and anomaly detection, which rely on predicting outputs at a single point in time, we propose an attention mask to allow computation only at the current time step. Together, each compression technique and attention modification substantially reduces the number of non-zero operations necessary in the Transformer. We measure the computational savings of our approach over a range of metrics including parameter count, bit size, and floating point operation (FLOPs) count, showing up to a 53x reduction in storage size and up to 10.5x reduction in FLOPs.
dc.format.medium	born digital
dc.format.medium	articles
dc.identifier.bibliographicCitation	Matt Gorbett, Hossein Shirazi, and Indrakshi Ray. 2023. Sparse Binary Transformers for Multivariate Time Series Modeling. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '23), August 6–10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 13 pages. https://doi.org/10.1145/3580305.3599508
dc.identifier.doi	https://doi.org/10.1145/3580305.3599508
dc.identifier.uri	https://hdl.handle.net/10217/239532
dc.language	English
dc.language.iso	eng
dc.publisher	Colorado State University. Libraries
dc.relation.ispartof	Publications
dc.relation.ispartof	ACM DL Digital Library
dc.rights	©Matt Gorbett, et al. ACM 2023. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in KDD '23, https://dx.doi.org/10.1145/3580305.3599508.
dc.subject	transformer
dc.subject	sparse
dc.subject	pruned
dc.subject	binary
dc.subject	deep learning
dc.subject	multivariate time series
dc.subject	anomaly detection
dc.subject	classification
dc.subject	forecasting
dc.subject	lottery ticket hypothesis
dc.title	Sparse binary transformers for multivariate time series modeling
dc.type	Text

Files

Original bundle

Now showing 1 - 1 of 1

Name:: FACF_ACMOA_3580305.3599508.pdf
Size:: 1.24 MB
Format:: Adobe Portable Document Format

Download

Collections

Publications