Sparse binary transformers for multivariate time series modeling

Gorbett, Matt, author; Shirazi, Hossein, author; Ray, Indrakshi, author; ACM, publisher

doi:https://doi.org/10.1145/3580305.3599508

Files

FACF_ACMOA_3580305.3599508.pdf (1.24 MB)

Date

2023-08-04

Authors

Gorbett, Matt, author

Shirazi, Hossein, author

Ray, Indrakshi, author

ACM, publisher

Abstract

Compressed neural networks have the potential to enable deep learning across new applications and smaller computational environments. However, the range of learning tasks in which such models can succeed is not well studied. In this work, we apply sparse and binary-weighted Transformers to multivariate time series problems, showing that the lightweight models achieve accuracy comparable to that of dense floating-point Transformers of the same structure. Our model achieves favorable results across three time series learning tasks: classification, anomaly detection, and single-step forecasting. Additionally, to reduce the computational complexity of the attention mechanism, we apply two modifications, which show little to no decline in model performance: 1) in the classification task, we apply a fixed mask to the query, key, and value activations, and 2) for forecasting and anomaly detection, which rely on predicting outputs at a single point in time, we propose an attention mask that allows computation only at the current time step. Together, the compression techniques and attention modifications substantially reduce the number of non-zero operations necessary in the Transformer. We measure the computational savings of our approach over a range of metrics including parameter count, bit size, and floating-point operation (FLOP) count, showing up to a 53x reduction in storage size and up to a 10.5x reduction in FLOPs.
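
The abstract describes the second attention modification only at a high level. As a rough illustration, and not the authors' implementation, the sketch below shows one way to read it: when only the output at the current (final) time step is needed, attention can be evaluated for that single query row instead of for every position in the window. All function names here (`full_attention`, `last_step_attention`, `score_count`) are hypothetical, and the code uses plain dense floating-point attention without the sparse binary weights discussed in the paper.

```python
import math

def softmax(row):
    # Numerically stable softmax over a list of floats.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def full_attention(Q, K, V):
    """Scaled dot-product attention for every query row in Q."""
    d = len(K[0])
    out = []
    for q in Q:
        weights = softmax([dot(q, k) / math.sqrt(d) for k in K])
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out

def last_step_attention(Q, K, V):
    """Attention for the final time step's query only (assumed reading of the mask)."""
    return full_attention([Q[-1]], K, V)[0]

def score_count(T, last_step_only):
    """Number of query-key dot products for a window of length T."""
    return T if last_step_only else T * T
```

Under this reading, the output for the current time step is unchanged, while the number of query-key scores for a window of length T drops from T * T to T; for example, `score_count(50, False)` is 2500 and `score_count(50, True)` is 50. This count covers only the attention scores and is not meant to reproduce the FLOP figures reported in the abstract.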

Subject

transformer

sparse

pruned

binary

deep learning

multivariate time series

anomaly detection

classification

forecasting

lottery ticket hypothesis

URI

https://hdl.handle.net/10217/239532

Collections

Publications
