Repository logo
 

A light-speed large language model accelerator with optical stochastic computing

dc.contributor.authorAfifi, Salma, author
dc.contributor.authorAlo, Oluwaseun, author
dc.contributor.authorThakkar, Ishan, author
dc.contributor.authorPasricha, Sudeep, author
dc.contributor.authorACM, publisher
dc.date.accessioned2025-09-25T18:41:05Z
dc.date.available2025-09-25T18:41:05Z
dc.date.issued2025-06-29
dc.description.abstractTo address the increasingly intensive computational demands of attention-based large language models (LLMs), there is a growing interest in developing energy-efficient and high-speed hardware accelerators. To that end, photonics is being considered as an alternative technology to digital electronics. This work introduces a novel optical hardware accelerator that leverages stochastic computing principles for LLMs. Our proposed accelerator incorporates full-range optical stochastic multipliers and stochastic-analog compute-capable optical-to-electrical transducer units to efficiently handle static and dynamic tensor computations in attention-based models. Our analysis shows that our accelerator exhibits at least 7.6× speedup and 1.3× lower energy compared to state-of-the-art LLMs hardware accelerators.
dc.format.mediumborn digital
dc.format.mediumarticles
dc.identifier.bibliographicCitationSalma Afifi, Oluwaseun Alo, Ishan Thakkar, and Sudeep Pasricha. 2025. A Light-Speed Large Language Model Accelerator with Optical Stochastic Computing. In Great Lakes Symposium on VLSI 2025 (GLSVLSI '25), June 30-July 02, 2025, New Orleans, LA, USA. ACM, New York, NY, USA, 7 pages. https://doi.org/10.1145/3716368.3735299
dc.identifier.doihttps://doi.org/10.1145/3716368.3735299
dc.identifier.urihttps://hdl.handle.net/10217/242039
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Libraries
dc.relation.ispartofPublications
dc.relation.ispartofACM DL Digital Library
dc.rights©Salma Afifi, et al. ACM 2025. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in GLSVLSI '25, https://dx.doi.org/10.1145/3716368.3735299.
dc.subjecttransformer neural networks
dc.subjectsilicon photonics
dc.subjectinference acceleration
dc.subjectstochastic computing
dc.subjectoptical computing
dc.titleA light-speed large language model accelerator with optical stochastic computing
dc.typeText

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
FACF_ACMOA_3716368.3735299.pdf
Size:
6.56 MB
Format:
Adobe Portable Document Format

Collections