A light-speed large language model accelerator with optical stochastic computing
dc.contributor.author | Afifi, Salma, author | |
dc.contributor.author | Alo, Oluwaseun, author | |
dc.contributor.author | Thakkar, Ishan, author | |
dc.contributor.author | Pasricha, Sudeep, author | |
dc.contributor.author | ACM, publisher | |
dc.date.accessioned | 2025-09-25T18:41:05Z | |
dc.date.available | 2025-09-25T18:41:05Z | |
dc.date.issued | 2025-06-29 | |
dc.description.abstract | To address the increasingly intensive computational demands of attention-based large language models (LLMs), there is a growing interest in developing energy-efficient and high-speed hardware accelerators. To that end, photonics is being considered as an alternative technology to digital electronics. This work introduces a novel optical hardware accelerator that leverages stochastic computing principles for LLMs. Our proposed accelerator incorporates full-range optical stochastic multipliers and stochastic-analog compute-capable optical-to-electrical transducer units to efficiently handle static and dynamic tensor computations in attention-based models. Our analysis shows that our accelerator exhibits at least 7.6× speedup and 1.3× lower energy compared to state-of-the-art LLMs hardware accelerators. | |
dc.format.medium | born digital | |
dc.format.medium | articles | |
dc.identifier.bibliographicCitation | Salma Afifi, Oluwaseun Alo, Ishan Thakkar, and Sudeep Pasricha. 2025. A Light-Speed Large Language Model Accelerator with Optical Stochastic Computing. In Great Lakes Symposium on VLSI 2025 (GLSVLSI '25), June 30-July 02, 2025, New Orleans, LA, USA. ACM, New York, NY, USA, 7 pages. https://doi.org/10.1145/3716368.3735299 | |
dc.identifier.doi | https://doi.org/10.1145/3716368.3735299 | |
dc.identifier.uri | https://hdl.handle.net/10217/242039 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | Publications | |
dc.relation.ispartof | ACM DL Digital Library | |
dc.rights | ©Salma Afifi, et al. ACM 2025. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in GLSVLSI '25, https://dx.doi.org/10.1145/3716368.3735299. | |
dc.subject | transformer neural networks | |
dc.subject | silicon photonics | |
dc.subject | inference acceleration | |
dc.subject | stochastic computing | |
dc.subject | optical computing | |
dc.title | A light-speed large language model accelerator with optical stochastic computing | |
dc.type | Text |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- FACF_ACMOA_3716368.3735299.pdf
- Size:
- 6.56 MB
- Format:
- Adobe Portable Document Format