LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads

Fariborz, M.; Samani, M.; Fotouhi, P.; Proietti, R.; I. -M., Yi; Akella, V.; Lowe-Power, J.; Palermo, S.; Yoo, S. J. B.

doi:10.1007/978-3-031-07312-0_3

As emerging workloads exhibit irregular memory access patterns with poor data reuse and locality, they would benefit from a DRAM that achieves low latency without sacrificing bandwidth and energy efficiency. We propose LLM (Low Latency Memory), a codesign of the DRAM microarchitecture, the memory controller and the LLC/DRAM interconnect by leveraging embedded silicon photonics in 2.5D/3D integrated system on chip. LLM relies on Wavelength Division Multiplexing (WDM)-based photonic interconnects to reduce the contention throughout the memory subsystem. LLM also increases the bank-level parallelism, eliminates bus conflicts by using dedicated optical data paths, and reduces the access energy per bit with shorter global bitlines and smaller row buffers. We evaluate the design space of LLM for a variety of synthetic benchmarks and representative graph workloads on a full-system simulator (gem5). LLM exhibits low memory access latency for traffics with both regular and irregular access patterns. For irregular traffic, LLM achieves high bandwidth utilization (over 80% peak throughput compared to 20% of HBM2.0). For real workloads, LLM achieves 3 × and 1.8 × lower execution time compared to HBM2.0 and a state-of-the-art memory system with high memory level parallelism, respectively. This study also demonstrates that by reducing queuing on the data path, LLM can achieve on average 3.4 × lower memory latency variation compared to HBM2.0.

LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads / Fariborz, M.; Samani, M.; Fotouhi, P.; Proietti, R.; Yi, I. -M.; Akella, V.; Lowe-Power, J.; Palermo, S.; Yoo, S. J. B.. - STAMPA. - 13289:(2022), pp. 44-64. ( 37th International Conference on High Performance Computing, ISC High Performance 2022 Hamburg, Germany May 29 – June 2, 2022) [10.1007/978-3-031-07312-0_3].

LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads

Fariborz M.;Samani M.;Fotouhi P.;Proietti R.;Yi I. -M.;Akella V.;Lowe-Power J.;Palermo S.;Yoo S. J. B.

2022

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2022
			
	Titolo della Serie/Collana
	
				LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
			
	Codice ISBN
	
				978-3-031-07311-3
978-3-031-07312-0
			
	Appare nelle tipologie
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
ISC_2022_LLM_Paper.pdf accesso aperto Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 1.09 MB Formato Adobe PDF Visualizza/Apri	1.09 MB	Adobe PDF	Visualizza/Apri
LLM_paper.pdf accesso riservato Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Non Pubblico - Accesso privato/ristretto Dimensione 2.04 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.04 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2972984

PORTO @ Archivio Istituzionale della Ricerca

LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads

Fariborz M.;Samani M.;Fotouhi P.;Proietti R.;Yi I. -M.;Akella V.;Lowe-Power J.;Palermo S.;Yoo S. J. B.

2022

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)