The many-core design paradigm requires flexible and modular hardware and software components to provide the required scalability of next-generation on-chip multiprocessor architectures. A multidisciplinary approach is necessary to consider all the interactions between the different components of the design. In this work a complete design methodology is proposed, tackling at once the aspects of hardware architecture, programming model and design automation. The proposed design flow has been used in the implementation of a multiprocessor Network-on-Chip based system, the NoCRay graphic accelerator. The system uses 8 Tensilica LX processors and has been physically implemented on a Xilinx Virtex-4 LX-160 FPGA reporting a 17.3M equivalent gate-count. Performance are compared with a commercial general purpose processor and show good results considering the low frequency of the prototype.
A methodology and a case-study for Network-on-Chipbased MP-SoC architectures / Tota, Sergio Vincenzo; Casu, MARIO ROBERTO; Motto, Paolo; RUO ROCH, Massimo; Zamboni, Maurizio. - ELETTRONICO. - (2007). (Intervento presentato al convegno Second International Conference on Nano-Networks (Nano-Net 2007) tenutosi a Catania nel 24-26 Sept. 2007).
A methodology and a case-study for Network-on-Chipbased MP-SoC architectures
TOTA, Sergio Vincenzo;CASU, MARIO ROBERTO;MOTTO, PAOLO;RUO ROCH, Massimo;ZAMBONI, Maurizio
2007
Abstract
The many-core design paradigm requires flexible and modular hardware and software components to provide the required scalability of next-generation on-chip multiprocessor architectures. A multidisciplinary approach is necessary to consider all the interactions between the different components of the design. In this work a complete design methodology is proposed, tackling at once the aspects of hardware architecture, programming model and design automation. The proposed design flow has been used in the implementation of a multiprocessor Network-on-Chip based system, the NoCRay graphic accelerator. The system uses 8 Tensilica LX processors and has been physically implemented on a Xilinx Virtex-4 LX-160 FPGA reporting a 17.3M equivalent gate-count. Performance are compared with a commercial general purpose processor and show good results considering the low frequency of the prototype.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/1671636
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo