Journal articles

Generating Efficient FPGA-based CNN Accelerators from High-Level Descriptions

Nermine Ali 1 Jean-Marc Philippe 1 Benoit Tain 2 Philippe Coussy 3 
1 LIAE - Laboratoire Intelligence Artificielle Embarquée
Université Paris-Saclay, DSCIN - Département Systèmes et Circuits Intégrés Numériques : DRT/LIST/DSCIN
2 LECA - Laboratoire Environnement de Conception & Architecture
Université Paris-Saclay, DSCIN - Département Systèmes et Circuits Intégrés Numériques : DRT/LIST/DSCIN
Abstract : The landscape of memory-hungry and compute-intensive Convolutional Neural Networks (CNNs) is changing quickly. CNNs evolve continuously, introducing new layers or optimization strategies to improve accuracy, reduce memory and computational needs, or both. Moving such algorithms on-device enables smarter edge products. However, hardware designers find this constant evolution hard to keep up with, which keeps CNN accelerators one step behind. A growing number of approaches use reconfigurable hardware, such as FPGAs, to design customized inference accelerators better suited to newly emerging CNN algorithms. Moreover, high-level design techniques such as High-Level Synthesis (HLS), which generates RTL source code from high-level descriptions, are adopted to address the time-consuming RTL-based design process and the design space exploration problem. This paper presents a hardware accelerator generation framework targeting FPGAs that relies on two steps. The first step characterizes the input CNN and produces hardware-aware metrics. The second step exploits the generated metrics to produce optimized C-HLS source code for each layer of the input CNN, then uses an HLS tool to generate a synthesizable RTL representation of the inference accelerator. The main goal of this approach is to narrow the gap between evolving CNNs and hardware accelerators, thus reducing the design time of new systems.
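The abstract does not list which hardware-aware metrics the first step produces. As a rough illustration only, the sketch below computes three quantities commonly used to characterize a convolution layer before mapping it to hardware: output size, weight count, and multiply-accumulate (MAC) count. The names `ConvLayer` and `conv_metrics` are hypothetical and are not taken from the paper or its framework.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvLayer:
    """Hypothetical description of one 2D convolution layer (square kernel)."""
    name: str
    in_channels: int
    out_channels: int
    in_height: int
    in_width: int
    kernel: int
    stride: int = 1
    padding: int = 0


def conv_metrics(layer: ConvLayer) -> dict:
    """Return simple per-layer metrics; an assumed stand-in for the
    'hardware-aware metrics' mentioned in the abstract."""
    # Standard output-size formula for a strided, zero-padded convolution.
    out_h = (layer.in_height + 2 * layer.padding - layer.kernel) // layer.stride + 1
    out_w = (layer.in_width + 2 * layer.padding - layer.kernel) // layer.stride + 1
    # One weight per (output channel, input channel, kernel row, kernel column).
    weights = layer.out_channels * layer.in_channels * layer.kernel * layer.kernel
    # Every output position of every output channel needs in_channels * k * k MACs,
    # so the total is the number of output positions times the weight count.
    macs = out_h * out_w * weights
    return {
        "out_height": out_h,
        "out_width": out_w,
        "weights": weights,
        "macs": macs,
        "input_values": layer.in_channels * layer.in_height * layer.in_width,
        "output_values": layer.out_channels * out_h * out_w,
    }


conv1 = ConvLayer("conv1", in_channels=3, out_channels=16,
                  in_height=32, in_width=32, kernel=3, stride=1, padding=1)
metrics = conv_metrics(conv1)
```

A generator in the spirit of the second step could use such numbers to choose, per layer, how far to unroll loops or how to size on-chip buffers; how the actual framework does this is not stated in the abstract.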
Document type : Journal articles

https://hal-cea.archives-ouvertes.fr/cea-03760532
Contributor : Contributeur MAP CEA
Submitted on : Thursday, August 25, 2022 - 12:23:47 PM
Last modification on : Sunday, August 28, 2022 - 3:21:26 AM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2024-08-24

Citation

Nermine Ali, Jean-Marc Philippe, Benoit Tain, Philippe Coussy. Generating Efficient FPGA-based CNN Accelerators from High-Level Descriptions. Journal of Signal Processing Systems, Springer, 2022, ⟨10.1007/s11265-022-01797-w⟩. ⟨cea-03760532⟩
