Fast prototyping of Quantized neural networks on an FPGA edge computing device with Brevitas and FINN

Devansh Chawda, Benaoumeur Senouci*

*Kontaktforfatter

Publikation: Kapitel i bog/rapport/konference-proceedingKonferencebidrag i proceedingsForskningpeer review

Abstract

In this paper, we propose a solution for fast proto-typing of Deep learning neural network models on edge computing devices like FPGA for researchers with limited knowledge of high level languages like VHDL. We use Xilinx' Brevitas tool for Quantization and FINN framework for deployment/inference on Pynq-Z2 board. The paper will also share presently available methods for FPGA prototyping and how tools like Brevitas and FINN can be used for more efficient inference of DNN on small scale edge computers like FPGA by levaraging their 1. Quantization Aware Training(QAT) and Post Training Quanti-zation(PTQ) 2. Streamlining networks and transformations 3. Dataflow partitioning of the NN model using FINN compiler 4. DMA, FIFO and IP generation for HW build and 5. Inference on FPGA using PYNQ python Driver. The weights and activations of a custom model were quantised from floating points to 8, 4 and 2 bit for which an accuracy drop of 0.1 %, 0.8% and 7.6% was observed respectively.

OriginalsprogEngelsk
Titel2024 Fifteenth International Conference on Ubiquitous and Future Networks (ICUFN)
ForlagIEEE Press
Publikationsdato2024
Sider238-240
ISBN (Elektronisk)9798350385298
DOI
StatusUdgivet - 2024
Begivenhed15th International Conference on Ubiquitous and Future Networks, ICUFN 2024 - Hybrid, Hungary, Ungarn
Varighed: 2. jul. 20245. jul. 2024

Konference

Konference15th International Conference on Ubiquitous and Future Networks, ICUFN 2024
Land/OmrådeUngarn
ByHybrid, Hungary
Periode02/07/202405/07/2024
SponsorKorean Institute of Communications and Information Sciences (KICS)
NavnInternational Conference on Ubiquitous and Future Networks, ICUFN
ISSN2165-8528

Bibliografisk note

Publisher Copyright:
© 2024 IEEE.

Fingeraftryk

Dyk ned i forskningsemnerne om 'Fast prototyping of Quantized neural networks on an FPGA edge computing device with Brevitas and FINN'. Sammen danner de et unikt fingeraftryk.

Citationsformater