Fast prototyping of Quantized neural networks on an FPGA edge computing device with Brevitas and FINN

Devansh Chawda, Benaoumeur Senouci*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

In this paper, we propose a solution for fast proto-typing of Deep learning neural network models on edge computing devices like FPGA for researchers with limited knowledge of high level languages like VHDL. We use Xilinx' Brevitas tool for Quantization and FINN framework for deployment/inference on Pynq-Z2 board. The paper will also share presently available methods for FPGA prototyping and how tools like Brevitas and FINN can be used for more efficient inference of DNN on small scale edge computers like FPGA by levaraging their 1. Quantization Aware Training(QAT) and Post Training Quanti-zation(PTQ) 2. Streamlining networks and transformations 3. Dataflow partitioning of the NN model using FINN compiler 4. DMA, FIFO and IP generation for HW build and 5. Inference on FPGA using PYNQ python Driver. The weights and activations of a custom model were quantised from floating points to 8, 4 and 2 bit for which an accuracy drop of 0.1 %, 0.8% and 7.6% was observed respectively.

Original languageEnglish
Title of host publication2024 Fifteenth International Conference on Ubiquitous and Future Networks (ICUFN)
PublisherIEEE Press
Publication date2024
Pages238-240
ISBN (Electronic)9798350385298
DOIs
Publication statusPublished - 2024
Event15th International Conference on Ubiquitous and Future Networks, ICUFN 2024 - Hybrid, Hungary, Hungary
Duration: 2. Jul 20245. Jul 2024

Conference

Conference15th International Conference on Ubiquitous and Future Networks, ICUFN 2024
Country/TerritoryHungary
CityHybrid, Hungary
Period02/07/202405/07/2024
SponsorKorean Institute of Communications and Information Sciences (KICS)
SeriesInternational Conference on Ubiquitous and Future Networks, ICUFN
ISSN2165-8528

Keywords

  • Brevitas
  • CNN
  • DNN
  • FINN
  • FPGA
  • Neural Network
  • Pynq
  • Quantization

Fingerprint

Dive into the research topics of 'Fast prototyping of Quantized neural networks on an FPGA edge computing device with Brevitas and FINN'. Together they form a unique fingerprint.

Cite this