Abstract
In this paper, we propose a solution for fast proto-typing of Deep learning neural network models on edge computing devices like FPGA for researchers with limited knowledge of high level languages like VHDL. We use Xilinx' Brevitas tool for Quantization and FINN framework for deployment/inference on Pynq-Z2 board. The paper will also share presently available methods for FPGA prototyping and how tools like Brevitas and FINN can be used for more efficient inference of DNN on small scale edge computers like FPGA by levaraging their 1. Quantization Aware Training(QAT) and Post Training Quanti-zation(PTQ) 2. Streamlining networks and transformations 3. Dataflow partitioning of the NN model using FINN compiler 4. DMA, FIFO and IP generation for HW build and 5. Inference on FPGA using PYNQ python Driver. The weights and activations of a custom model were quantised from floating points to 8, 4 and 2 bit for which an accuracy drop of 0.1 %, 0.8% and 7.6% was observed respectively.
Originalsprog | Engelsk |
---|---|
Titel | 2024 Fifteenth International Conference on Ubiquitous and Future Networks (ICUFN) |
Forlag | IEEE Press |
Publikationsdato | 2024 |
Sider | 238-240 |
ISBN (Elektronisk) | 9798350385298 |
DOI | |
Status | Udgivet - 2024 |
Begivenhed | 15th International Conference on Ubiquitous and Future Networks, ICUFN 2024 - Hybrid, Hungary, Ungarn Varighed: 2. jul. 2024 → 5. jul. 2024 |
Konference
Konference | 15th International Conference on Ubiquitous and Future Networks, ICUFN 2024 |
---|---|
Land/Område | Ungarn |
By | Hybrid, Hungary |
Periode | 02/07/2024 → 05/07/2024 |
Sponsor | Korean Institute of Communications and Information Sciences (KICS) |
Navn | International Conference on Ubiquitous and Future Networks, ICUFN |
---|---|
ISSN | 2165-8528 |
Bibliografisk note
Publisher Copyright:© 2024 IEEE.