Butterfly network: a convolutional neural network with a new architecture for multi-scale semantic segmentation of pedestrians

M. A. Alavianmehr*, M. S. Helfroush, H. Danyali, A. Tashk

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

Abstract

The detection of multi-scale pedestrians is one of the challenging tasks in pedestrian detection applications. Moreover, the task of small-scale pedestrian detection, i.e., accurate localization of pedestrians as low-scale target objects, can help solve the issue of occluded pedestrian detection as well. In this paper, we present a fully convolutional neural network with a new architecture and an innovative, fully detailed supervision for semantic segmentation of pedestrians. The proposed network has been named butterfly network (BF-Net) because of its architecture analogous to a butterfly. The proposed BF-Net preserves the ability of simplicity so that it can process static images with a real-time image processing rate. The sub-path blocks embedded in the architecture of the proposed BF-Net provides a higher accuracy for detecting multi-scale objective targets including the small ones. The other advantage of the proposed architecture is replacing common batch normalization with conditional one. In conclusion, the experimental results of the proposed method demonstrate that the proposed network outperform the other state-of-the-art networks such as U-Net + + , U-Net3 + , Mask-RCNN, and Deeplabv3 + for the semantic segmentation of the pedestrians.

Original languageEnglish
Article number9
JournalJournal of Real-Time Image Processing
Volume20
Issue number1
Number of pages17
ISSN1861-8200
DOIs
Publication statusPublished - Feb 2023

Keywords

  • Butterfly network (BF-Net)
  • Convolutional neural network
  • Pedestrian detection
  • Semantic segmentation
  • State-of-the-art U-Nets

Fingerprint

Dive into the research topics of 'Butterfly network: a convolutional neural network with a new architecture for multi-scale semantic segmentation of pedestrians'. Together they form a unique fingerprint.

Cite this