Soon filter: Advancing tiny neural architectures for high throughput edge inference

Alan T. L. Bacellar; Zachary Susskind; Maurício Breternitz; Lizy K. John; Felipe M. G. França; Priscila M. V. Lima

Ciência_Iscte Publications Publication Detailed Description

Publication in conference proceedings Q3

Soon filter: Advancing tiny neural architectures for high throughput edge inference

Alan T. L. Bacellar (Bacellar, A.); Zachary Susskind (Susskind, Z.); Maurício Breternitz (Breternitz Jr., M.); Lizy K. John (John, L.); Felipe M. G. França (França, F.); Priscila M. V. Lima (Lima, P.);

Proceedings of the International Joint Conference on Neural Networks

Year (definitive publication)

2024

Language

English

Country

United States of America

More Information

Visit Link

Web of Science®

Times Cited: 0

(Last checked: 2026-06-27 17:38)

View record in Web of Science®

Scopus

Times Cited: 0

(Last checked: 2026-06-27 20:46)

View record in Scopus

Google Scholar

Times Cited: 1

(Last checked: 2026-06-24 20:32)

View record in Google Scholar

Overton

This publication is not indexed in Overton

Abstract

As Deep Neural Networks become more complex and computationally demanding, efficient models for inference at the edge, particularly multiplication-free ones, have gained significant attention. The Ultra Low-Energy Edge Neural Network (ULEEN) is a notable architecture optimized for high throughput edge designs. ULEEN uniquely employs Bloom Filters with binary values to compute neuron activation, boasting better efficiency metrics than Binary Neural Networks (BNNs). This work uncovers a gradient back-propagation bottleneck within ULEEN’s Bloom filters and introduces a simplified version of it as a solution: the "Soon Filter". Both theoretically and empirically, we demonstrate that our approach improves gradient back-propagation efficiency. Tests on MLPerf Tiny, MNIST and various UCI datasets reveal that our method surpasses ULEEN, BNN, and DeepShift. Notably, with MLPerf KWS (Key Word Spotting) dataset, we achieve 69.6% accuracy with only 101KiB, while ULEEN, BNN and DeepShift achieve only 67.4%, 55.9%, and 24.9% respectively. Remarkably, we also achieve 67.7% accuracy with only 50KiB, resulting in a 2x model size reduction compared to ULEEN while maintaining similar accuracy (+0.3%). This results underscores the promising potential of our solution for efficient inference at the edge in applications that rely on high throughput architectures.

Acknowledgements

Keywords

Accuracy,Upper bound,Random access memory,Computer architecture,Throughput,Hardware,Filters

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences

Funding Records

Funding Reference	Funding Entity
UIDB 50008/2020	Fundação para a Ciência e a Tecnologia

Publication Identifiers

Scopus (source: Ciência_Iscte)	2-s2.0-85205024091
DOI (source: author)	10.1109/IJCNN60899.2024.10650678
WoS (source: Ciência_Iscte)	WOS:001315691506054
WoS (source: author)	WOS:001315691506054
Scopus (source: author)	2-s2.0-85205024091
Ciência_Iscte ID	ci-pub-105541

Other Publication Details

Online Publication Year	2024
Publisher	IEEE
Indexes	Web of Science©; Scopus;
ISSN	2161-4393 (print) 2161-4407 (online)
ISBN	979-8-3503-5932-9 (print) 979-8-3503-5931-2 (online)
Volume
Article Number
Pages	1 - 8	Total Pages	8
Peer Reviewed	Yes
Editors
Event Title
Event Organizer
City	Yokohama, Japan
Event Type	Conference
Event Classification	International
Event Year	2024
Event Publication Type	Full Paper
Publication Date (online)	2024-09-09
Publication Date (print)

Altmetric

Dimensions

PlumX Metrics