Soon filter: Advancing tiny neural architectures for high throughput edge inference

Alan T. L. Bacellar; Zachary Susskind; Maurício Breternitz; Lizy K. John; Felipe M. G. França; Priscila M. V. Lima

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Publicação em atas de evento científico Q3

Soon filter: Advancing tiny neural architectures for high throughput edge inference

Alan T. L. Bacellar (Bacellar, A.); Zachary Susskind (Susskind, Z.); Maurício Breternitz (Breternitz Jr., M.); Lizy K. John (John, L.); Felipe M. G. França (França, F.); Priscila M. V. Lima (Lima, P.);

Proceedings of the International Joint Conference on Neural Networks

Ano (publicação definitiva)

2024

Língua

Inglês

País

Estados Unidos da América

Mais Informação

Visitar Link

Web of Science®

N.º de citações: 0

(Última verificação: 2026-06-27 17:38)

Ver o registo na Web of Science®

Scopus

N.º de citações: 0

(Última verificação: 2026-06-27 20:46)

Ver o registo na Scopus

Google Scholar

N.º de citações: 1

(Última verificação: 2026-06-24 20:32)

Ver o registo no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

As Deep Neural Networks become more complex and computationally demanding, efficient models for inference at the edge, particularly multiplication-free ones, have gained significant attention. The Ultra Low-Energy Edge Neural Network (ULEEN) is a notable architecture optimized for high throughput edge designs. ULEEN uniquely employs Bloom Filters with binary values to compute neuron activation, boasting better efficiency metrics than Binary Neural Networks (BNNs). This work uncovers a gradient back-propagation bottleneck within ULEEN’s Bloom filters and introduces a simplified version of it as a solution: the "Soon Filter". Both theoretically and empirically, we demonstrate that our approach improves gradient back-propagation efficiency. Tests on MLPerf Tiny, MNIST and various UCI datasets reveal that our method surpasses ULEEN, BNN, and DeepShift. Notably, with MLPerf KWS (Key Word Spotting) dataset, we achieve 69.6% accuracy with only 101KiB, while ULEEN, BNN and DeepShift achieve only 67.4%, 55.9%, and 24.9% respectively. Remarkably, we also achieve 67.7% accuracy with only 50KiB, resulting in a 2x model size reduction compared to ULEEN while maintaining similar accuracy (+0.3%). This results underscores the promising potential of our solution for efficient inference at the edge in applications that rely on high throughput architectures.

Agradecimentos/Acknowledgements

Palavras-chave

Accuracy,Upper bound,Random access memory,Computer architecture,Throughput,Hardware,Filters

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
UIDB 50008/2020	Fundação para a Ciência e a Tecnologia

Identificadores da Publicação

WoS (fonte: Ciência_Iscte)	WOS:001315691506054
Scopus (fonte: autor)	2-s2.0-85205024091
DOI (fonte: autor)	10.1109/IJCNN60899.2024.10650678
Scopus (fonte: Ciência_Iscte)	2-s2.0-85205024091
WoS (fonte: autor)	WOS:001315691506054
ID Ciência_Iscte	ci-pub-105541

Outros Detalhes da Publicação

Ano Publicação Online	2024
Editora	IEEE
Indexação	Web of Science©; Scopus;
ISSN	2161-4393 (print) 2161-4407 (online)
ISBN	979-8-3503-5932-9 (print) 979-8-3503-5931-2 (online)
Volume
Número Artigo
Páginas	1 - 8	Total Páginas	8
Avaliado Cientificamente	Sim
Editores
Título do Evento
Organizador do Evento
Cidade	Yokohama, Japan
Tipo de Evento	Conferência
Classificação do Evento	Internacional
Ano do Evento	2024
Tipo de Publicação no Evento	Artigo Completo
Data Publicação (online)	2024-09-09
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics