Weightless neural networks for efficient edge inference

Zachary Susskind; Aman Arora; Igor D. S. Miranda; Luis A. Q. Villon; Rafael F. Katopodis; Leandro S. de Araújo; Diego Leonel Cadette Dutra; Priscila Lima; Felipe França; Maurício Breternitz; Lizy K. John

Publication in conference proceedings

Zachary Susskind (Susskind, Z.); Aman Arora (Arora, A.); Igor D. S. Miranda (Miranda, I. D. S.); Luis A. Q. Villon (Villon, L. A. Q.); Rafael F. Katopodis (Katopodis, R. F.); Leandro S. de Araújo (Araújo, L. S. de.); Diego Leonel Cadette Dutra (Dutra, D. L. C.); Priscila Lima (Lima, P. M. V.); Felipe França (França, F. M. G.); Maurício Breternitz (Breternitz Jr., M.); Lizy K. John (John, L. K.); et al.

PACT '22: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques

Year (final publication)

2022

Language

English

Country

United States of America

Web of Science®

Citations: 15

(Last checked: 2026-02-21 15:47)

Scopus

Citations: 20

(Last checked: 2026-02-21 19:07)

Google Scholar

Citations: 47

(Last checked: 2026-02-22 09:31)

Overton

This publication is not indexed in Overton

Abstract

Weightless neural networks (WNNs) are a class of machine learning models that use table lookups to perform inference, rather than the multiply-accumulate operations typical of deep neural networks (DNNs). Individual weightless neurons are capable of learning non-linear functions of their inputs, a theoretical advantage over the linear neurons in DNNs, yet state-of-the-art WNN architectures still lag behind DNNs in accuracy on common classification tasks. Additionally, many existing WNN architectures suffer from high memory requirements, hindering implementation. In this paper, we propose a novel WNN architecture, BTHOWeN, with key algorithmic and architectural improvements over prior work, namely counting Bloom filters, hardware-friendly hashing, and Gaussian-based nonlinear thermometer encodings. These enhancements improve model accuracy while reducing size and energy per inference. BTHOWeN targets the large and growing edge computing sector by providing superior latency and energy efficiency to both prior WNNs and comparable quantized DNNs. Compared to state-of-the-art WNNs across nine classification datasets, BTHOWeN on average reduces error by more than 40% and model size by more than 50%. We demonstrate the viability of a hardware implementation of BTHOWeN by presenting an FPGA-based inference accelerator, and compare its latency and resource usage against similarly accurate quantized DNN inference accelerators, including multi-layer perceptron (MLP) and convolutional models. The proposed BTHOWeN models consume almost 80% less energy than the MLP models, with nearly 85% reduction in latency. In our quest for efficient ML on the edge, WNNs are clearly deserving of additional attention.
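
As a rough illustration of the three ingredients the abstract names (thermometer encoding with Gaussian-based thresholds, counting Bloom filters, and table-lookup inference), the following Python sketch builds a tiny WiSARD-style classifier. It is not the authors' implementation. The class and function names, the XOR-of-random-masks hash, the "increment the smallest counters" training rule, the `bleach` threshold, and all sizes are assumptions chosen for illustration only.

```python
import random
from statistics import NormalDist


def gaussian_thermometer(x, mean, std, bits):
    """Unary-encode x; thresholds are equal-probability quantiles of N(mean, std)."""
    dist = NormalDist(mean, std)
    thresholds = [dist.inv_cdf((i + 1) / (bits + 1)) for i in range(bits)]
    return [1 if x > t else 0 for t in thresholds]


class CountingBloomFilter:
    """Small counter table addressed by XOR-based hashes (assumed hash scheme)."""

    def __init__(self, input_bits, entries, hashes, rng):
        assert entries & (entries - 1) == 0, "entries must be a power of two"
        self.counts = [0] * entries
        # One random mask per input bit for each hash function.
        self.masks = [[rng.randrange(entries) for _ in range(input_bits)]
                      for _ in range(hashes)]

    def _indices(self, bits):
        indices = []
        for masks in self.masks:
            h = 0
            for bit, mask in zip(bits, masks):
                if bit:
                    h ^= mask  # XOR together the masks of all set input bits
            indices.append(h)
        return indices

    def train(self, bits):
        # Assumed update rule: bump only the smallest addressed counter(s).
        indices = set(self._indices(bits))
        lowest = min(self.counts[i] for i in indices)
        for i in indices:
            if self.counts[i] == lowest:
                self.counts[i] += 1

    def seen(self, bits, bleach=1):
        # A pattern counts as seen if every addressed counter reaches the threshold.
        return min(self.counts[i] for i in self._indices(bits)) >= bleach


class Discriminator:
    """One per class: splits the shuffled input bits across several filters."""

    def __init__(self, total_bits, filter_inputs, entries, hashes, seed):
        rng = random.Random(seed)  # same seed => same mapping and hashes per class
        self.order = list(range(total_bits))
        rng.shuffle(self.order)
        self.n = filter_inputs
        self.filters = [CountingBloomFilter(filter_inputs, entries, hashes, rng)
                        for _ in range(total_bits // filter_inputs)]

    def _chunks(self, bits):
        mapped = [bits[i] for i in self.order]
        return [mapped[k:k + self.n]
                for k in range(0, len(self.filters) * self.n, self.n)]

    def train(self, bits):
        for f, chunk in zip(self.filters, self._chunks(bits)):
            f.train(chunk)

    def score(self, bits, bleach=1):
        # Inference is only hashing and table lookups; no multiply-accumulate.
        return sum(f.seen(chunk, bleach)
                   for f, chunk in zip(self.filters, self._chunks(bits)))


def encode(sample, bits=4):
    out = []
    for x in sample:
        out += gaussian_thermometer(x, 0.0, 1.0, bits)
    return out


# Toy usage: two classes, two features, 4 thermometer bits per feature.
classes = {name: Discriminator(8, 4, 16, 2, seed=0) for name in ("A", "B")}
classes["A"].train(encode((-0.5, 0.5)))
classes["B"].train(encode((1.0, -1.0)))


def predict(sample):
    bits = encode(sample)
    return max(classes, key=lambda c: classes[c].score(bits))
```

Because Bloom filters can return false positives but never false negatives, a pattern that was trained into a discriminator always receives that discriminator's full score; an unseen pattern may occasionally collide, which is the accuracy-for-memory trade-off this kind of design accepts.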

Keywords

Weightless Neural Networks, WNN, WiSARD, Neural networks, Hardware acceleration, Inference, Edge computing

Funding records

Funding reference	Funding entity
3015.001/3016.001	Semiconductor Research Corporation (SRC)
1763848	National Science Foundation
UIDB/04466/2020	Fundação para a Ciência e a Tecnologia
UIDP/04466/2020	Fundação para a Ciência e a Tecnologia
310676/2019-3	CNPq - Conselho Nacional de Desenvolvimento Científico e Tecnológico
POCI-01-0247-FEDER-045912	European Commission

Publication identifiers

WoS (source: Ciência_Iscte)	WOS:001071492700021
Scopus (source: author)	2-s2.0-85147333502
DOI (source: author)	10.1145/3559009.3569680
Scopus (source: Ciência_Iscte)	2-s2.0-85147333502
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/32038
WoS (source: author)	WOS:001071492700021
Ciência_Iscte ID	ci-pub-92348

Other publication details

Online publication year	2023
Publisher	Association for Computing Machinery
Indexing	Web of Science®; Scopus
ISSN	--
ISBN	978-1-4503-9868-8 (online)
Pages	279-290	Total pages	12
Peer reviewed	Yes
Editors	Andreas Kloeckner, José Moreira
Event title	International Conference on Parallel Architectures and Compilation Techniques
City	Chicago, Illinois
Event type	Conference
Event classification	International
Event year	2022
Publication type at event	Full paper
