Exportar Publicação

A publicação pode ser exportada nos seguintes formatos: referência da APA (American Psychological Association), referência do IEEE (Institute of Electrical and Electronics Engineers), BibTeX e RIS.

Exportar Referência (APA)
Bacellar, A., Susskind, Z., Breternitz Jr., M., John, L., França, F. & Lima, P. (2024). Soon filter: Advancing tiny neural architectures for high throughput edge inference. In Proceedings of the International Joint Conference on Neural Networks. (pp. 1-8). Yokohama, Japan: IEEE.
Exportar Referência (IEEE)
A. T. Bacellar et al.,  "Soon filter: Advancing tiny neural architectures for high throughput edge inference", in Proc. of the Int. Joint Conf. on Neural Networks, Yokohama, Japan, IEEE, 2024, pp. 1-8
Exportar BibTeX
@inproceedings{bacellar2024_1782623309805,
	author = "Bacellar, A. and Susskind, Z. and Breternitz Jr., M. and John, L. and França, F. and Lima, P.",
	title = "Soon filter: Advancing tiny neural architectures for high throughput edge inference",
	booktitle = "Proceedings of the International Joint Conference on Neural Networks",
	year = "2024",
	editor = "",
	volume = "",
	number = "",
	series = "",
	doi = "10.1109/IJCNN60899.2024.10650678",
	pages = "1-8",
	publisher = "IEEE",
	address = "Yokohama, Japan",
	organization = "",
	url = "https://ieeexplore.ieee.org/document/10650678"
}
Exportar RIS
TY  - CPAPER
TI  - Soon filter: Advancing tiny neural architectures for high throughput edge inference
T2  - Proceedings of the International Joint Conference on Neural Networks
AU  - Bacellar, A.
AU  - Susskind, Z.
AU  - Breternitz Jr., M.
AU  - John, L.
AU  - França, F.
AU  - Lima, P.
PY  - 2024
SP  - 1-8
SN  - 2161-4393
DO  - 10.1109/IJCNN60899.2024.10650678
CY  - Yokohama, Japan
UR  - https://ieeexplore.ieee.org/document/10650678
AB  - As Deep Neural Networks become more complex and computationally demanding, efficient models for inference at the edge, particularly multiplication-free ones, have gained significant attention. The Ultra Low-Energy Edge Neural Network (ULEEN) is a notable architecture optimized for high throughput edge designs. ULEEN uniquely employs Bloom Filters with binary values to compute neuron activation, boasting better efficiency metrics than Binary Neural Networks (BNNs). This work uncovers a gradient back-propagation bottleneck within ULEEN’s Bloom filters and introduces a simplified version of it as a solution: the "Soon Filter". Both theoretically and empirically, we demonstrate that our approach improves gradient back-propagation efficiency. Tests on MLPerf Tiny, MNIST and various UCI datasets reveal that our method surpasses ULEEN, BNN, and DeepShift. Notably, with MLPerf KWS (Key Word Spotting) dataset, we achieve 69.6% accuracy with only 101KiB, while ULEEN, BNN and DeepShift achieve only 67.4%, 55.9%, and 24.9% respectively. Remarkably, we also achieve 67.7% accuracy with only 50KiB, resulting in a 2x model size reduction compared to ULEEN while maintaining similar accuracy (+0.3%). This results underscores the promising potential of our solution for efficient inference at the edge in applications that rely on high throughput architectures.
ER  -