Exportar Publicação

A publicação pode ser exportada nos seguintes formatos: referência da APA (American Psychological Association), referência do IEEE (Institute of Electrical and Electronics Engineers), BibTeX e RIS.

Exportar Referência (APA)
Sezavar, A., Brites, C., Ascenso, J. & Ebrahimi, T. (2025). A learning-based lossless event data compression for computer vision applications. In Proc. SPIE 13605, Applications of Digital Image Processing XLVIII. (pp. 33-39). San Diego, United States: SPIE.
Exportar Referência (IEEE)
A. Sezavar et al.,  "A learning-based lossless event data compression for computer vision applications", in Proc. SPIE 13605, Applications of Digital Image Processing XLVIII, San Diego, United States, SPIE, 2025, pp. 33-39
Exportar BibTeX
@inproceedings{sezavar2025_1777814805921,
	author = "Sezavar, A. and Brites, C. and Ascenso, J. and Ebrahimi, T.",
	title = "A learning-based lossless event data compression for computer vision applications",
	booktitle = "Proc. SPIE 13605, Applications of Digital Image Processing XLVIII",
	year = "2025",
	editor = "",
	volume = "",
	number = "",
	series = "",
	doi = "10.1117/12.3068095",
	pages = "33-39",
	publisher = "SPIE",
	address = "San Diego, United States",
	organization = ""
}
Exportar RIS
TY  - CPAPER
TI  - A learning-based lossless event data compression for computer vision applications
T2  - Proc. SPIE 13605, Applications of Digital Image Processing XLVIII
AU  - Sezavar, A.
AU  - Brites, C.
AU  - Ascenso, J.
AU  - Ebrahimi, T.
PY  - 2025
SP  - 33-39
DO  - 10.1117/12.3068095
CY  - San Diego, United States
AB  - Event-based computer vision is becoming very popular. With progress in sensing events, the volume of data produced has increased manyfold, and there is a need for compression. This paper introduces a novel deep-learning-based lossless event data compression codec. The idea is to represent the events as a point cloud with spatial dimensions x and y and temporal dimension t as its coordinates. Then, an adaptive octree structure is created to better compact the latter without introducing any loss by coding the occupancy map. The binary representation of the octree structure, which corresponds to a denser representation of the event data, is then entropy-coded with a learning-based model. The latter is based on using a deep neural network to obtain the probability model of a hyperprior-based arithmetic coder. The proposed hyperprior network architecture includes two neural networks following an auto-encoder structure, which allows the capture of the source statistics effectively.
ER  -