Deep spatio-temporal and frequency guided fusion network for event-to-video reconstruction

Ramna Maqsood; Paulo Nunes; Caroline Conti; Luís Ducla Soares

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Artigo em revista científica Q2

Deep spatio-temporal and frequency guided fusion network for event-to-video reconstruction

Ramna Maqsood (Maqsood, R.); Paulo Nunes (Nunes, P.); Caroline Conti (Conti, C.); Luís Ducla Soares (Soares, L. D.);

Título Revista

IEEE Open Journal of Signal Processing

Ano (publicação definitiva)

2026

Língua

Inglês

País

Estados Unidos da América

Mais Informação

Visitar Link

Web of Science®

N.º de citações: 0

(Última verificação: 2026-07-20 20:45)

Ver o registo na Web of Science®

Scopus

N.º de citações: 0

(Última verificação: 2026-07-20 17:28)

Ver o registo na Scopus

Google Scholar

N.º de citações: 0

(Última verificação: 2026-07-22 07:40)

Ver o registo no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

Event-to-video (E2V) reconstruction has gained significant attention recently for its advantages in enabling high dynamic range and fast motion capture capabilities. However, event data encodes only relative brightness changes, lacking the absolute intensity information necessary for accurate reconstruction. Recent methods incorporate previously reconstructed images to provide intensity references but process them in the spatial domain where low- and high-frequency components are highly coupled. This spatial processing typically leads to the degradation of fine details and introduces artifacts such as over-smoothing, blurring and low contrast reconstruction. To address this, we propose a deep spatio-temporal and frequency guided fusion network for E2V reconstruction (DSTFN-E2V), featuring a dual-path architecture with two key components: i) a prior frequency decomposition module (PFDM), and ii) a spatio-temporal event-driven feature extraction module (STEM). The PFDM decouples low- and high-frequency information from previously reconstructed images and current event voxel grid via a 2D discrete wavelet transform, processing the low-frequency subband through residual blocks to preserve structural coherence and intensity references, while an edge-detail refinement module (ERM) enhances edge and texture details from high-frequency subbands. The frequency-specific features from PFDM and the spatio-temporal features from STEM are then integrated through the proposed event-image fusion blocks (EIFBs) that apply cross-attention across three encoder stages, enabling simultaneous structural preservation and detail recovery. Experiments on four real-world datasets demonstrate that DSTFN-E2V achieves state-of-the-art results with 12% SSIM improvements while being 50% faster than recent attention-based methods, with superior edge fidelity and reduced artifacts.

Agradecimentos/Acknowledgements

This work was supported in part by the National funds through FCT – Fundação para a Ciência e a Tecnologia, I.P., and in part by EU funds through Project/support UID/50008/2025 –Instituto de Telecomunicações, with DOI identifier 10.54499/UID/50008/2025.

Palavras-chave

Event-to-video (E2V) reconstruction,Discrete wavelet transforms,Cross-attention networks

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais
Engenharia Eletrotécnica, Eletrónica e Informática - Engenharia e Tecnologia

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
UID/50008/2025	Fundação para a Ciência e a Tecnologia

Contribuições para os Objetivos do Desenvolvimento Sustentável das Nações Unidas

Com o objetivo de aumentar a investigação direcionada para o cumprimento dos Objetivos do Desenvolvimento Sustentável para 2030 das Nações Unidas, é disponibilizada no Ciência_Iscte a possibilidade de associação, quando aplicável, dos artigos científicos aos Objetivos do Desenvolvimento Sustentável. Estes são os Objetivos do Desenvolvimento Sustentável identificados pelo(s) autor(es) para esta publicação. Para uma informação detalhada dos Objetivos do Desenvolvimento Sustentável, clique aqui.

Identificadores da Publicação

WoS (fonte: Ciência_Iscte)	WOS:001779860200001
Scopus (fonte: autor)	2-s2.0-105039328137
DOI (fonte: autor)	10.1109/OJSP.2026.3693230
Scopus (fonte: Ciência_Iscte)	2-s2.0-105039328137
Handle (fonte: Ciência-IUL)	http://hdl.handle.net/10071/37536
WoS (fonte: autor)	WOS:001779860200001
ID Ciência_Iscte	ci-pub-118597

Outros Detalhes da Publicação

Ano Publicação Online	2026
Editora	IEEE
Indexação	Web of Science©; Scopus;
ISSN	2644-1322 (print) 2644-1322 (online)
ISBN	--
Factor de Impacto	--
Volume	7	Número
Série
Número Artigo
Páginas	541 - 550
Avaliado Cientificamente	Sim
Repositório ISCTE-IUL	Link para o repositório
Data Publicação (online)
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics