Light field image coding: objective performance assessment of Lenslet and 4D LF data representations

Ricardo Jorge Monteiro; Nuno M. M. Rodrigues; Sérgio M. M. Faria; Paulo Nunes

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Publicação em atas de evento científico Q3

Light field image coding: objective performance assessment of Lenslet and 4D LF data representations

Ricardo Jorge Monteiro (Monteiro, R. J. S.); Nuno M. M. Rodrigues (Rodrigues, N. M. M.); Sérgio M. M. Faria (Faria, S. M. M.); Paulo Nunes (Nunes, P. J. L.);

SPIE Optical Engineering + Applications

Ano (publicação definitiva)

2018

Língua

Inglês

País

Estados Unidos da América

Mais Informação

Visitar Link

Web of Science®

N.º de citações: 4

(Última verificação: 2026-07-21 07:12)

Ver o registo na Web of Science®

Scopus

Esta publicação não está indexada na Scopus

Google Scholar

N.º de citações: 9

(Última verificação: 2026-07-21 14:32)

Ver o registo no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

State-of-the-art light field (LF) image coding solutions, usually, rely in one of two LF data representation formats: Lenslet or 4D LF. While the Lenslet data representation is a more compact version of the LF, it requires additional camera metadata and processing steps prior to image rendering. On the contrary, 4D LF data, consisting of a stack of sub-aperture images, provides a more redundant representation requiring, however, minimal side information, thus facilitating image rendering. Recently, JPEG Pleno guidelines on objective evaluation of LF image coding defined a processing chain that allows to compare different 4D LF data codecs, aiming to facilitate codec assessment and benchmark. Thus, any codec that does not rely on the 4D LF representation needs to undergo additional processing steps to generate an output comparable to a reference 4D LF image. These additional processing steps may have impact on the quality of the reconstructed LF image, especially if color subsampling format and bit depth conversions have been performed. Consequently, the influence of these conversions needs to be carefully assessed as it may have a significant impact on a comparison between different LF codecs. Very few in-depth comparisons on the effects of using existing LF representation have been reported. Therefore, using the guidelines from JPEG Pleno, this paper presents an exhaustive comparative analysis of these two LF data representation formats in terms of LF image coding efficiency, considering different color subsampling formats and bit depths. These comparisons are performed by testing different processing chains to encode and decode the LF images. Experimental results have shown that, in terms of coding efficiency for different color subsampling formats, the Lenslet LF data representation is more efficient when using YUV 4:4:4 with 10 bit/sample, while the 4D LF data representation is more efficient when using YUV 4:2:0 with 8 bit/sample. The “best” LF data representation, in terms of coding efficiency, depends on several factors which are extensively analyzed in this paper, such as the objective metric that is used for comparison (e.g., average PSNR-Y or average PNSR-YUV), the type of LF content, as well as the color format. The maximum objective quality is also determined, by evaluating the influence of each block from each processing chain in the objective quality of the reconstructed LF image. Experimental results show that, when the 4D LF data representation is not used the maximum achieved objective quality is lower than 50 dB, in terms of average PSNR-YUV.

Agradecimentos/Acknowledgements

Palavras-chave

Light field,Light field image coding,Light field data representation,JPEG pleno,Objective performance assessment

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais
Engenharia Eletrotécnica, Eletrónica e Informática - Engenharia e Tecnologia

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
UID/EEA/50008/2013	Fundação para a Ciência e a Tecnologia

Identificadores da Publicação

WoS (fonte: Ciência_Iscte)	WOS:000450861700012
Scopus (fonte: autor)	2-s2.0-85053868697
DOI (fonte: autor)	10.1117/12.2322713
Handle (fonte: Ciência-IUL)	http://hdl.handle.net/10071/17045
ID Ciência_Iscte	ci-pub-53137

Outros Detalhes da Publicação

Ano Publicação Online	2018
Editora	SPIE
Indexação	Web of Science©;
ISSN	0277-786X (print) 0277-786X (online)
ISBN	--
Volume
Número Artigo	107520D
Páginas	--	Total Páginas	17
Avaliado Cientificamente	Sim
Meio de Divulgação	Ambos (impresso e digital)
Editores	Andrew G. Tescher
Título do Evento	--
Organizador do Evento	SPIE
Cidade	San Diego
Tipo de Evento	Conferência
Classificação do Evento	Internacional
Ano do Evento	2018
Tipo de Publicação no Evento	Artigo Completo
Repositório ISCTE-IUL	Link para o repositório
Data Publicação (online)
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics