TY - GEN
T1 - HEVC saliency map computation
AU - Ammar, Marwa
AU - Mitrea, Mihai
AU - Boujelben, Ismail
AU - Caller, Patrick Le
PY - 2016/1/1
Y1 - 2016/1/1
N2 - This paper investigates whether the information related to the human visual saliency is still preserved at the level of the HEVC compressed stream syntax elements. In this respect, a new saliency model, matched to the peculiarities of this emerging standard is defined. It consists of four elementary maps, describing the four main saliency features: intensity, color, orientation and motion. These maps arc defined based on the energies of the luma and chroma coefficients, on the variations of the intra prediction modes and on the energy of motion vectors, respectively. They are ftisioned according to 48 static and static-dynamic pooling formulas. The results are compared to three state-of-the-art uncompressed (pixel) domain as well as to the MPEG-4 AVC compressed domain saliency maps. It is brought to light that the HEVC saliency model outperforms (with singular exceptions) the state-of-the-art uncompressed domain and is as good as MPEG-4 AVC saliency model. We can thus state that, as its MPEG-4 AVC ancestor, although not designed based upon visual saliency principles, the HEVC compression standard preserves this human visual property at the level of its syntax elements.
AB - This paper investigates whether the information related to the human visual saliency is still preserved at the level of the HEVC compressed stream syntax elements. In this respect, a new saliency model, matched to the peculiarities of this emerging standard is defined. It consists of four elementary maps, describing the four main saliency features: intensity, color, orientation and motion. These maps arc defined based on the energies of the luma and chroma coefficients, on the variations of the intra prediction modes and on the energy of motion vectors, respectively. They are ftisioned according to 48 static and static-dynamic pooling formulas. The results are compared to three state-of-the-art uncompressed (pixel) domain as well as to the MPEG-4 AVC compressed domain saliency maps. It is brought to light that the HEVC saliency model outperforms (with singular exceptions) the state-of-the-art uncompressed domain and is as good as MPEG-4 AVC saliency model. We can thus state that, as its MPEG-4 AVC ancestor, although not designed based upon visual saliency principles, the HEVC compression standard preserves this human visual property at the level of its syntax elements.
UR - https://www.scopus.com/pages/publications/85011101690
U2 - 10.2352/ISSN.2470-1173.2016.16HVEI-107
DO - 10.2352/ISSN.2470-1173.2016.16HVEI-107
M3 - Conference contribution
AN - SCOPUS:85011101690
T3 - Human Vision and Electronic Imaging 2016, HVEI 2016
SP - 72
EP - 79
BT - Human Vision and Electronic Imaging 2016, HVEI 2016
A2 - Pappas, Thrasyvoulos N.
A2 - de Ridder, Huib
A2 - Rogowitz, Bernice E.
PB - Society for Imaging Science and Technology
T2 - Human Vision and Electronic Imaging 2016, HVEI 2016
Y2 - 14 February 2016 through 18 February 2016
ER -