Stereo Video Quality Metric Based on Multi-Dimensional Analysis
Abstract
:1. Introduction
- (1)
- To solve the problem that stereo video contains both inter-view and temporal information, which leads to the difficulty in quality evaluation, this work designs a TVJD model based on correlation analysis, which can describe the characteristics of stereo video more accurately;
- (2)
- To solve the problem that TVJD subbands are substantial and complex, this work classifies the subbands by their own characteristics and generation mechanism. On this basis, the features of different subbands are extracted separately, which makes the features more sensitive to distortions;
- (3)
- To solve the problem that it is difficult to model the influence of temporal fluctuation on stereo video quality, in this work, STEW function is designed by simulating the stimulation of spatial-temporal alteration on the visual system.
2. Related Works
3. Proposed MDA-SVQM
3.1. TVJD for sGoF Decomposition
3.2. TVJD Subband Feature Extraction
3.2.1. High Frequency Subband Feature Extraction
3.2.2. Low Frequency Subband Feature Extraction
3.3. sGoF Quality Prediction
3.4. sGoF Quality Pooling
4. Experimental Results and Analysis
4.1. Stereo Video Database and Perforamnce Indicators
4.1.1. NAMA3D-COSPAD1 Stereo Video Database
4.1.2. NBU-3DV Stereo Video Database
4.1.3. Performance Indicators
- (a)
- PLCC
- (b)
- SROCC
- (c)
- RMSE
4.2. Verification of Each Module in MDA-SVQM
4.2.1. Verification of TVJD
4.2.2. Verification of STEW
4.3. Overall Performance Evaluation
- (1)
- Randomly select the distorted stereo videos in NAMA3D database to construct the test dataset, that is, 80% of the distorted video is used as the training data, and the remaining 20% is the training dataset, and no overlap between two sets;
- (2)
- Adopt NBU-3DV database as the training dataset and test on NAMA3D database.
4.4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Vieira, M.; Guimarães, P.V.; Violante-Carvalho, N.; Benetazzo, A.; Bergamasco, F.; Pereira, H. A Low-Cost Stereo Video System for Measuring Directional Wind Waves. J. Mar. Sci. Eng. 2020, 8, 831. [Google Scholar] [CrossRef]
- Zhang, Q.; Wang, Y.; Wei, T.; Jiang, B.; Gan, Y. A Complexity Reduction Scheme for Depth Coding in 3D-HEVC. Information 2019, 10, 164. [Google Scholar] [CrossRef] [Green Version]
- Zhang, H.; Li, D.; Yu, Y.; Guo, N. Subjective and Objective Quality Assessments of Display Products. Entropy 2021, 23, 814. [Google Scholar] [CrossRef] [PubMed]
- Pan, G.; Paul, M. Rate-Distortion Optimal Joint Texture and Depth Map Coding for 3-D Video Streaming. IEEE Trans. Multimed. 2020, 3, 610–625. [Google Scholar]
- Appina, B.; Dendi, S.V.R.; Manasa, K.; Channappayya, S.S.; Bovik, A.C. Study of subjective quality and objective blind quality prediction of stereoscopic videos. IEEE Trans. Image Process. 2019, 10, 5027–5040. [Google Scholar] [CrossRef] [PubMed]
- Jin, C.; Peng, Z.; Zou, W.; Chen, F.; Jiang, G.; Yu, M. No-Reference Quality Assessment for 3D Synthesized Images Based on Visual-Entropy-Guided Multi-Layer Features Analysis. Entropy 2021, 23, 770. [Google Scholar] [CrossRef] [PubMed]
- Zhou, W.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 4, 600–612. [Google Scholar]
- Zhang, L.; Shen, Y.; Li, H. VSI: A Visual saliency-induced index for perceptual image quality assessment. IEEE Trans. Image Process. 2014, 10, 4270–4281. [Google Scholar] [CrossRef] [Green Version]
- Pinson, M.H.; Wolf, S. A new standardized method for objectively measuring video quality. IEEE Trans. Broadcast. 2004, 3, 312–322. [Google Scholar] [CrossRef]
- Jin, L.; Boev, A.; Gotchev, A.P.; Egiazarian, K. 3D-DCT based perceptual quality assessment of stereo video. In Proceedings of the 2011 IEEE International Conference on Image Processing, Brussels, Belgium, 11–14 September 2011; pp. 2521–2524. [Google Scholar]
- Han, J.; Jiang, T.; Ma, S. Stereoscopic video quality assessment model based on spatial-temporal structural information. In Proceedings of the 2012 IEEE Visual Communications & Image Processing, San Diego, CA, USA, 27–30 November 2012; pp. 1–6. [Google Scholar]
- Saad, M.A.; Pinson, M.H.; Nicholas, D.G.; Van Kets, N.; Van Wallendeal, G.; Da Silva, R.; Jaladi, R.V.; Corriveau, P.J. Impact of camera pixel count and monitor resolution perceptual image quality. In Proceedings of the 2015 Colour and Visual Computing Symposium, Gjovik, Norway, 25–26 August 2015; pp. 1–6. [Google Scholar]
- Ijsselsteijn, W.A.; Eridder, H.D.; Vliegen, J. Subjective evaluation of stereoscopic images: Effects of camera parameters and display duration. IEEE Trans. Circuits Syst. Video Technol. 2000, 2, 225–233. [Google Scholar] [CrossRef]
- Wang, K.; Barkowsky, M.; Cousseau, R.; Brunnström, K.; Sjöström, M. Subjective evaluation of HDTV stereoscopic videos in IPTV scenarios using absolute category rating. In Proceedings of the International Society for Optics and Photonics IS&T/SPIE Electronic Imaging, San Diego, CA, USA, 15 February 2011; p. 78631T. [Google Scholar]
- Seuntiens, P.; Meesters, L.; Ijsselsteijn, W. Perceived quality of compressed stereoscopic images: Effects of symmetric and asymmetric JPEG coding and camera separation. ACM Trans. Appl. Percept. 2006, 2, 95–109. [Google Scholar] [CrossRef]
- Urvoy, M.; Barkowsky, M.; Cousseau, R.; Koudota, Y.; Ricorde, V.; Le Callett, P.; Gutiérrez, J.; García, N. NAMA3DS1-COSPAD1: Subjective video quality assessment database on coding conditions introducing freely available high quality 3D stereoscopic sequences. In Proceedings of the 2012 IEEE International Workshop on Quality of Multimedia Experience, Melbourne, Australia, 5–7 July 2012; pp. 109–114. [Google Scholar]
- Götz-Hahn, F.; Hosu, V.; Lin, H.; Saupe, D. KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild. IEEE Access 2021, 9, 72139–72160. [Google Scholar] [CrossRef]
- Battisti, F.; Carli, M.; Stramacci, A.; Boev, A.; Gotchev, A. A Perceptual quality metric for high-definition stereoscopic 3D video. In Proceedings of the 2015 SPIE/IS&T Electronic Imaging, San Francisco, CA, USA, 8–12 February 2015; p. 939916. [Google Scholar]
- Galkandage, C.; Calic, J.; Dogan, S.; Guillemaut, J.Y. Stereoscopic video quality assessment using binocular energy. IEEE J. Sel. Top. Signal Process. 2016, 1, 102–112. [Google Scholar] [CrossRef] [Green Version]
- Yu, M.; Zheng, K.; Jiang, G.; Shao, F.; Peng, Z. Binocular perception based reduced-reference stereo video quality assessment method. J. Vis. Commun. Image Represent. 2016, 38, 246–255. [Google Scholar] [CrossRef]
- Qi, F.; Zhao, D.; Fan, X.; Jiang, T. Stereoscopic video quality assessment based on visual attention and just-noticeable difference models. Signal Image Video Process. 2016, 10, 737–744. [Google Scholar] [CrossRef]
- Jiang, G.; Liu, S.; Shao, F.; Peng, Z.; Chen, F. No reference stereo video quality assessment based on motion feature in tensor decomposition domain. J. Vis. Commun. Image Represent. 2018, 50, 247–262. [Google Scholar] [CrossRef]
- Narwaria, M.; Lin, W. SVD-based quality metric for image and video using machine learning. IEEE Trans. Syst. Man Cyber. B 2012, 2, 347–364. [Google Scholar] [CrossRef]
- Yang, J.; Zhu, Y.; Ma, C.; Lu, W.; Meng, Q. Stereoscopic video quality assessment based on 3D convolutional neural networks. Neurocomputing 2018, 309, 83–93. [Google Scholar] [CrossRef] [Green Version]
- Depeursinge, A.; Püspöki, Z.; Ward, J.P.; Unser, M. Steerable wavelet machines (SWM): Learning moving frames for texture classification. IEEE Trans. Image Process. 2017, 4, 1626–1636. [Google Scholar] [CrossRef] [Green Version]
- Scholte, H.S.; Ghebreab, S.; Waldorp, L.; Smeulders, A.W.M.; Lamme, V.A.F. Brain responses strongly correlate with Weibull image statistics when processing natural images. J. Vis. 2009, 4, 29. [Google Scholar] [CrossRef]
- Blake, R. Threshold conditions for binocular rivalry. J. Exp. Psychol. Human. 1977, 2, 251–257. [Google Scholar] [CrossRef]
- Ferroukhi, M.; Ouahabi, A.; Attari, M.; Habchi, Y.; Taleb-Ahmed, A. Medical Video Coding Based on 2nd-Generation Wavelets: Performance Evaluation. Electronics 2019, 8, 88. [Google Scholar] [CrossRef] [Green Version]
- Mallik, B.; Sheikh-Akbari, A.; Kor, A. HEVC Based Mixed-Resolution Stereo Video Codec. IEEE Access 2018, 6, 52691–52702. [Google Scholar] [CrossRef]
- Li, S.; Ma, L.; Ngan, K.N. Full-Reference Video quality assessment by decoupling detail losses and additive impairments. IEEE Trans. Circuits Syst. Video Technol. 2012, 7, 1100–1112. [Google Scholar] [CrossRef]
Abbrev. | Full Name | Roles in MDA-SVQM |
---|---|---|
TVJD | Temporal-view Joint Decomposition | sGoF decomposition |
SFJI | Static-Fusion Joint Information | Subbands generated by TVJD |
MFJI | Motion-Fusion Joint Information | |
SRJI | Static-Rivalry Joint Information | |
MRJI | Motion-Rivalry Joint Information | |
SWM | Steerable Wavelet Machine | LOID extraction algorithm |
LOID | Local organization of image direction | Feature map of SFJI and SRJI |
STEW | Spatial-temporal Energy Weighting | sGoF weight calculation |
Notation | Definition |
---|---|
Distorted video, consists of VL,dis and VR,dis | |
i-th distorted sGoF, consists of GL,dis(i) and GR,dis(i). | |
Subbands of MFJI, SFJI, MRJI and SRJI | |
Kurtosis of MFJI and MRJI distribution, used as features of MFJI and MRJI, feature length: 1 × 1 | |
Entropy of MFJI and MRJI coefficients, used as feature of MFJI and MRJI, feature length: 1 × 1 | |
steermax operation denoted with angle maps used in SWM | |
LOID Feature maps of SFJI and SRJI generated by SWM | |
Scale parameter of Weibull distribution used as feature of SFJI and SRJI, feature length: 1 × 1 | |
Shape parameter of Weibull distribution used as feature of SFJI and SRJI, feature length: 1 × 1 | |
Feature vector of high frequency subband, vector length: 1 × 4 | |
Feature vector of an sGoF, vector length: 1 × 12 | |
Quality of i-th sGoF sG(i) | |
Spatial energy weights of pixels in location (i, j) | |
Temporal energy weight of i-th sGoF | |
Spatial-temporal joint energy weights of i-th sGoF | |
Quality of distorted stereo video |
Test Video | TI | DI | Inter-View Low Frequency | Inter-View High Frequency | Temporal Low Frequency | Temporal High Frequency |
---|---|---|---|---|---|---|
Boxers [16] | 129.28 | 9.66 | 182.36 | 33.25 | 155.63 | 20.68 |
Hall [16] | 73.64 | 12.86 | 165.65 | 40.26 | 170.22 | 10.65 |
News Report [16] | 54.57 | 8.06 | 188.39 | 28.48 | 180.65 | 5.65 |
Temporal Pooling Strategy | PLCC | SROCC | RMSE |
---|---|---|---|
Average Pooling | 0.7065 | 0.6947 | 0.7769 |
Asymmetric Pooling [20] | 0.7181 | 0.7077 | 0.7698 |
Fluctuation Pooling [30] | 0.7115 | 0.7054 | 0.7709 |
STEW based Pooling | 0.7216 | 0.7092 | 0.7510 |
Metric | PLCC | SROCC | RMSE |
---|---|---|---|
PSNR | 0.7663 | 0.7419 | 0.7254 |
SSIM [7] | 0.8006 | 0.7934 | 0.5217 |
MNSVQM [22] | 0.8998 | 0.8846 | 0.4608 |
MDA-SVQM | 0.9328 | 0.9226 | 0.3562 |
MDA-SVQM (cross-database) | 0.9004 | 0.8892 | 0.4627 |
Metrics | PLCC | SROCC | RMSE |
---|---|---|---|
PSNR | 0.6699 | 0.6470 | 0.8433 |
SSIM [7] | 0.7664 | 0.7492 | 0.7296 |
VQM [9] | 0.6340 | 0.6006 | 0.8784 |
PHVS-3D [10] | 0.5480 | 0.5146 | 0.9501 |
3D-STS [11] | 0.6417 | 0.6214 | 0.9067 |
BSVQE [19] | 0.8124 | 0.8009 | 0.4952 |
Metric in [21] | 0.6503 | 0.6229 | 0.8629 |
MNSVQM [22] | 0.8545 | 0.8349 | 0.4538 |
MDA-SVQM-1 | 0.8884 | 0.8765 | 0.4451 |
MDA-SVQM-2 | 0.8765 | 0.8698 | 0.4489 |
Distortion | PLCC | SROCC | RMSE |
---|---|---|---|
H.264 | 0.9042 | 0.9071 | 0.4264 |
JP2K | 0.8828 | 0.8796 | 0.4773 |
D&S | 0.8765 | 0.8711 | 0.4902 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
He, Z.; Xu, H.; Luo, T.; Liu, Y.; Song, Y. Stereo Video Quality Metric Based on Multi-Dimensional Analysis. Entropy 2021, 23, 1129. https://doi.org/10.3390/e23091129
He Z, Xu H, Luo T, Liu Y, Song Y. Stereo Video Quality Metric Based on Multi-Dimensional Analysis. Entropy. 2021; 23(9):1129. https://doi.org/10.3390/e23091129
Chicago/Turabian StyleHe, Zhouyan, Haiyong Xu, Ting Luo, Yi Liu, and Yang Song. 2021. "Stereo Video Quality Metric Based on Multi-Dimensional Analysis" Entropy 23, no. 9: 1129. https://doi.org/10.3390/e23091129