西安电子科技大学学报 ›› 2016, Vol. 43 ›› Issue (4): 160-165+171.doi: 10.3969/j.issn.1001-2400.2016.04.028

• 研究论文 • 上一篇    下一篇

一种深度图的时域下采样编码及重建方法

葛川1;刘琚1,2;元辉1,3;肖依凡1;李凤荣3   

  1. (1. 山东大学 信息科学与工程学院,山东 济南  250101;
    2. 山东大学 苏州研究院,江苏 苏州  215123;
    3. 中国科学院 上海微系统与信息技术研究所 无线传感网与通信重点实验室,上海  200050)
  • 收稿日期:2015-06-29 出版日期:2016-08-20 发布日期:2016-10-12
  • 通讯作者: 刘琚(1965-),男,教授,博士,E-mail:juliu@sdu.edu.cn
  • 作者简介:葛川(1985-),男,山东大学博士研究生,E-mail:gavinkeh.chuan@gmail.com.
  • 基金资助:

    国家自然科学基金资助项目(61201211, 61571274);教育部博士点基金资助项目(20120131120032,20130131110029);山东省优秀中青年科学家奖励基金资助项目(BS2012DX021);中国科学院无线传感网与通信重点实验室开放课题资助项目(2013002);山东大学青年学者未来计划资助项目(2015WLJH39);苏州市科技计划资助项目(SYG201443)

Temporal subsampling based depth maps coding and  the reconstruction method

GE Chuan1;LIU Ju1,2;YUAN Hui1,3;XIAO Yifan1;LI Fengrong3   

  1. (1. School of Information Science and Engineering, Shandong Univ., Jinan  250101, China;
    2. Suzhou Research Institute of Shandong Univ., Suzhou  215123, China;
    3. Key Lab. of Wireless Sensor Network & Communication, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai  200050, China)
  • Received:2015-06-29 Online:2016-08-20 Published:2016-10-12

摘要:

为提高三维视频系统的编码效率,提出了一种针对中间视点的深度时域下采样编码方法.首先确定深度图像的丢弃方式;然后利用基于时间一致性和视点间的相关性,恢复出丢弃的深度图像,并对恢复出的结果进行维纳滤波,进一步提高恢复的深度图的质量.该方法需要向编码端发送维纳滤波器系数和重建深度选择标识符.实验结果表明,在保证同等码率的前提下,解码端重建的虚拟视图的峰值信噪比最大能提升0.229dB,平均提升0.130dB,编码性能得到改善.

关键词: 视频编码, 立体图像处理, 深度图时域下采样, 时间一致性, 视点间相关性

Abstract:

In order to improve the coding efficiency of a three-dimensional video system, a depth temporal subsampling based coding method for the intermediate view is proposed. In this paper, the manner of how to discard depth frames is firstly determined. Then, those discarded depths are reconstructed utilizing temporal consistency and multiview correspondences. Finally, in order to further improve the quality of the reconstructed depth, the wiener filter is implemented on the reconstructed results. Filter coefficients and indicators for selecting the depth predictor should be transmitted to the decoder. Experimental results demonstrate that a maximum 0.229dB and average 0.130dB peak signal-to noise ratio (PSNR) gain could be achieved for the virtual view reconstructed by the decoder while maintaining the same coding bit rate, and that the coding efficiency can be improved.

Key words: video coding, stereo image processing, depth temporal subsampling, temporal consistency, multiview correspondences