研究生: |
徐君潔 Hsu, Chun-Chieh |
---|---|
論文名稱: |
Mode Decision using Inter-view Dependencies and Depth Information for Multiview Video Coding 運用深度資訊之快速多視角視訊編碼技術研究 |
指導教授: |
王家祥
Wang, Jia-Shung |
口試委員: |
杭學鳴
潘晴財 |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2011 |
畢業學年度: | 99 |
語文別: | 中文 |
論文頁數: | 55 |
中文關鍵詞: | 多視角編碼 、快速運動向量補償 、快速視差向量補償 、加速模式選擇 、深度資訊 |
外文關鍵詞: | multiview video coding, fast motion estimation, fast disparity estimation, fast mode decision, depth information |
相關次數: | 點閱:1 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
多視角編碼技術使用了運動向量補償(motion estimation)和視差向量補償(disparity estimation)去發掘不同視角影片之間的相關性且利用編碼率失真最佳化的區塊模式選擇(rate-distortion optimization)達到極高的壓縮效果,然而也造成了編碼器的計算量大幅提升。為了降低計算量,許多論文將注意力放在快速運動向量補償演算法、快速視差向量補償演算法或快速模式選擇演算法,但是卻沒有完整的利用鄰近的視角影片壓縮過後提供的訊息。
本篇論文利用不同視角影片彼此的相關性和深度資訊提出了一個快速模式選擇演算法。首先根據鄰近視角影片壓縮時選擇的模式,我們致力於減少每個宏塊(macroblock)需要計算的模式數量,接著結合以臨界值為基礎的提早結束模式選擇機制來進一步的加速模式選擇的流程。第二部分,我們利用運動向量補償的結果去決定是否需要計算視差向量補償。第三部分,我們分別減少運動向量補償和視差向量補償的搜尋範圍。最後,如果此多視角影片含有深度資訊,我們會利用此深度資訊去進一步的縮小視差向量補償的搜尋範圍和過濾掉不適合的參考宏塊。實驗結果顯示,本篇論文可以加速約80%的編碼時間且只造成少部分的失真率下降以及編碼率提高。
Multiview Video Coding (MVC) performs both disparity and motion estimation to exploit inter-view correlations along with the rate distortion optimization to achieve superior coding efficiency, however the encoder load dramatically increases as well. To reduce the stunning computational complexity, many research works focus on fast motion estimation, fast disparity estimation and/or fast mode decision, but are not fully utilized the information supplied by inter-view referencing frames. In this thesis, a fast mode decision algorithm using both inter-view dependencies and depth information is proposed. First, we delicately arrange the minimal candidate modes for each macroblock according to the valued modes corresponding to its inter-view referencing macroblocks, along with the threshold-based early termination to end up the mode decision procedure earlier. Next, we use the result of motion estimation to decide if disparity estimation is required. Third, the search ranges of motion estimation and disparity estimation will be reduced accordingly. Finally, apply the depth information (if available) to refine the search range of disparity estimation and filter out the inappropriate referencing macroblocks as well. The experimental results demonstrate that the proposed algorithm reduces around 80% of the entire encoding time with negligible PSNR drop and bit-rate increasing.
[1] ISO/IEC/JTC1/SC29/WG11, “Multiview Coding Using AVC,” Bangkok, Thailand, Jan. 2006.
[2] A. Vetro, P. Pandit, H. Kimata, and A. Smolic, Joint Multiview Video Model (JMVM 8.0), Joint Video Team, Doc. JVT-AA207, Geneva, CH, April 2008.
[3] X. Xu, and Y. He, "Fast disparity motion estimation in MVC based on range prediction," IEEE International Conference on Image Processing (ICIP), pp.2000-2003, October 2008.
[4] W. Zhu, X. Tian, F. Zhou, and Y. Chen, "Fast disparity estimation using spatio-temporal correlation of disparity field for multiview video coding," IEEE Transactions on Consumer Electronics, vol.56, no.2, pp.957-964, May 2010.
[5] X. Li, D. Zhao, S. Ma, and Wen Gao, "Fast disparity and motion estimation based on correlations for multiview video coding," IEEE Transactions on Consumer Electronics , vol.54, no.4, pp.2037-2044, November 2008.
[6] X. Li, D. Zhao, X. Ji, Q. Wang, and W. Gao, "A fast inter frame prediction algorithm for multi-view video coding," IEEE International Conference on Image Processing (ICIP), vol.3, pp.III-417-III-420, 16-19 September 2007.
[7] X. San, H. Cai, J.-G. Lou, and J. Li, "Multiview image coding based on geometric prediction," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.11, pp.1536-1548, November 2007.
[8] J. Lu, H. Cai, J.-G. Lou, and J. Li, "An epipolar geometry-based fast disparity estimation algorithm for multiview image and video Coding," IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.6, pp.737-750, June 2007.
[9] L. Shen, T. Yan, Z. Liu, Z. Zhang, P. An, and L. Yang, "Fast mode decision for multiview video coding," IEEE International Conference on Image Processing (ICIP), pp.2953-2956, 7-10 November 2009.
[10] Li. Shen, Z. Liu, P. An, R. Ma, and Z. Zhang, "Low-complexity mode decision for MVC," IEEE Transactions on Circuits and Systems for Video Technology, vol.21, no.6, pp.837-843, June 2011.
[11] L. Shen, Z. Liu, T. Yan, Z. Zhang, and P. An, "View-adaptive motion estimation and disparity estimation for low complexity multiview video coding," IEEE Transactions on Circuits and Systems for Video Technology, vol.20, no.6, pp.925-930, June 2010.
[12] Z. Peng, M. Yu, G. Jiang, W. Liu, and F. Shao, "Fast macroblock selection algorithm for multiview video coding based on mode analyses," International Symposium on Intelligent Information Technology Application Workshops (IITAW), pp.1081-1084, 21-22 December 2008.
[13] T.-Y. Kuo, Y.-Y. Lai, and Y.-C. Lo, "Fast mode decision for non-anchor picture in multiview video coding," IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), pp.1-5, 24-26 March 2010.
[14] W. Zhu, X. Tian, F. Zhou, and Y. Chen, "Fast inter mode decision based on textural segmentation and correlations for multiview video coding," IEEE Transactions on Consumer Electronics, vol.56, no.3, pp.1696-1704, August 2010.
[15] D.-H. Han and Y.-L. Lee, "Fast mode decision using global disparity vector for multiview video coding," International Conference on Future Generation Communication and Networking Symposia (FGCNS), vol.3, pp.209-213, 13-15 December 2008.
[16] W. Zhu, W. Jiang, and Y. Chen, "A fast inter mode decision for multiview video coding," International Conference on Information Engineering and Computer Science (ICIECS), pp.1-4, 19-20 December 2009.
[17] M. Ai and J. Wang, "A fast mode decision algorithm for multiview video coding," International Congress on Image and Signal Processing (CISP), vol.7, pp.3252-3257, 16-18 October 2010.
[18] Y.-H. Lin and J.-L. Wu, “A depth information based fast mode decision algorithm for color plus depth-map 3D videos,” IEEE Transactions on Broadcasting, vol.57, no.2, pp.542-550, June 2011.