研究生: |
許嘉容 Hsu, Chia-Jung |
---|---|
論文名稱: |
針對影片及指尖追蹤應用的立體匹配技術 Stereo Matching Techniques for Videos and Fingertip-tracking Applications |
指導教授: |
王家祥
Wang, Jia-Shung |
口試委員: |
王家祥
Jia-Shung Wang 葉梅珍 Mei-Chen Yeh 陳煥宗 Hwann-Tzong Chen |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2014 |
畢業學年度: | 102 |
語文別: | 英文 |
論文頁數: | 62 |
中文關鍵詞: | 區域性立體匹配 、視差圖 、深度圖 、時間上的一致性 、加權傳播 |
外文關鍵詞: | local stereo matching,, disparity map, temporal consistency, weighted propagation, fingertip-tracking |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
三維立體視覺的應用中,如何得到準確的深度資訊是一大關鍵議題,其中立體匹配演算法是解決此問題的方法之一。本論文提出一個利用條件式傳遞(weighted propagation)的方式綜合顏色和空間結構的資訊,並針對邊界、遮蔽和信賴度低的區域做修正,得到視差圖做為深度資訊。此方法可以針對不同的應用做彈性調整達到内容感知(content-aware)及加速的效果。第一,應用於影片,加入運動向量的考量達到時間上的連續性,並且利用內差法達到加速的效果;第二,應用於指間追蹤,導入皮膚色的特質使手部的深度資訊更趨完善,同時減少深度等級的運算降低運算量。實驗結果顯示,我們提出的方法可以快速且準確的得到視差圖,且針對不同的應用皆突顯我們的優勢。
How to get accurate depth information is an essential issue in computer vision. Stereo matching has been proved to be the effective way to compute such dense and reliable disparity maps. In this thesis, several stereo matching algorithms are proposed to compute disparity maps for several kinds of applications, such as videos, fingertip-tracking, etc. Basically, these algorithms are local approach with the concept of weighted propagation combining both color and spatial structure simultaneously. Also, several refinement techniques are joined to improve the accuracy on border, occluded and unreliable pixels. Furthermore, for the applications of video processing, the issue of temporal consistency is considered and solved. And, for the fingertip-tracking applications, the skin-color is heavily weighted to enhance the data integrity within the hand regions. The experimental results show that the proposed algorithms provide comparable high-quality disparity maps for both images and videos and suitable for some applications, such as fingertip-tracking.
[1] D. Scharstein and R. Szeliski, “A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms,” International Journal of Computer Vision, vol. 47, no. 1-3, 2002, pp 7-42. http://www.middlebury.edu/stereo/
[2] Q. Yang, L. Wang, R. Yang, H. Stewénius, and D. Nistér, “Stereo matching with color-weighted correlation, hierarchical belief propagation and occlusion handling,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 3, March 2009, pp.492-504.
[3] V. Kolmogorov and R. Zabih, “Computing Visual Correspondence with Occlusions using Graph Cuts,” IEEE International Conference on Computer Vision (ICCV), vol.2, 2001, pp. 508 - 515.
[4] Z. Wang and Z. Zheng, “A region based stereo matching algorithm using cooperative optimization,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2008, pp. 1-8.
[5] O. Veksler, “Stereo correspondence by dynamic programming on a tree,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2005, pp. 384 -390.
[6] K. Yoon and S. Kweon, “Adaptive Support-Weight Approach for Correspondence Search,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 4, April 2006, pp.650-656.
[7] X. Mei, X. Sun, M. Zhou, S. Jiao, H. Wang, and X. Zhang, “On Building an Accurate Stereo Matching System on Graphics Hardware,” IEEE International Conference on Computer Vision Workshops (ICCV Workshops), November 2011, pp. 464-474.
[8] L. De-Maeztu, A. Villanueva, and R. Cabeza, “Near Real-time Stereo Matching Using Geodesic Diffusion,” IEEE Transactions on Pattern Analysis and Machine Intelligence, February 2012, vol. 34, no. 2, pp. 410-416.
[9] S. Mattoccia, S. Giardino, and A. Gambini, “Accurate and efficient cost aggregation strategy for stereo correspondence based on approximated joint bilateral filtering,” 9th Asian Conference on Computer Vision (ACCV), September 2009, pp 371-380.
[10] C. Rhemann, A. Hosni, M. Bleyer, C. Rother, and M. Gelautz, “Fast cost-volume filtering for visual correspondence and beyond,” IEEE Transactions on Pattern Analysis and Machine Intelligence, February 2013, vol. 35, no 2, pp.504-511.
[11] http://www.minoru3d.com/
[12] Dongbo Min ; Jiangbo Lu ; Do, M.N. , “Joint Histogram-Based Cost Aggregation for Stereo Matching,” IEEE Transactions on Pattern Analysis and Machine Intelligence, October 2013, vol. 35, no 10, pp.2539-2545.
[13] Cuong Cao Pham and Jae Wook Jeon, “Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching,” IEEE Transactions on Circuits and Systems for Video Technology, July 2013, vol. 23, no. 7, pp. 1119-1130.
[14] Qingqing Yang, Dongxiao Li, Lianghao Wang, and Ming Zhang, “Full-image Guided Filtering for Fast Stereo Matching,” IEEE Signal Processing Letters, March 2013, vol. 20, no. 3, pp. 237-240.
[15] Cevahir Cigla, A. Aydın Alatan, “Information Permeability for Stereo Matching,” Signal Processing: Image Communication, October 2013, vol. 28, no. 9, pp. 1072–1088.
[16] Michael Bleyer and Margrit Gelautz, “Temporally Consistent Disparity Maps from Uncalibrated Stereo Videos,” Proceedings of the 6th International Symposium on Image and Signal Processing and Analysis (ISPA), September 2009, pp. 383-387.
[17] C. Richardt, D. Orr, I. Davies, A. Criminisi, and N. Dodgson, “Real-time spatiotemporal stereo matching using the dual-cross-bilateral grid,” European Conference on Computer Vision (ECCV), September 2010, vol. 6313, pp.510-523.
[18] Asmaa Hosni, Christoph Rhemann, Michael Bleyer, and Margrit Gelautz, “Temporally Consistent Disparity and Optical Flow via Efficient Spatio-temporal Filtering,” 5th Pacific Rim Symposium on Advances in Image and Video Technology (PSIVIT), November 2011, Lecture Notes in Computer Science vol. 7087, pp 165-177.
[19] Jing Chen, Canhui Cai, Cuihua Li, “A Stereo Object Segmentation Algorithm Based On Disparity and Temporal-spatial Information,” 2012 International Symposium on Intelligent Signal Processing and Communications Systems (ISPACS), November 2012, pp.745-748.
[20] Qi Wang, Xilin Chen, Wen Gao, “Skin Color Weighted Disparity Competition for Hand Segmentation from Stereo Camera,” Proceedings of the British Machine Vision Conference, 2010, pp.66.1-66.11.
[21] Dung Duc Nguyen, Thien Cong Pham, Jae Wook Jeon, “Fingertip Detection with Morphology and Geometric Calculation,” IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 2009, pp.1460-1465.
[22] Hui Liang, Junsong Yuan, and Daniel Thalmann, “3D Fingertip and Palm Tracking in Depth Image Sequences,” Proceedings of the 20th ACM international conference on Multimedia, October 2012, pp.785-788.
[23] Chia-Ping Chen, Yu-Ting Chen, Ping-Han Lee, Yu-Pao Tsai, and Shawmin Lei, “Real-time Hand Tracking on Depth Images,” IEEE Visual Communications and Image Processing (VCIP), November 2011, pp. 1-4.
[24] L. Jin, D. Yang, L. Zhen, and J. Huang, “A novel vision based finger-writing character recognition system,” 18th International Conference on Pattern Recognition (ICPR), August 2006, vol. 1, pp.1104-1107.
[25] D. Lee and S. Lee, “Vision-based finger action recognition by angle detection and contour analysis,” Electronics and Telecommunications Research Institute Journal, June. 2011, vol. 33, no. 3, pp. 415-422.
[26] Ziyong Feng, Shaojie Xu, Xin Zhang, Lianwen Jin, Zhichao Ye, and Weixin Yang, “Real-time Fingertip Tracking and Detection using Kinect Depth Sensor for a New Writing-in-the Air System,” Proceedings of the 4th International Conference on Internet Multimedia Computing and Service (ICIMCS), September 2012, pp.70-74.
[27] C. Stentoumis, L. Grammatikopoulos, I. Kalisperakis, E. Petsa, and G. Karras, “A local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction,” 5th International Workshop on 3D Virtual Reconstruction and Visualization of Complex Architectures (3D-ARCH), February 2013.
[28] X. Hu and P. Mordohai, “Evaluation of Stereo Confidence Indoors and Outdoors,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2010, pp. 1466-1473.
[29] Won-Hee Lee, Yumi Kim, and Jong Beom Ra, “Efficient Stereo Matching Based on a New Confidence Metric,” 2012 Proceedings of the 20th European on Signal Processing Conference (EUSIPCO), August 2012, pp. 1139-1143.
[30] J. Kovac, P. Peer, F. Solina, “Human skin color clustering for face detection”, The IEEE Region 8 EUROCON 2003-International Conference on Computer as a Tool , September 2003, vol. 2, pp. 144-148.
[31] C. Cigla and A. A. Alatan, “Efficient Edge-Preserving Stereo Matching,” IEEE International Conference on Computer Vision Workshops (ICCV Workshops), November 2011, pp. 696-699.
[32] Zucheul Lee, Jason Juang, and Truong Q. Nguyen, “Local Disparity Estimation with Three-Moded Cross Census and Advanced Support Weight”, IEEE Transactions on Multimedia, December 2013, vol.15, no.8, pp.1855-1864.
[33] HHI database. http://sp.cs.tut.fi/mobile3dtv/stereo-video/