研究生: |
蕭一豪 Hsiao, Yi Hao |
---|---|
論文名稱: |
壓縮感測技術應用於物件辨識與追蹤 Compressive Sensing Reconstruction Methods for Object Recognition and Tracking |
指導教授: |
陳朝欽
Chen, Chaur Chin |
口試委員: |
廖弘源
Liao, Hong Yuan 陳煥宗 Chen, Hwann Tzong 黃仲陵 Huang, Chung Lin 張隆紋 Chang, Long Wen 范國清 Fan, Kuo Chin |
學位類別: |
博士 Doctor |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2017 |
畢業學年度: | 105 |
語文別: | 英文 |
論文頁數: | 61 |
中文關鍵詞: | 壓縮感測 、物件追蹤 、物件辨識 |
外文關鍵詞: | Compressive Sensing, Object Tracking, Object Recognition |
相關次數: | 點閱:1 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
壓縮感知(compressive sensing, compressed sensing)是一新穎的信號處理技術,透過重建演算法用遠低於Nyquist-Shannon取樣定理的取樣頻率來完整重建原始信號。近年來,壓縮感知已被廣泛應用於各領域,本論文基於壓縮感知的稀疏表示與重建演算法概念,提出優化方法應用於物件辨識與追蹤,以獲得較佳的精確性。
一種基於壓縮感知的稀疏表示分類法(sparse representation-based classification, SRC)近期被提出來應用於人臉辨識,本論文基於稀疏表示分類法概念,提出一個局部排序最大概率方法(SRC-maximum probability of the partial ranking, SRC-MP),所提方法應用於人臉與魚類資料庫,實驗結果顯示提出的方法比基於投影方法:如主成分分析(PCA)、線性判別分析(LDA)、2DPCA、2DLDA,與匹配追蹤(Matching pursuit, MP)相關方法:如正交匹配追蹤(Orthogonal matching pursuit, OMP)、壓縮採樣匹配追蹤(Compressive sampling matching pursuit, CoSaMP)、子空間追蹤(subspace pursuit, SP) 與正則正交匹配追蹤(regularized OMP, ROMP),可達到更佳的辨識精準度。
壓縮追蹤(compressive tracking)是一個基於壓縮感知的高效即時追蹤方法,本論文基於壓縮追蹤與稀疏表示分類法概念,提出一個樣本蒐集與稀疏樣本表示方法(sparse sample collection and representation, SSCR),在樣本蒐集方法上,透過整合背景相減法與壓縮追蹤來提升樣本蒐集的準確性。在疏稀樣本表示方法上,將預測樣本以正樣本與負樣本的稀疏方式來表示,並透過重建演算法計算每個預測樣本權重,具最大權重係數的預測樣本即為追蹤結果。同時,在重建演算法部分,本論文提出一個權重調整與動態更新正交匹配追蹤(re-weighting and dynamically updating OMP, RwDuOMP)方法來提升重建效能。所提方法被應用於具複雜環境的台灣墾丁真實世界水下影片,實驗結果顯示提出的方法有效增進魚類追蹤準確度。
Compressive sensing (Compressed sensing, CS) is a novel sampling technique which adopts reconstruction algorithms to reconstruct original signals from significantly fewer samples than those using the Nyquist-Shannon sampling theorem. Recently, several researches have been conducted to apply the CS framework to various applications. In this thesis, we propose improved reconstruction methods based on CS for object recognition and tracking.
In recent years, a sparse representation-based classification (SRC) method based on CS is presented for robust face recognition. Our first proposed enhancement is adopting a maximum probability of the partial ranking method based on the framework of SRC, called SRC-MP. It computes the maximum probability from the largest γ weighting coefficients for the subjects. The criterion of selection is now based on the maximum probability, instead of the largest weighting coefficients. Experiments are implemented on face and real-world fish databases. Experimental results show that our proposed method is able to achieve higher accuracy than projection-based methods, such as principal component analysis (PCA), linear discriminant analysis (LDA), 2DPCA and 2DLDA, and matching pursuit related algorithms, such as orthogonal matching pursuit (OMP), compressive sampling matching pursuit (CoSaMP), subspace pursuit (SP), and regularized OMP (ROMP).
On the other hand, a real-time compressive tracking (CT) method based on CS is proposed for object tracking. Our proposed enhancement is implemented for a sparse sample collection and representation (SSCR) method, based on CT and SRC concepts, for real-world fish tracking. The SSCR consists of sample collection and sparse sample representation procedures. The sample collection procedure incorporates background subtraction into CT to improve the accuracy of collecting sets of three kinds of samples (positive, negative, and predictive). The sparse sample representation procedure represents each predictive sample as a sparse linear combination of all positive and negative samples. The weights of the predictive samples are computed using our proposed re-weighting and dynamically updating orthogonal matching pursuit (RwDuOMP) method. The RwDuOMP method includes three procedures, picking over samples, re-weighting the picked samples, and dynamically updating negative samples. The predictive sample with the maximum weighting coefficient is regarded as the target object tracking result. We evaluate the SSCR method using several challenging real-world underwater sequences from an uncontrolled open sea in Taiwan. In addition, we compare the RwDuOMP method with OMP, CoSaMP, SP and ROMP methods. Experimental results indicate that our proposed method improves the accuracy of fish tracking.
[Avi2004] S. Avidan, “Support Vector Tracking,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 8, pp. 1064-1072, 2004.
[Bar2007] R.G. Baraniuk, “Compressive Sensing,” IEEE Signal Processing Magazine, vol. 24, no. 4, pp. 118-121, 2007.
[Boy2004] S. Boyd, and L. Vandenberghe, “Convex Optimization,” Cambridge University Press, 2004.
[Can2005] E. Candes, and J. Romberg, “l1-MAGIC: Recovery of Sparse Signals via Convex Programming,” Techical Report, Caltech, October, 2005.
[Can2006] E. Candes, J. Romberg, and T. Tao, “Stable Signal Recovery from Incomplete and Inaccurate Measurements,” Communications on Pure and Applied Mathematics, vol. 59, no. 8, pp. 1207-1223, 2006.
[Can2008A] E. Candes, and M. Wakin, “An Introduction to Compressive Sampling,” IEEE Signal Processing Magazine, vol. 25, no. 2, pp. 21-30, 2008.
[Can2008B] E. Candes, “The Restricted Isometry Property and its Implications for Compressed Sensing,” Comptes Rendus Mathematique, vol. 346, no. 9, pp. 589-592, 2008.
[Che2001] S.S. Chen, D.L. Donoho, and M.a. Saunders, “Atomic decomposition by basis pursuit,” SIAM review., vol. 43, no. 1, pp. 129-159, 2001.
[Che2008] C.C. Chen, Y.S. Shieh, and H.T. Chu, “Face Image Retrieval by Projection-based Features,” The 3rd International Workshop on Image Media Quality and its Applications, pp. 138-144, 2008.
[Com2003] D. Comaniciu, V. Ramesh, and P. Meer, “Kernel-based Object Tracking,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 5, pp. 564-575, 2003.
[Cre2003] D. Cremers, and C. Schnorr, “Statistical Shape Knowledge in Variational Motion Segmentation,” Image and Vision Computing, vol. 21, no. 1, pp. 77-86, 2003.
[Dai2009] W. Dai, and O. Milenkovic, “Subspace Pursuit for Compressive Sensing Signal Reconstruction,” IEEE Transactions on Information Theory, vol. 55, no. 5, pp. 2230-2249, 2009.
[Dav2011] M.A. Davenport, M.F. Duarte, Y.C. Eldar, and G. Kutyniok, “Introduction to Compressed Sensing,” in Compressed Sensing: Theory and Applications, Cambridge University Press, 2011.
[Don2005] D.L. Donoho, and J. Tanner, “Sparse Nonnegative Solution of Underdetermined Linear Equations by Linear Programming,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 27, pp. 9446-9551, 2005.
[Don2006A] D.L. Donoho, “Compressed Sensing,” IEEE Transactions on Information Theory, vol. 52, no. 4, pp. 1289-1306, 2006.
[Don2006B] D.L. Donoho, “For Most Large Underdetermined Systems of Linear Equations, the Minimal l1-norm Solution is also the Sparsest Solution,” Communications on Pure and Applied Mathematics, vol. 59, no. 6, pp.797-829, 2006.
[Hua2007] S.S. Huang, L.C. Fu, and P.Y. Hsiao,“Region-level Motion-based Background Modeling and Subtraction Using MRFs,” IEEE Transactions on Image Processing, vol. 16, no. 5, pp. 1446-1456, 2007.
[Hsi2014] Y.H. Hsiao, F.P. Lin, S.I. Lin, and C.C. Chen, “Real-World Underwater Fish Recognition and Identification, using Sparse Representation,” Ecological Informatics, vol. 23, pp. 13-21, 2014.
[Hsi2016A] Y.H. Hsiao, and C.C. Chen, “Over-Atoms Accumulation Orthogonal Matching Pursuit Reconstruction Algorithm for Fish Recognition and Identification,” International Conference on Pattern Recognition, pp. 1066-1071, 2016.
[Hsi2016B] Y.H. Hsiao, and C.C. Chen, “A Sparse Sample Collection and Representation Method Using Re-weighting and Dynamically Updating OMP for Fish Tracking,” International Conference on Image Processing, pp. 3494-3497, 2016.
[Li2005] M. Li, and B. Yuan, “2D-LDA: A Statistical Linear Discriminant Analysis for Image Matrix,” Pattern Recognition Letters, vol. 26, no. 5, pp. 527-532, 2005.
[Mah2008] V. Mahadevan, and N. Vasconcelos, “Background Subtraction in Highly Dynamic Scenes,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-6, 2008.
[Mal1993] S. Mallat, and Z. Zhang, “Matching Pursuits with Time-frequency Dictionaries,” IEEE Transactions on Signal Processing, vol. 41, no. 12, pp. 3397-3415, 1993.
[Mik1999] S. Mika, G. Ratsch, J. Weston, B. Scholkopf, and K.R. Mullers, “Fisher Discriminant Analysis with Kernels,” IEEE International Workshop on Neural Networks for Signal Processing, vol. 9, pp. 41-48, 1999.
[Nar1978] M. Narasimha, and A. Peterson, “On the Computation of the Discrete Cosine Transform,” IEEE Transactions on Communications, vol. 26, no. 6, pp. 934-936, 1978.
[Nee2009] D. Needell, and J.A. Tropp, “CoSaMP: Iterative Signal Recovery from Incomplete and Inaccurate Samples,” Applied and Computational Harmonic Analysis, vol. 26, no. 3, pp. 301-321, 2009.
[Nee2010] D. Needell, and R. Vershynin, “Signal Recovery from Incomplete and Inaccurate Measurements via Regularized Orthogonal Matching Pursuit,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 2, pp. 310-316, 2010.
[Pic2004] M. Piccardi, “Background Subtraction Techniques: a Review,” IEEE International Conference on System, Man and Cybernetics, pp. 3099-3104, 2004.
[Qai2013] S. Qaisar, R.M. Bilal, W. Lqbal, M. Naureen, and S. Lee, “Compressive Sensing: From Theory to Applications, a Survey,” Journal of Communications and Networks, vol. 15, no. 5, pp. 443-456, 2013.
[Rio1991] O. Rioul, and M. Vetterli, “Wavelets and signal processing,” IEEE Signal Processing Magazine, vol. 8, pp. 14-38, 1991.
[Sha1949] C.E. Shannon, “Communication in the Presence of Noise,” Proceedings of the IRE, vol. 37, no. 1, pp. 10-21, 1949.
[Sha2007] Y. Sharon, J. Wright, and Y. Ma, “Computation and Relaxation of Conditions for Equivalence between l1 and l0 Minimization,” IEEE Transactions on Information Theory, 2007.
[Shi2010] Y.H. Shiau, Y.H. Chen, Y.H. Tseng, K.T. Cheng, J.S. Lin, S.W. Lo, and H.M. Chou, “A Real-Time High-Resolution Underwater Ecological Observation Streaming System,” International Society for Photogrammetry and Remote Sensing Technical Commission VIII Symposium, pp. 517-521, 2010.
[Shi2012A] Y.H. Shiau, and C.C. Chen, “A Sparse Representation Method with Maximum Probability of Partial Ranking for Face Recognition,” IEEE International Conference on Image Processing, pp. 1445-1448, 2012.
[Shi2012B] Y.H. Shiau, F.P. Lin, and C.C. Chen, “Using Sparse Representation for Fish Recognition and Verification in Real World Observation,” Proc. Int. Workshop on Visual Observation and Analysis of Animal and Insect Behavior (VAIB), in conjunction with International Conference on Pattern Recognition, 2012.
[Shi2013] Y.H. Shiau, C.C. Chen, and S.I. Lin, “Using Bounding-Surrounding Boxes Method for Fish Tracking in Real World Underwater Observation,” International Journal of Advanced Robotic Systems, vol. 10, pp. 1-6, 2013.
[Spa2008] C. Spampinato, G. Nadarajan, Y.H. Chen-Burger, and R.B. Fisher, “Detecting, tracking and counting fish in low quality unconstrained underwater videos,” International Conference on Computer Vision Theory and Applications, vol. 2, pp. 514-520, 2008.
[Sta1999] C. Stauffer, and W.E.L. Grimson, “Adaptive Background Mixture Models for Real-Time Tracking,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 246-252, 1999.
[Tro2007] J.A. Tropp, and A.C. Gilbert, “Signal Recovery from Random Measurements via Orthogonal Matching Pursuit,” IEEE Transactions. on Information Theory, vol. 53, no. 12, pp. 4655-4666, 2007.
[Tur1991] M. Turk and A. Pentland, “Eigenfaces for Recognition,” Journal of Cognitive Neuroscience, vol.3, no.1, pp. 71-86, 1991.
[Vio2001] P. Viola, and M. Jones, “Rapid Object Detection Using a Boosted Cascade of Simple Features,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 511-518, 2001.
[Wag2012] A. Wagner, J. Wright, A. Ganesh, Z. Zhou, H. Mobahi, and Y. Ma, “Towards a Practical Face Recognition System: Robust Registration and Illumination by Sparse Representation,” IEEE Transactions on Pattern analysis and Machine Intelligence, vol. 34, no. 2, pp. 372-386, 2012.
[Wan2012] J. Wang, S. Kwon, and B. Shim, “Generalized Orthogonal Matching Pursuit,” IEEE Transactions on Signal Processing, vol. 60, no. 12, pp. 6202-6216, 2012.
[Wri2009] J. Wright, A.Y. Yang, A. Ganesh, S.S. Sastry, and Y. Ma, “Robust Face Recognition via Sparse Representation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210-227, 2009.
[Yan2004] J. Yang, D. Zhang, A.F. Frangi, and J.Y. Yang, “Two-Dimensional PCA: a New Approach to Appearance-Based Face Representation and Recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 1, pp. 131-137, 2004.
[Yil2006] A. Yilmaz, O. Javed, and M. Shah, “Object Tracking: A Survey,” ACM Journal of Computing Surveys, vol. 38, no. 4, 2006.
[Zan2003] Q. Zang, and R. Klette, “Object Classification and Tracking in Video Surveillance,” In Computer Analysis of Images and Patterns, pp. 198-205, 2003.
[Zha2012] K. Zhang, L. Zhang, and M.H. Yang, “Real-Time Compressive Tracking,” Proceedings of the 12th European Conference on Computer Vision, pp. 864-877, 2012.
[Zha2014] K. Zhang, L. Zhang, and M.H. Yang, “Fast Compressive Tracking,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 36, no. 10, pp. 2002-2015, 2014.
[Zho2004] S.K. Zhou, R. Chellappa, and B. Moghaddam, “Visual Tracking and Recognition Using Appearance-adaptive Models in Particle Filters,” IEEE Transactions on Image Processing, vol. 13, no. 11, pp. 1491-1506, 2004.
[Web01] http://vision.ucsd.edu/~leekc/ExtYaleDatabase/ExtYaleB.html, last access on Dec. 27, 2016.
[Web02] http://fei.edu.br/~cet/facedatabase.html, last access on Dec. 27, 2016.
[Web03] http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html, last access on Dec. 27, 2016.
[Web04] National Basketball Association, http://www.nba.com
[Web05] http://cv.snu.ac.kr/research/~vtd/, last access on Dec. 27, 2016.