研究生: |
江信作 |
---|---|
論文名稱: |
以移動向量為基礎的人類特定動作辨識研究 Motion-based Human Activity Recognition with Spatial Relationship Analysis |
指導教授: | 王家祥 |
口試委員: |
葉梅珍
賴尚宏 |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2013 |
畢業學年度: | 101 |
語文別: | 英文 |
論文頁數: | 39 |
中文關鍵詞: | 視覺監控 、人類活動識別 、動量為基礎的 |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
為了徹底防範公共以及私人場所的安全,監視攝影機被廣泛布置在各個角落以防止竊盜、非法活動、或其他可能威脅大眾安全的事件。而分析這些監視影片的工具用於從評價人類的行為模式來識別潛在的安全風險顯得非常的必須且重要。在這篇論文中,我們把重點放在同時追蹤多人的行蹤並且分析兩兩間的互動關係,利用這些關係來了解他們在做何種活動。有別於一般的根據圖片的方法,我們提出了一種新的方法有效的在近距離的監視影片識別了活動。關鍵的動態特徵以及運動方向從身體的幾個特定的部分(四肢、軀幹)提取出來。像是推人、打人、踢人這些活動就跟關鍵的動態特徵非常相關。我們給定了一些規範來識別這先關鍵的動態特徵。在把所有的人類都標記好了動態特徵後,我們就可以利用簡單的規範來判定任兩人在做何種互動。在每張圖片都得到辨識互動的結果後,利用固定時間區間內的密度來判斷此時間區間有沒有此活動發生。最後再利用SVM的模型把結果近一步的優化。
The surveillance cameras have been largely deployed to record and track moving objects to prevent theft, illegal activities or whatever so as to secure the safety in public or private places prevalently. Tools for analyzing those surveillance videos should thereby be necessary and important for evaluating the behavioral patterns of humans identified as potential security risk. In this thesis, we focus on tracking multiple people simultaneously when they are interacting with each other, and then recognizing their activities as well. A novel motion-based approach, rather than the image-based, to effectively recognize the moving activities is proposed for the near-field visual surveillance. Key actions are extracted and characterized by motion directions of several portions (limbs, torso) of human body. Some activities, such as pushing, punching, kicking, etc. are strongly relative to these key actions associated with blobs of human limbs. We give some rules for identifying these key actions. After key actions on identified human blobs for two-person interactions labeled, we can recognize what activity they are doing by means of simple principle: The activity with the highest density in a fixed-time period will be the result. Finally, a SVM–based training procedure is employed to further refine the decision.
[1] A.Vahdat, Gao Bo, M Ranjbar, G Mori, "A Discriminative Key Pose Sequence Model for Recognizing Human Interactions," Computer Vision Workshops (ICCV Workshops), pp.1729-1736, 2011.
[2] O. Oreifej, R. Mehran, and M. Shah, "Human Identity Recognition in Aerial Image," IEEE Conference on Computer Vision and Pattern Recognition, pp. 709-716, 2010.
[3] Y. Rubner, C. Tomasi, and L. J. Guibas, "A metric for distributions with applications to image databases," Proceedings of the IEEE International Conference on Computer Vision, pp. 59-66, 1998.
[4] N. Dalal and B. Triggs, "Histograms of Oriented Gradients for Human Detection," IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886–893, 2005.
[5] Vahdat, A., Bo Gao, Ranjbar, M. and Mori, G. " A discriminative key pose sequence model for recognizing human interactions ," 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 1729 – 1736, 2011.
[6] Han-Kuen Chiu, Algorithms and applications on medium-scale human crowd surveillance in aerial videos, Master Thesis of National Tsing-Hua University, Hsinchu Taiwan, July 2012.
[7] M. Ryoo and J. Aggarwal, “Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities,” IEEE International Conference on Computer Vision, pp. 1593–1600, 2009.
[8] J.M. Guo,Y.F. Lin, C.H. Hsia, M.H. Shih, and C.S. Hsu, “Hierarchical method for foreground detection using codebook model,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 21, no. 6, pp. 804–815, June 2011.
[9] W. Ge, R. Collins, and R. Ruback, “Vision-based analysis of small groups in pedestrian crowds,” IEEE Transactions on Pattern Analysis and Mach. Intelligence, vol. 34, no. 5, pp. 1003–1016, May 2012.
[10] M.-C. Chang, N. Krahnstoever, and W. Ge, “Probabilistic group-level motion analysis and scenario recognition,” IEEE International Conference on Computer Vision, pp. 250-254, 2011.
[11] M.-C. Chang, and W. Ge., and X Liu, “Probabilistic group-level motion analysis and scenario recognition,” WACV 2012.pp.249-255.
[12] Y. Liu, C. Chang, H Nguyen, “Improved Hand Tracking System,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 22 , no. 5, pp. 697-701.
[13] Hsu-Yung Cheng, Chih-Chia Weng, and Yi-Ying Chen, “ Vehicle detection in aerial surveillance using dynamic Bayesian networks,” IEEE Transactions on Image Processing, vol. 21, no. 4, pp. 2152 –2159, April 2012.
[14] P. Agrawal and P. Narayanan, “ Person De-Identification in Videos,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 21, no. 3, pp. 299 –310, Jan 2011.
[15] C. Beleznai and H. Bischof. “Fast human detection in crowded scenes by contour integration and local shape estimation,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 2246–2253, 2009.