簡易檢索 / 詳目顯示

研究生: 楊燿聰
Yang, Yao-Tsung
論文名稱: 高效能的多物件偵測研究與硬體實現
Study of Efficient Multiple Object Detection and Hardware Implementation
指導教授: 邱瀞德
口試委員: 范倫達
張添烜
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 通訊工程研究所
Communications Engineering
論文出版年: 2013
畢業學年度: 101
語文別: 英文
論文頁數: 56
中文關鍵詞: 多物件偵測演算法多物件概論提高樹多物件可行性階層低記憶體花費高準確性漸進式階層演算法
外文關鍵詞: Multiple Object Detection Algorithm, Multiple Object Probability Boosting Tree, Multiple Object Capable Cascades, Low Memory Cost, High Accuracy, Boosted Cascade algorithm
相關次數: 點閱:3下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本篇論文研究多物件偵測演算法進而提出改進方法。目的為提升多物件偵測法的準確性及降低所花費的記憶體。針對多物件概率提高樹演算法,我們利用樹節點索引法降低記憶體花費,與利用漸近式階層取代達到高準確性。再者針對多物件可行性階層演算法,我們利用簡化多物件可行性階層的架構,減少弱分類器的個數以及達到高準確性。利用MIT CBCL汽車資料庫,比較以上所有演算法的記憶體花費以及準確性後,簡化多物件可行性階層演算法有效地減少13.161x103 ~ 261.477x103位元的記憶體以及達到高偵測率95.54%與低假陽性率1.94%。

    在多物件漸進式階層演算法的硬體實現,運算積分影像時利用積分窗口減少儲存積分影像的記憶體大小。接著利用平行階層偵測的架構減少偵測時間。此設計晶片核心電路面積1.21mm2並且達到工作頻率100MHz以及30fps速度於處理大小160x120的圖片。


    In this thesis, we study the multiple object detection algorithms and propose improved methods. The goal is to increase detection rate and reduce memory cost in multiple object detection methods. For the multiple object probability boosting tree scheme, we use tree node index to reduce memory cost and use boosted cascade to achieve high detection rate and low false positive rate. Furthermore, in multiple object capable cascade method, we propose the single stage cascade to replace the original parallel cascade structure to reduce the number of weak classifier and achieve high accuracy. For MIT CBCL car database, the multiple object capable cascades algorithm with single stage cascade structure reduces low memory cost by around 13.161x103 ~ 261.477x103 bits and achieves high detection rate of 95.54% and low false positive rate of 1.94% compared to above other algorithms.

    In the hardware implementation of multiple object boosted cascade scheme, we design an efficient the integral window to reduce the memory cost and calculate feature value. In cascade detection, we reduce the number of weak classifiers storage, and we exploit the parallel cascade detection architecture to reduce the detection time. The post-layout chip achieves operation frequency of 100MHz, processing images of 30 fps with size 160x120, and with core area of 1.21 mm2.

    1 Introduction 1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Problem Description . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.3 Goal and Contribution . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.4 Thesis Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2 Related Works 2.1 Probability Boosting Tree (PBT) . . . . . . . . . . . . . . . . . . . 8 2.2 Multiple Object Capable Cascades (MOCCs) . . . . . . . . . . . . . 12 2.3 AdaBoost . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 2.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 3 Efficient Multiple Object PBT 16 3.1 Main Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 3.2 Multiple Object PBT with the Tree Node Index . . . . . . . . . . . 16 3.3 Multiple Object PBT with Boosted Cascade . . . . . . . . . . . . . 21 3.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 4 Efficient MOCCs 4.1 Main Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 4.2 Modified Structure of MOCCs . . . . . . . . . . . . . . . . . . . . . 24 4.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 5 Boosted Cascade Object Detection Framework 5.1 Main Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 5.2 Feature Type and Integral Image . . . . . . . . . . . . . . . . . . . 32 5.3 Learning a Classifier System . . . . . . . . . . . . . . . . . . . . . . 34 5.4 Cascade of Classifier . . . . . . . . . . . . . . . . . . . . . . . . . . 35 5.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36 6 Performance Evaluation 6.1 Setting of Simulation Environments . . . . . . . . . . . . . . . . . . 37 6.2 Software Simulation in Multiple Object Detection . . . . . . . . . . 38 6.3 Performance Evaluation and Comparison with Memory Cost in Multiple Object Detection . . . . . . . . . . . . . . . . . . . . . . . 39 6.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 7 Hardware Implementation 7.1 Overall Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . 41 7.2 Discussion of the Sub-Window . . . . . . . . . . . . . . . . . . . . . 42 7.3 Integral Image Process Unit . . . . . . . . . . . . . . . . . . . . . . 43 7.4 Cascade Detection Unit . . . . . . . . . . . . . . . . . . . . . . . . 45 7.5 Implementation Result . . . . . . . . . . . . . . . . . . . . . . . . . 47 7.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 8 Conclusion and Future Work 8.1 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 8.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

    [1] “http://140.134.32.129/edu/semi-oe/poster-13_les0507/YCLin.pdf," Jul
    ,2013.
    [2] I. Kallenbach, R. Schweier, G. Palm, and O. Lohlein, “Multi-class Object
    Detection in Vision Systems Using a Hierarchy of Cascaded Classifiers," IEEE
    Intelligent Vehicles Symposium, June 13-15, 2006.
    [3] “http://wardsauto.com/news-amp-analysis/vehicle-electronics-growing-bits-
    and-bytes," Jul ,2013.
    [4] Z. Tu, “Probabilistic boosting-tree: learning discriminative models for classification, recognition, and clustering," Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on, Vol. 2, pp.1589-1596 , 2005.
    [5] Z. Tu, K. Narr, P. Dollar, I. Dinov, P. Thompson, and A. Toga, “Brain
    Anatomical Structure Segmentation by Hybrid Discriminative-Generative Models," Medical Imaging, IEEE Transactions on, vol. 27, no. 4, pp. 495-508, 2008.
    [6] A. Ng and M. Jordan, “On discriminative vs. generative classifiers: A comparison of logistic regression and naive Bayes," In Advances in Neural Information
    Processing Systems, 2002.
    [7] P. Viola and M. Jones, “Rapid object detection using a boosted cascade of
    simple features," Proc. IEEE Conf. Computer Vision and Pattern Recognition,
    vol. 1, pp. I-511 to I-518, 2001.
    [8] Y. Freund and R. E. Schapire, “A decision-theoretic generalization of on-line
    learning and an application to boosting," Journal of Computer and System
    Sciences, vol. 55, no. 1, pp. 119-139, 1997.
    [9] C. Papageorgiou and T. Poggio, “A trainable object detection system: Car detection in static images," October, 1999.
    [10] M. Oren, C. Papageorgiou, P. Sinha, E. Osuna and T. Poggio, “Pedestrian detection using wavelet templates," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 193-199, 1997.
    [11] C. Papageorgiou, M. Oren and T. Poggio, “A general framework for object detection," Computer Vision, 1998. Sixth International Conference on IEEE, pp. 555-562, 1998.
    [12] C. Papageorgiou , “A trainable system for object detection in images and video
    sequences," 2000.
    [13] C. Papageorgiou and T. Poggio, “A trainable system for object detection," International Journal of Computer Vision, vol. 38.1, pp. 15-33, 2000.
    [14] C. Papgeorgiou, F. Girosi and T. Poggio, “Sparse correlation kernel analysis and reconstruction," 1998.
    [15] C. Papageorgiou and T. Poggio, “Trainable pedestrian detection," Proc. IEEE International Conference on Image Processing on, vol. 4, 1999.
    [16] A. Mohan, “Robust Object Detection in Images by Components," 1999.
    [17] A. Mohan, C. Papageorgiou, and T. Poggio, “Example-based object detection in images by components," IEEE Transactions on Pattern Analysis and Machine Intelligence on, pp. 349-361, 2001.
    [18] J. Stallkamp, M. Schlipsing, J. Salmen and C. Igel, “Man vs. computer:
    Benchmarking machine learning algorithms for traffic sign recognition," pp. 323-
    332, March, 2012.
    [19] M. Everingham, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman,
    “The pascal visual object classes (voc) challenge," International journal of computer vision, pp. 303-338, 2010.
    [20] T. T. Son and S. Mita, “Car detection using multi-feature selection for varying poses," IEEE Intelligent Vehicles Symposium, pp. 507-512, June, 2009.
    [21] Yi Tang, Wei-Ming Liu, and Wu Jianwei, “An improved pedestrian detection algorithm based on Adaboost cascading stucture," 2010 8th World Congress on
    Intelligent Control and Automation (WCICA), IEEE, pp. 6321-6326, 2010.
    [22] J. Jiao,Z. Zheng,J. Park,Y. L. Murphey, and Y. Luo, “A robust multi-class traffic sign detection and classification system using asymmetric and symmetric features," IEEE International Conference on Systems, Man and Cybernetics, pp. 3421-3427, October, 2009.
    [23] C. R. Chen, W. S. Wong, and C. T. Chiu, “A 0.64 mm real-time cascade face detection design based on reduced two-field extraction," IEEE Transactions on 55 Very Large Scale Integration (VLSI) Systems, vol. 19, no. 11, pp. 1937-1948, 2011.
    [24] W. S. Wong,C. R. Chen,and C. T. Chiu, “A 100MHz hardware-efficient boost cascaded face detection design," IEEE International Conference on Image Processing (ICIP), pp. 3237-3240, November, 2009.
    [25] Y. Hanai,Y. Hori,J. Nishimura, and T. Kuroda, “A versatile recognition processor employing Haar-like feature and cascaded classifier," IEEE International Solid-State Circuits Conference-Digest of Technical Papers (ISSCC), pp. 148-149, February, 2009.
    [26] C. Kyrkou and T. Theocharides, “A flexible parallel hardware architecture for AdaBoost-based real-time object detection," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 19, no. 6, pp. 1034-1047, 2011.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE