簡易檢索 / 詳目顯示

研究生: 劉光宇
Liu, Kuang-Yu
論文名稱: 利用第一人稱視覺之駕駛視角偵測與估測技術並用於駕駛輔助系統
Viewing Angle Detection and Estimation Systems for Driving Assistance using First-Person-Vision
指導教授: 黃仲陵
Huang, Chung-Lin
林嘉文
Lin, Chia-Wen
口試委員: 范國清
余孝先
黃仲陵
林嘉文
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2013
畢業學年度: 102
語文別: 英文
論文頁數: 41
中文關鍵詞: 駕駛視角第一人稱視覺駕駛輔助
外文關鍵詞: gaze direction, first-person-vision, driver assistance
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 行車中,駕駛者的缺乏專注力往往是造成行車事故發生的原因,本論文的目的在建立一套駕駛者輔助系統,此系統可應用於智慧型車輛,來避免由於駕駛者的不當駕駛造成的事故發生。本篇論文使用第一人稱視覺架構,有別於傳統固定式攝影機的環境,是由使用者配戴可攜式攝影機直接取得前方影像,代表使用者眼睛所觀察到的。本論文共有兩個主要的技術:車內外視角偵測及視角角度估測。在第一項技術,我們利用了“bag of words”影像分類的方法,事先蒐集資料庫並應用FAST+BRIEF的特徵萃取方法。之後我們利用此方法將輸入影像轉換成特徵向量,將此利用SVM分類器來偵測輸入影像是車內或車外,進而辨別駕駛者目前的專注程度。在第二項技術,我們額外架設一台行車紀錄器來擷取車輛前方的影像並且作為視角角度估測的參考座標軸。透過雙攝影機的影像特徵比對方法,來估測兩台攝影機的世界座標運動轉換關係,進而估測駕駛者目前的視角方向。本論文實驗結果顯示,我們可以達到即時的估測出駕駛者的視角方向,以及在駕駛者視角車內影像偵測也有良好的準確程度。


    This thesis proposes an intelligent vehicle system to identify the driver improper driving behavior using the so-called “first-person vision” (FPV) technology. Different from conventional computer vision, FPV is based on the person wearing a goggles camera representing the subject vision. There are two technologies proposed in this thesis: vehicle exterior/interior view detection and driver viewing angle estimation. For the first method, we use “bag of words” image classification approach by applying FAST+BRIEF feature descriptor in the dataset collected in advance. Then, we establish the first-person vision “vocabulary dictionary”, encoding an input image into a feature vector. Finally, we apply SVM classifier to detect whether the input image is from the inside of a vehicle, and further identifying the driver current attention. For the second method, we install an extra vehicle-mounted camera to record the image in front of the vehicle and to deem it as world coordinate reference for viewing angle estimation. Then, we find the relationship between the world and first-person-vision camera coordinate. Finally, we may further estimate the viewing direction of driver.

    Chapter 1 Introduction 1 1.1 Motivation 1 1.2 Related Works 2 1.3 System Overview 4 Chapter 2 Feature Extraction 6 2.1 FAST Keypoints Detector 6 2.1.1 Method 6 2.1.2 Machine Learning a Corner Detector 7 2.1.3 Non-maximal Suppression 9 2.2 BRIEF Feature Descriptor 9 2.2.1 Method 9 Chapter 3 Vehicle Exterior/Interior View Detection 11 3.1 Bag of Words Model 11 3.1.1 Vocabulary Tree 12 3.1.2 Weighting 14 3.2 SVM Classifier 15 Chapter 4 Gaze Direction Estimation 16 4.1 Camera Model 16 4.2 Epipolar Geometry 17 4.2.1 Computation of the Fundamental Matrix 19 4.3 Essential Matrix 20 4.3.1 Extraction from the Essential Matrix 20 4.4 Decomposition of Rotation Matrix 22 4.5 Correspondences Matching 24 Chapter 5 Experiment Results and Discussion 25 5.1 Comparison of Feature Extraction 25 5.2 Vehicle Interior View Detection 27 5.3 Gaze Direction Estimation 30 5.4 Execution Time 36 Chapter 6 Conclusion and Future Work 37 References 38

    [1] M. Miyaji, H. Kawanaka and K. Oguri, "Driver's cognitive distraction detection using physiological features by the adaboost," in ITSC, St. Louis, MO, 2009.
    [2] E. Murphy-Chutorian, A. Doshi and M. Trivedi, "Head Pose Estimation for Driver Assistance Systems: A Robust Algorithm and Experimental Evaluation," in ITSC, Seattle, WA, 2007.
    [3] H. Badino and T. Kanade, "A Head-Wearable Short-Baseline Stereo System for the Simultaneous Estimation of Structure and Motion," IAPR Conference on Machine Vision Applications, 6 2011.
    [4] T. Kanade and M. Hebert, "First-Person Vision," Proceedings of the IEEE, vol. 100, no. 8, pp. 2442 - 2453, 2012.
    [5] Y. Liang, M. Reyes and J. Lee, "Real-Time Detection of Driver Cognitive Distraction Using Support Vector Machines," Intelligent Transportation Systems, IEEE Transactions on, vol. 8, no. 2, pp. 340 - 350, 6 2007.
    [6] D. Galvez-López and J. Tardos, "Bags of Binary Words for Fast Place Recognition in Image Sequences," Robotics, IEEE Transactions on, vol. 28, no. 5, pp. 1188 - 1197, 10 2012.
    [7] H. Pirsiavash and D. Ramanan, "Detecting activities of daily living in first-person camera views," in CVPR, Providence, RI, 2012.
    [8] E. Rosten and T. Drummond, "Machine learning for high-speed corner detection," in ECCV, 2006.
    [9] O. Aghazadeh, J. Sullivan and S. Carlsson, "Novelty Detection from an Ego-Centric Perspective," in CVPR, 2011.
    [10] O. Aghazadeh, J. Sullivan and S. Carlsson, "Multi View Registration for Novelty/Background Separation," in CVPR, 2012.
    [11] T. D'Orazio, M. Leo, C. Guaragnella and A. Distante, "A visual approach for driver inattention detection," Pattern Recognition, vol. 40, no. 8, pp. 2341-2355, 8 2007.
    [12] M. Calonder, V. Lepetit, C. Strecha and P. Fua, "BRIEF: binary robust independent elementary features," in ECCV, 2010.
    [13] Y. Dong, Z. Hu, K. Uchimura and N. Murayama, "Driver Inattention Monitoring System for," Intelligent Transportation Systems, IEEE Transactions on, vol. 12, no. 2, 6 2011.
    [14] H. Kang, A. A. Efros, M. Hebert and T. Kanade, "Image Matching in Large Scale Indoor Environment," in CVPR, 2009.
    [15] Y. Kameda and Y. Ohta, "Image Retrieval of First-Person Vision for Pedestrian Navigation in Urban Area," in ICPR, 2010.
    [16] A. Doshi and M. Trivedi, "Investigating the relationships between gaze patterns, dynamic vehicle surround analysis, and driver intentions," IEEE Intelligent Vehicles Symposium, 2009.
    [17] J. Sivic and A. Zisserman, "Video Google: a text retrieval approach to object matching in videos," in ICCV, 2003.
    [18] R. a. Z. A. Hartley, Multiple View Geometry in Computer Vision, Second ed., Cambridge University Press, ISBN: 0521540518, 2004.
    [19] D. Nister and H. tewenius, "Scalable Recognition with a Vocabulary Tree," in CVPR, 2006.
    [20] J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81-106, 1986.
    [21] C. C. Chang and C. J. Lin, "LIBSVM : a library for support vector machines," ACM Transactions on Intelligent Systems and Technology, pp. 27:1--27:27, 2011.
    [22] J. Thorsten, "Text categorization with Support Vector Machines: Learning with many relevant features," Lecture Notes in Computer Science, pp. 137-142, 1998.
    [23] Z. Zhang, "A flexible new technique for camera calibration," Pattern Analysis and Machine Intelligence, IEEE Transactions on, 10 2000.
    [24] P. S. Richard I. Hartley, Triangulation, 1995.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE