研究生: |
劉光宇 Liu, Kuang-Yu |
---|---|
論文名稱: |
利用第一人稱視覺之駕駛視角偵測與估測技術並用於駕駛輔助系統 Viewing Angle Detection and Estimation Systems for Driving Assistance using First-Person-Vision |
指導教授: |
黃仲陵
Huang, Chung-Lin 林嘉文 Lin, Chia-Wen |
口試委員: |
范國清
余孝先 黃仲陵 林嘉文 |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
論文出版年: | 2013 |
畢業學年度: | 102 |
語文別: | 英文 |
論文頁數: | 41 |
中文關鍵詞: | 駕駛視角 、第一人稱視覺 、駕駛輔助 |
外文關鍵詞: | gaze direction, first-person-vision, driver assistance |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
行車中,駕駛者的缺乏專注力往往是造成行車事故發生的原因,本論文的目的在建立一套駕駛者輔助系統,此系統可應用於智慧型車輛,來避免由於駕駛者的不當駕駛造成的事故發生。本篇論文使用第一人稱視覺架構,有別於傳統固定式攝影機的環境,是由使用者配戴可攜式攝影機直接取得前方影像,代表使用者眼睛所觀察到的。本論文共有兩個主要的技術:車內外視角偵測及視角角度估測。在第一項技術,我們利用了“bag of words”影像分類的方法,事先蒐集資料庫並應用FAST+BRIEF的特徵萃取方法。之後我們利用此方法將輸入影像轉換成特徵向量,將此利用SVM分類器來偵測輸入影像是車內或車外,進而辨別駕駛者目前的專注程度。在第二項技術,我們額外架設一台行車紀錄器來擷取車輛前方的影像並且作為視角角度估測的參考座標軸。透過雙攝影機的影像特徵比對方法,來估測兩台攝影機的世界座標運動轉換關係,進而估測駕駛者目前的視角方向。本論文實驗結果顯示,我們可以達到即時的估測出駕駛者的視角方向,以及在駕駛者視角車內影像偵測也有良好的準確程度。
This thesis proposes an intelligent vehicle system to identify the driver improper driving behavior using the so-called “first-person vision” (FPV) technology. Different from conventional computer vision, FPV is based on the person wearing a goggles camera representing the subject vision. There are two technologies proposed in this thesis: vehicle exterior/interior view detection and driver viewing angle estimation. For the first method, we use “bag of words” image classification approach by applying FAST+BRIEF feature descriptor in the dataset collected in advance. Then, we establish the first-person vision “vocabulary dictionary”, encoding an input image into a feature vector. Finally, we apply SVM classifier to detect whether the input image is from the inside of a vehicle, and further identifying the driver current attention. For the second method, we install an extra vehicle-mounted camera to record the image in front of the vehicle and to deem it as world coordinate reference for viewing angle estimation. Then, we find the relationship between the world and first-person-vision camera coordinate. Finally, we may further estimate the viewing direction of driver.
[1] M. Miyaji, H. Kawanaka and K. Oguri, "Driver's cognitive distraction detection using physiological features by the adaboost," in ITSC, St. Louis, MO, 2009.
[2] E. Murphy-Chutorian, A. Doshi and M. Trivedi, "Head Pose Estimation for Driver Assistance Systems: A Robust Algorithm and Experimental Evaluation," in ITSC, Seattle, WA, 2007.
[3] H. Badino and T. Kanade, "A Head-Wearable Short-Baseline Stereo System for the Simultaneous Estimation of Structure and Motion," IAPR Conference on Machine Vision Applications, 6 2011.
[4] T. Kanade and M. Hebert, "First-Person Vision," Proceedings of the IEEE, vol. 100, no. 8, pp. 2442 - 2453, 2012.
[5] Y. Liang, M. Reyes and J. Lee, "Real-Time Detection of Driver Cognitive Distraction Using Support Vector Machines," Intelligent Transportation Systems, IEEE Transactions on, vol. 8, no. 2, pp. 340 - 350, 6 2007.
[6] D. Galvez-López and J. Tardos, "Bags of Binary Words for Fast Place Recognition in Image Sequences," Robotics, IEEE Transactions on, vol. 28, no. 5, pp. 1188 - 1197, 10 2012.
[7] H. Pirsiavash and D. Ramanan, "Detecting activities of daily living in first-person camera views," in CVPR, Providence, RI, 2012.
[8] E. Rosten and T. Drummond, "Machine learning for high-speed corner detection," in ECCV, 2006.
[9] O. Aghazadeh, J. Sullivan and S. Carlsson, "Novelty Detection from an Ego-Centric Perspective," in CVPR, 2011.
[10] O. Aghazadeh, J. Sullivan and S. Carlsson, "Multi View Registration for Novelty/Background Separation," in CVPR, 2012.
[11] T. D'Orazio, M. Leo, C. Guaragnella and A. Distante, "A visual approach for driver inattention detection," Pattern Recognition, vol. 40, no. 8, pp. 2341-2355, 8 2007.
[12] M. Calonder, V. Lepetit, C. Strecha and P. Fua, "BRIEF: binary robust independent elementary features," in ECCV, 2010.
[13] Y. Dong, Z. Hu, K. Uchimura and N. Murayama, "Driver Inattention Monitoring System for," Intelligent Transportation Systems, IEEE Transactions on, vol. 12, no. 2, 6 2011.
[14] H. Kang, A. A. Efros, M. Hebert and T. Kanade, "Image Matching in Large Scale Indoor Environment," in CVPR, 2009.
[15] Y. Kameda and Y. Ohta, "Image Retrieval of First-Person Vision for Pedestrian Navigation in Urban Area," in ICPR, 2010.
[16] A. Doshi and M. Trivedi, "Investigating the relationships between gaze patterns, dynamic vehicle surround analysis, and driver intentions," IEEE Intelligent Vehicles Symposium, 2009.
[17] J. Sivic and A. Zisserman, "Video Google: a text retrieval approach to object matching in videos," in ICCV, 2003.
[18] R. a. Z. A. Hartley, Multiple View Geometry in Computer Vision, Second ed., Cambridge University Press, ISBN: 0521540518, 2004.
[19] D. Nister and H. tewenius, "Scalable Recognition with a Vocabulary Tree," in CVPR, 2006.
[20] J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81-106, 1986.
[21] C. C. Chang and C. J. Lin, "LIBSVM : a library for support vector machines," ACM Transactions on Intelligent Systems and Technology, pp. 27:1--27:27, 2011.
[22] J. Thorsten, "Text categorization with Support Vector Machines: Learning with many relevant features," Lecture Notes in Computer Science, pp. 137-142, 1998.
[23] Z. Zhang, "A flexible new technique for camera calibration," Pattern Analysis and Machine Intelligence, IEEE Transactions on, 10 2000.
[24] P. S. Richard I. Hartley, Triangulation, 1995.