研究生: |
謝志鈴 Heish, Chih-Ling |
---|---|
論文名稱: |
應用深度強化學習網路於多機器人之避障與地圖導航 Collision Avoidance and Map Navigation of a Multi-Robot System using Deep Reinforcement Learning Network |
指導教授: |
葉廷仁
Yeh, Ting-Jen |
口試委員: |
劉承賢
Liu, Cheng-Hsien 黃浚鋒 Huang, Chung-Feng |
學位類別: |
碩士 Master |
系所名稱: |
工學院 - 動力機械工程學系 Department of Power Mechanical Engineering |
論文出版年: | 2020 |
畢業學年度: | 109 |
語文別: | 中文 |
論文頁數: | 69 |
中文關鍵詞: | 深度強化學習 、地圖導航 、避障 、非完整約束 |
外文關鍵詞: | deep reinforcement learning, map navigation, collision avoidance, nonholonomic constraint |
相關次數: | 點閱:3 下載:2 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本研究利用深度強化學習建立深度神經價值網路來讓群組機器人可以即時避開移動障礙物並完成導航任務。在學習的初始資料蒐集時便將輪型移動的非完整約束納入考量,使得機器人得以做出更有效率且符合運動學限制的行為。強化學習訓練先從雙機器人網路開始,再根據相關測試結果,系統化建立了一套由少數機器人網路訓練及拓展至多機器人網路的架構與流程。最後利用訓練完成的網路,搭配設置虛擬機器人來取代不同地圖環境中的固定障礙物,使得本研究原先在開放空間中訓練的網路,可以適用於不同環境進行地圖導航,而無須重新進行訓練,提升了網路的泛用性。
This thesis applies deep reinforcement learning to build a deep value network for collision avoidance and map navigation of the multiple-robot system. By considering the nonholonomic constraints associated with wheeled mobile robots in the initial learning data collection, the robots can perform more efficient behaviors that conform to the kinematics constraints. Reinforcement learning training starts with a dual-robot navigation network. Then based on the test results, a systematic procedure to constructively extend the dual-robot network to a multi-robot network is proposed. It is also shown that virtual robots can be adopted in the trained navigation network to emulate fixed obstacles in the map environment. By doing so, the network originally trained in the open space can be used for navigation in different map environments without retraining. Both simulations and experiments verify the effectiveness and generality of the multiple-robot navigation network constructed by the proposed approach.
[1] E. W. Dijkstra, "A note on two problems in connexion with graphs," Numerische mathematik, vol. 1, no. 1, pp. 269-271, 1959.
[2] P. E. Hart, N. J. Nilsson, and B. Raphael, "A formal basis for the heuristic determination of minimum cost paths," IEEE transactions on Systems Science and Cybernetics, vol. 4, no. 2, pp. 100-107, 1968.
[3] S. Kuswadi, J. W. Santoso, M. N. Tamara, and M. Nuh, "Application SLAM and Path Planning using A-Star Algorithm for Mobile Robot in Indoor Disaster Area," in 2018 International Electronics Symposium on Engineering Technology and Applications (IES-ETA), 2018, pp. 270-274: IEEE.
[4] T. Chen, G. Zhang, X. Hu, and J. Xiao, "Unmanned aerial vehicle route planning method based on a star algorithm," in 2018 13th IEEE Conference on Industrial Electronics and Applications (ICIEA), 2018, pp. 1510-1514: IEEE.
[5] A. K. Guruji, H. Agarwal, and D. Parsediya, "Time-efficient A* algorithm for robot path planning," Procedia Technology, vol. 23, pp. 144-149, 2016.
[6] H. Samet, "An overview of quadtrees, octrees, and related hierarchical data structures," in Theoretical Foundations of Computer Graphics and CAD: Springer, 1988, pp. 51-68.
[7] H. Noborio, T. Naniwa, and S. Arimoto, "A quadtree‐based path‐planning algorithm for a mobile robot," Journal of Robotic Systems, vol. 7, no. 4, pp. 555-574, 1990.
[8] J.-P. Laumond, S. Sekhavat, and F. Lamiraux, "Guidelines in nonholonomic motion planning for mobile robots," in Robot motion planning and control: Springer, 1998, pp. 1-53.
[9] D. Dolgov, S. Thrun, M. Montemerlo, and J. Diebel, "Practical search techniques in path planning for autonomous driving," Ann Arbor, vol. 1001, no. 48105, pp. 18-80, 2008.
[10] P. Fiorini and Z. Shiller, "Motion planning in dynamic environments using velocity obstacles," The International Journal of Robotics Research, vol. 17, no. 7, pp. 760-772, 1998.
[11] J. Van Den Berg, S. J. Guy, M. Lin, and D. Manocha, "Reciprocal n-body collision avoidance," in Robotics research: Springer, 2011, pp. 3-19.
[12] J. Van den Berg, M. Lin, and D. Manocha, "Reciprocal velocity obstacles for real-time multi-agent navigation," in 2008 IEEE International Conference on Robotics and Automation, 2008, pp. 1928-1935: IEEE.
[13] S. J. Guy et al., "Clearpath: highly parallel collision avoidance for multi-agent simulation," in Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 2009, pp. 177-187.
[14] C. Fulgenzi, A. Spalanzani, and C. Laugier, "Dynamic obstacle avoidance in uncertain environment combining PVOs and occupancy grid," in Proceedings 2007 IEEE International Conference on Robotics and Automation, 2007, pp. 1610-1616: IEEE.
[15] B. Patle, A. Pandey, D. Parhi, and A. Jagadeesh, "A review: On path planning strategies for navigation of mobile robot," Defence Technology, 2019.
[16] P. Long, T. Fanl, X. Liao, W. Liu, H. Zhang, and J. Pan, "Towards optimally decentralized multi-robot collision avoidance via deep reinforcement learning," in 2018 IEEE International Conference on Robotics and Automation (ICRA), 2018, pp. 6252-6259: IEEE.
[17] Y. F. Chen, M. Liu, M. Everett, and J. P. How, "Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning," in 2017 IEEE international conference on robotics and automation (ICRA), 2017, pp. 285-292: IEEE.
[18] Y. F. Chen, M. Everett, M. Liu, and J. P. How, "Socially aware motion planning with deep reinforcement learning," in 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017, pp. 1343-1350: IEEE.
[19] C. Chen, Y. Liu, S. Kreiss, and A. Alahi, "Crowd-robot interaction: Crowd-aware robot navigation with attention-based deep reinforcement learning," in 2019 International Conference on Robotics and Automation (ICRA), 2019, pp. 6015-6022: IEEE.
[20] R. Han, S. Chen, and Q. Hao, "Cooperative Multi-Robot Navigation in Dynamic Environment with Deep Reinforcement Learning," in 2020 IEEE International Conference on Robotics and Automation (ICRA), 2020, pp. 448-454: IEEE.
[21] R. S. Sutton and A. G. Barto, "Reinforcement learning: An introduction," 2011.
[22] A. Sakai, D. Ingram, J. Dinius, K. Chawla, A. Raffin, and A. Paques, "Pythonrobotics: a python code collection of robotics algorithms," arXiv preprint arXiv:1808.10703, 2018.
[23] D. Harris and S. Harris, Digital design and computer architecture. Morgan Kaufmann, 2010.
[24] S. Thrun, W. Burgard, and D. Fox, Probabilistic robotics (Intelligent robotics and autonomous agents). MIT Press, 2005, pp. I-XX, 1-647.