智能网联汽车自动驾驶行为决策方法研究

doi:10.13306/j.1672-3813.2021.03.013

复杂系统与复杂性科学

2021, Vol. 18

Issue (3): 88-94 DOI: 10.13306/j.1672-3813.2021.03.013

本期目录 | 过刊浏览 | 高级检索

智能网联汽车自动驾驶行为决策方法研究

徐泽洲^1,2, 曲大义^1,2, 洪家乐², 宋晓晨²

1.青岛市城市规划设计研究院,山东青岛 266071;
2.青岛理工大学,山东青岛 266520

Research on Decision-making Method for Autonomous Driving Behavior of Connected and Automated Vehicle

XU Zezhou^1,2, QU Dayi^1,2, HONG Jiale², SONG Xiaochen²

1. Institute of Urban Transportation, Qingdao 266071;
2. Qingdao University of Technology, Qingdao 266520,China

摘要
参考文献
相关文章
Metrics

全文: PDF(2165 KB)
输出: BibTeX | EndNote (RIS)

摘要针对在交叉口自动驾驶车辆与其他车辆直行冲突的问题,构建自动驾驶汽车行为决策模型,采用深度强化学习对自动驾驶汽车通过道路交叉口进行训练,让自动驾驶汽车自主决策学习,实现复杂场景的快速控制,并与非支配排序遗传算法对比验证自动驾驶汽车的稳定性。仿真结果表明采用深度确定性策略梯度算法的自动驾驶车辆行为决策方法能够更好地输出速度确保了油门及刹车值的平稳变化,有效解决了自动驾驶汽车的安全和舒适问题。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	徐泽洲
	曲大义
	洪家乐
	宋晓晨

关键词 ：智能网联, 自动驾驶, 深度强化学习, 行为决策, 仿真分析

Abstract：Aiming at the problem of direct conflict between autonomous vehicles and other human-driven vehicles at intersections, an autonomous vehicle behavior decision model is built, and deep reinforcement learning is used to train autonomous vehicles when passing road intersections, allowing autonomous vehicles to make autonomous decisions and achieve fast control of complex scenarios,and the comparison with the non-dominated sorting genetic algorithm-Ⅱ verifies the stability of the autonomous vehicle.The simulation results show that the autonomous vehicle beha-vior decision-making method using the depth deterministic strategy gradient algorithm has better output speed to ensure the smooth changes of the throttle and brake values, and effectively solve the safety and comfort problems of autonomous vehicles.

Key words： intelligent network connection automatic driving deep reinforcement learning decision control simulation analysis

收稿日期: 2021-02-06 出版日期: 2021-06-18

ZTFLH:	U463
	TP18

基金资助:国家自然科学基金(51678320)

通讯作者: 洪家乐(1995-),男,河南鹤壁人,硕士研究生,主要研究方向为交通系统优化。

作者简介: 徐泽洲(1975-),男,山东青岛人,硕士,高级工程师,主要研究方向为交通规划与管理。

引用本文:

徐泽洲, 曲大义, 洪家乐, 宋晓晨. 智能网联汽车自动驾驶行为决策方法研究[J]. 复杂系统与复杂性科学, 2021, 18(3): 88-94.
XU Zezhou, QU Dayi, HONG Jiale, SONG Xiaochen. Research on Decision-making Method for Autonomous Driving Behavior of Connected and Automated Vehicle. Complex Systems and Complexity Science, 2021, 18(3): 88-94.

链接本文:

http://fzkx.qdu.edu.cn/CN/10.13306/j.1672-3813.2021.03.013 或 http://fzkx.qdu.edu.cn/CN/Y2021/V18/I3/88

[1] 杨帆.无人驾驶汽车的发展现状和展望[J].上海汽车,2014(3):35-40.
Yang Fan. Development situation and prospect of driverless vehicle[J].Shanghai Auto,2014(3):35-40.
[2] 马国成. 车辆自适应巡航跟随控制技术研究[D].北京:北京理工大学,2014.
Ma Guocheng. Research on adaptive cruise control tracking system applied for motor vehicles[D].Beijing:Beijing Institute of Technology,2014.
[3] Lange S, Riedmiller M. Deep auto-encoder neural networks in reinforcement learning[C].The 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, 2010.
[4] Lange S, Riedmiller M, Voigtlander A. Autonomous reinforcement learning on raw visual input data in a real world application[C]. The 2012 International Joint Conference on Neural Networks. Brisbance: IEEE, 2012.
[5] Mnih V, Kavukcuoglu K, Silved D, etal.Human-level control through deep reinforcement learning[J]. Nature, 2015, 518(7540):529-533.
[6] Chae H, Kang C M, Kim B D,et al. Autonomous braking system via deep reinforcement learning[J]. 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC). Yokohama:IEEE, 2017.
[7] Sallab A, Abdou M, Perot E, et al. Deep reinforcement learning framework for autonomous driving[J]. Electronic Imaging,2017(19):70-76.
[8] Vasquez R,Farooq B. Multi-objective autonomous braking system using naturalistic dataset[C]. 2019 IEEE Intelligent Transportation Systems Conference (ITSC). Auckland: IEEE, 2019.
[9] Wang Y, Li X P, Yao H D. Review of trajectory optimisation for connected automated vehicles[J]. IET Intelligent Transport Systems,2018,13(4):580-586.
[10] Gerard A U, Jin W L. Mobility and environment improvement of signalized networks through Vehicle-to-Infrastructure (V2I) communications[J]. Transportation Research Part C,2016,68:70-82.
[11] Yao H D,Cui J X,Li X P,et al. A trajectory smoothing method at signalized intersection based on individualized variable speed limits with location optimization[J]. Transportation Research Part D,2018,62:456-473.
[12] Jiang H F,Hu J, An S, et al. Eco approaching at an isolated signalized intersection under partially connected and automated vehicles environment[J]. Transportation Research Part C,2017,79:290-307.
[13] Xu B, Jeff B X G ,Bian Y G, et al. Cooperative method of traffic signal optimization and speed control of connected vhicles at isolated intersections[J]. IEEE Transactions on Intelligent Transportation Systems, 2019, 20(4):1390-1403.
[14] Han X, Ma R, H. Zhang M. Energy-aware trajectory optimization of CAV platoons through a signalized intersection[J]. Transportation Research Part C, 2020, 118: 102652.
[15] 夏伟. 基于深度强化学习的自动驾驶决策仿真[D].深圳:中国科学院大学(中国科学院深圳先进技术研究院),2017.
Xia Wei. Simulation of automatic driving strategy based on deep reinforcement learning[D]. Shenzhen:University of Chinese Academy of Sciences(Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences ),2017.
[16] 范鑫磊,李栋,张尉,等.基于深度强化学习的导弹规避决策训练研究[J].电光与控制,2021,28(1):81-85.
Fan Xinlei, Li Dong, Zhang Wei, et al. Missile evasion decision training based on deep reinforcement learning[J].Electronics Optics & Control,2021,28(1):81-85.
[17] 徐国艳,宗孝鹏,余贵珍,等.基于DDPG的无人车智能避障方法研究[J].汽车工程,2019,41(2):206-212.
Xu Guoyan, Zong Xiaopeng, Yu Guizhen, et al. A research on intelligent obstacle avoidance of unmanned vehicle based on DDPG algorithm[J].Automotive Engineering, 2019, 41(2): 206-212.
[18] 杨顺,蒋渊德,吴坚,等.基于多类型传感数据的自动驾驶深度强化学习方法[J].吉林大学学报(工学版),2019,49(4):1026-1033.
Yang Shun, Jiang Yuande, Wu Jian, et al. Autonomous driving policy learning based on deep reinforcement learning and multi-type sensor data[J].Journal of Jilin University(Engineering and Technology Edition),2019,49(4):1026-1033.
[19] Qi W W, Wang W, Shen B, et al. A modified post encroachment time model of urban road merging area based on lane-change characteristics[J]. IEEE Access, 2020,8:72835-72846.
[20] 樊娇,雷涛,董南江,等.基于改进NSGA-Ⅱ算法的多目标无人机路径规划[DB/OL]. [2021-05-11].http://kns.cnki.net/kcms/detail/14.1138.TJ.20210419.1630.002.html.
Fan Jiao, Lei Tao, Dong Nanjiang, et al. Multi-objective UAV path planning based on an improved NGSA-II[DB/OL]. [2021-05-11].http://kns.cnki.net/kcms/detail/14.1138.TJ.20210419.1630.002.html.

[1]	郑振华, 刘其朋. 基于视觉特征提取的强化学习自动驾驶系统[J]. 复杂系统与复杂性科学, 2020, 17(4): 30-37.
[2]	付帅帅, 陈伟达, 丁军飞, 王丹丹. 政府对“农超对接”发展影响的多方博弈与仿真[J]. 复杂系统与复杂性科学, 2020, 17(3): 52-61.

Viewed

Full text

Abstract

Cited

Shared

Discussed