심층 큐 네트워크 기반 장애물 회피에 관한 연구 :

김수형

추천

검색

자료유형: 학위논문

저자정보: 김수형 (국민대학교, 국민대학교 자동차공학전문대학원)

지도교수: 박기홍

발행연도: 2019

저작권: 국민대학교 논문은 저작권에 의해 보호받습니다.

이용수14

이 논문의 연구 히스토리 (2)

2019

심층 큐 네트워크 기반 장애물 회피에 관한 연구

김수형 2019.01 학위논문

2018

심층 큐 네트워크 기반 장애물 회피에 관한 연구

김수형 , 박유상 , 안태원 외 1명 한국자동차공학회 춘계학술대회 2018.06 학술대회자료

이 논문의 후속연구가 궁금하신가요?
연관 학술논문 또는 학술발표를 통해 보다 발전된 연구결과를 확인하실 수 있습니다.
이 논문의 연구 히스토리 확인하기

초록· 키워드

오류제보하기

최근 환경 센서 및 차량의 각종 전장부품의 성능이 고도화되고 제어기술이 발전함에 따라, DARPA Urban Challenge, 구글의 Self-driving car 등 해외 자동차업계를 중심으로 IT업계를 포함하여 자율주행자동차에 관한 연구가 활발히 진행되고 있다. 특히, SAE Level5에 해당하는 완전자율주행자동차가 출발지부터 목적지까지 안전하게 주행하기 위해서는 주변 환경을 정확히 인식하고, 적절한 조향각 생성을 통해 장애물을 회피해야 한다.
자율주행자동차가 장애물을 회피하기 위해서는 일반적으로 Path Planning과 Path Tracking 두 가지 알고리즘이 각각 개별적으로 동시에 개발되어야 한다. A*, RRT 등의 Path Planning 알고리즘은 주변 환경이 복잡한 도심지와 같은 상황에서 적용하기 어렵고, Stanely method, Pure pursuit 등의 Path Tracking 알고리즘은 다양한 Path를 정확하게 추종하지 못한다. 또한, MPC(Model Predictive Control)와 같은 모델기반 알고리즘은 연산속도가 느리다는 단점이 있다.
따라서 본 논문에서는 강화학습 알고리즘 중 하나인 DQN(Deep Q-Network)기반의 장애물 회피 알고리즘을 제안한다. 학습된 DQN모델은 기존의 Path Planning 및 Path Tracking 기술을 모두 포함하고 있으며, 특히 차량 동역학적 모델과 같은 복잡한 모델식을 사용하지 않기 때문에 연산속도가 빠르고 복잡한 도심지에서 적용이 가능하다. DQN모델을 사용하기 위해서는 다양한 시나리오에 대해 반복 수행을 하며 Q-Network를 학습시켜야 한다. 이를 위해 Prescan 시뮬레이터(Window PC)를 활용하였으며, Q-Network 개발을 위해서는 Python, Tensorflow(Linux PC)를 활용하였다. 또한 ROS(Robot Operating System)를 통해 토픽메시지를 송수신함으로써, window PC와 Linux PC을 연동하여 가상환경을 구축하였다.
본 논문 구현에서는 10000번의 반복 학습이후 장애물을 능숙하게 피하는 결과를 얻을 수 있었다.

Recently, as the performance of various environmental sensors and various electric parts of vehicles have been improved and control technology has been developed, studies on autonomous vehicles including the IT industry actively proceeding in automobile industries such as DARPA Urban Challenge and Google''s self-car. In particular, in order to safely travel from the starting point to the destination, a fully autonomous driving vehicle corresponding to SAE Level 5 must accurately recognize the surrounding environment and avoid an obstacle by generating an appropriate steering angle.
In order for an autonomous vehicle to avoid an obstacle, two algorithms for path planning and path tracking should be separately developed at the same time. Path planning algorithms such as A * and RRT are difficult to apply in a downtown area where the surrounding environment is complicated, and path tracking algorithms such as the Stanley method and Pure pursuit do not follow various paths precisely. In addition, model-based algorithms such as MPC (Model Predictive Control) have a disadvantage of slow computation speed.
Therefore, in this paper, we propose an obstacle avoidance algorithm based on Deep Q-Network (DQN), one of the reinforcement learning algorithms. The learned DQN model includes both existing path planning and path tracking technologies. Especially, it can be applied in urban areas with high computational speed because it does not use complex model equations such as vehicle dynamics model. In order to use the DQN model, it is necessary to repeat the various scenarios and learn the Q-Network. For this purpose, the PreScan simulator (Window PC) was used and Python and Tensorflow (Linux PC) were used for Q-Network development . In addition, a virtual environment was established by linking window PC and Linux PC from sending and receiving topic messages through ROS (Robot Operating System).
In order to verify the designed DQN and compare the results, we used the result of obstacle avoidance using G29 Logitech Wheel and the result of Cubic Spline algorithm. We verified the results of the DQN than the results of Cubic Spline, which is similar to the result of manipulating the steering wheel directly by G29.

국문요약 viii
제 1 장 Introduction 1
1.1 Background 1
1.2 Research Trends 4
1.3 Scope 9
1.4 Functional Architecture 11
제 2 장 Reinforcement Learning 12
2.1 Definition of Reinforcement Learning 12
2.2 Concept of Reinforcement Learning 14
2.2.1 Markov Decision Process&Bellman Equation 14
2.2.2 Dynamic Programing 18
2.2.3 Monte-Carlo 20
2.2.4 SARSA&Q-learning 22
제 3 장 Environment Construction 27
3.1 ROS Message Communication 29
3.2 Roll of Window PC 31
3.2.1 Road Environment Construction 31
3.2.2 Prscan&Simulink Interworking 33
3.3 Roll of Linux PC 39
3.3.1 State&Action Definition 39
3.3.2 Reward Design 41
제 4 장 Deep-Q-Network Algorithm 46
4.1 Deep Neural Network 46
4.2 Common with DeepMind DQN 52
4.2.1 Experience Replay 52
4.2.2 Target Network 55
4.3 Difference with DeepMind DQN 58
4.3.1 Asynchronous Learning 59
4.3.2 Signal to Signal 61
제 5 장 Simulation 63
5.1 Obstacle Avoidance Result based on DQN 65
5.1.1 Repeated Trial Result of 2000 Episodes 66
5.1.2 Repeated Trial Result of 10000 Episodes 68
5.2 Comparison DQN with Reference Algorithm 70
5.2.1 Car’s Trajectory and Steering angle at 15kph 71
5.2.2 Car’s Trajectory and Steering angle at 30kph 73
5.2.3 All Together Comparison at 30kph 77
제 6 장 Conclusion 80
References 82
Abstract 84
감사의 글 86

최근 본 자료

전체보기

구분	그룹	데이터 항목
AI 학습용 데이터	원문	원문 PDF 파일
AI 학습용 데이터	원문 + 메타 (기본/상세)	원문 PDF 파일 및 서지정보 CSV
대량 구매용 데이터	B2B 구독 방식	특정 자료 한정으로 원문 접근 권한 부여
대량 구매용 데이터	URL 전달 방식	바로 PDF 뷰어를 열람할 수 있는 URL 제공

구분	그룹	데이터 항목
AI 학습용 데이터	기본 메타	발행기관명, 간행물명, 권호명, 권(vol), 호(issue), 통권, 발행연도, 발행월, 논문명, 저자명, 시작페이지, 종료페이지, 전체페이지, 상세페이지URL
상세 메타 데이터	발행기관 메타	발행기관 이명, 영문명, 창립연도, 홈페이지URL, 발행기관 소개
	간행물 메타	부제목, 간행물 유형, ISSN, ISBN, 최초발행연도, 폐간연도, 간행빈도, 발행주기, 등재사항, 이용수, 피인용수, 권호수, 논문수, 표지이미지
	논문 메타	작성 언어, 부제목, 대등제목, 목차, 키워드, 초록, 이미지, 참고문헌, 이용수, 피인용수, 논문활용도, DBpia통합주제분류, KDC분류, DDC분류, 한국연구재단분류, UCI, DOI
	저자 메타	소속기관, 소속부서, 직급, 연구분야, 연구키워드, 이용수, 피인용수, 저자 논문활용도

구분	그룹	데이터 항목
※ 결합형/맞춤형 메타 데이터는 신청 내용에 따라 다양하게 제공 가능
이용순위 정보	주제분야별 많이 이용된 논문	“인문학”에서 많이 이용된 논문 TOP100
	이용기관별 많이 이용된 논문	“중고등학교”에서 많이 이용된 논문 TOP100
	세부기관별 많이 이용된 논문	“서울대학교”에서 많이 이용된 논문 TOP100
	키워드별 많이 이용된 논문	“Chat GPT”에서 많이 이용된 논문 TOP100
키워드 정보	많이 이용된 키워드	특정기간/분야/저널 내 많이 이용된 키워드
	많이 발행된 키워드	특정기간/분야/저널 내 많이 발행된 키워드
	많이 검색된 키워드	특정기간/분야/저널 내 많이 검색된 키워드
	연구 트렌드 키워드	특정 키워드 연관 연구동향 분석 데이터 키워드

논문 기본 정보

이 논문의 연구 히스토리 (2)

초록· 키워드

목차

최근 본 자료

댓글(0)