Exploring Deep Recurrent Q-Learning for Navigation in a 3D Environment

Rasmus Brejl; Hendrik Purwins; Henrik  Schoenau-Fog

doi:10.4108/eai.16-1-2018.153641

Exploring Deep Recurrent Q-Learning for Navigation in a 3D Environment

Rasmus Brejl, Hendrik Purwins, Henrik Schoenau-Fog

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

202 Downloads (Pure)

Abstract

Learning to navigate in 3D environments from raw sensory input is an important step towards bridging the gap between human players and artificial intelligence in digital games. Recent advances in deep reinforcement learning have seen success in teaching agents to play Atari 2600 games from raw pixel information where the environment is always fully observable by the agent. This is not true for first-person 3D navigation tasks. Instead, the agent is limited by its field of view which limits its ability to make optimal decisions in the environment. This paper explores using a Deep Recurrent Q-Network implementation with a long short-term memory layer for dealing with such tasks by allowing an agent to process recent frames and gain a memory of the environment. An agent was trained in a 3D first-person labyrinth-like environment for 2 million frames. Informal observations indicate that the trained agent navigated in the right direction but was unable to find the target of the environment.

Originalsprog	Engelsk
Artikelnummer	e3
Bogserie	EAI Endrosed Trasactions on Creative Technologies
Vol/bind	18
Udgave nummer	14
Antal sider	5
ISSN	2409-9708
DOI	https://doi.org/10.4108/eai.16-1-2018.153641
Status	Udgivet - 2018

Adgang til dokumentet

10.4108/eai.16-1-2018.153641Licens: CC BY 3.0

Open Access articleForlagets udgivne version, 1,2 MBLicens: CC BY 3.0

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

ViZARTS: Visualization and Adaptive Real-Time Storytelling - for Film, Animation and Games
Fog, H. S., Larsen, B. A., Reng, L., Bruni, L. E., Selvig, D. R., Mødekjær, C., Hussain, A., Pasalic, A., Thomsen, M. R., Ditlevsen, D. H., Gymoese, T. & Risvang, A. K.
01/10/2019 → …
Projekter: Projekt › Forskning
SMILE: SMILE Lab - Samsung Media Innovation Lab for Education
Reng, L. & Fog, H. S.
01/01/2014 → …
Projekter: Projekt › Andet

Citationsformater

@article{b4f013caa94446acb0ab881eeb91038d,

title = "Exploring Deep Recurrent Q-Learning for Navigation in a 3D Environment",

abstract = "Learning to navigate in 3D environments from raw sensory input is an important step towards bridging the gap between human players and artificial intelligence in digital games. Recent advances in deep reinforcement learning have seen success in teaching agents to play Atari 2600 games from raw pixel information where the environment is always fully observable by the agent. This is not true for first-person 3D navigation tasks. Instead, the agent is limited by its field of view which limits its ability to make optimal decisions in the environment. This paper explores using a Deep Recurrent Q-Network implementation with a long short-term memory layer for dealing with such tasks by allowing an agent to process recent frames and gain a memory of the environment. An agent was trained in a 3D first-person labyrinth-like environment for 2 million frames. Informal observations indicate that the trained agent navigated in the right direction but was unable to find the target of the environment.",

keywords = "Reinforcement Learning, Deep Learning, Q-Learning, Deep Recurrent Q-Learning, Artificial Intelligence, Navigation, Game Intelligence ",

author = "Rasmus Brejl and Hendrik Purwins and Henrik Schoenau-Fog",

year = "2018",

doi = "10.4108/eai.16-1-2018.153641",

language = "English",

volume = "18",

journal = "EAI Endrosed Trasactions on Creative Technologies",

issn = "2409-9708",

publisher = "EAI - European Alliance for Innovation",

number = "14",

}

TY - JOUR

T1 - Exploring Deep Recurrent Q-Learning for Navigation in a 3D Environment

AU - Brejl, Rasmus

AU - Purwins, Hendrik

AU - Schoenau-Fog, Henrik

PY - 2018

Y1 - 2018

N2 - Learning to navigate in 3D environments from raw sensory input is an important step towards bridging the gap between human players and artificial intelligence in digital games. Recent advances in deep reinforcement learning have seen success in teaching agents to play Atari 2600 games from raw pixel information where the environment is always fully observable by the agent. This is not true for first-person 3D navigation tasks. Instead, the agent is limited by its field of view which limits its ability to make optimal decisions in the environment. This paper explores using a Deep Recurrent Q-Network implementation with a long short-term memory layer for dealing with such tasks by allowing an agent to process recent frames and gain a memory of the environment. An agent was trained in a 3D first-person labyrinth-like environment for 2 million frames. Informal observations indicate that the trained agent navigated in the right direction but was unable to find the target of the environment.

AB - Learning to navigate in 3D environments from raw sensory input is an important step towards bridging the gap between human players and artificial intelligence in digital games. Recent advances in deep reinforcement learning have seen success in teaching agents to play Atari 2600 games from raw pixel information where the environment is always fully observable by the agent. This is not true for first-person 3D navigation tasks. Instead, the agent is limited by its field of view which limits its ability to make optimal decisions in the environment. This paper explores using a Deep Recurrent Q-Network implementation with a long short-term memory layer for dealing with such tasks by allowing an agent to process recent frames and gain a memory of the environment. An agent was trained in a 3D first-person labyrinth-like environment for 2 million frames. Informal observations indicate that the trained agent navigated in the right direction but was unable to find the target of the environment.

KW - Reinforcement Learning

KW - Deep Learning

KW - Q-Learning

KW - Deep Recurrent Q-Learning

KW - Artificial Intelligence

KW - Navigation

KW - Game Intelligence

U2 - 10.4108/eai.16-1-2018.153641

DO - 10.4108/eai.16-1-2018.153641

M3 - Journal article

SN - 2409-9708

VL - 18

JO - EAI Endrosed Trasactions on Creative Technologies

JF - EAI Endrosed Trasactions on Creative Technologies

IS - 14

M1 - e3

ER -

Exploring Deep Recurrent Q-Learning for Navigation in a 3D Environment

Abstract

Adgang til dokumentet

AUB Link

Fingeraftryk

Projekter

ViZARTS: Visualization and Adaptive Real-Time Storytelling - for Film, Animation and Games

SMILE: SMILE Lab - Samsung Media Innovation Lab for Education

Citationsformater