Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning

Andrej Orsula; Simon Bøgh; Miguel Olivares-Mendez; Carol Martinez

doi:10.48550/arXiv.2208.00818

Andrej Orsula^*, Simon Bøgh, Miguel Olivares-Mendez, Carol Martinez

^*Corresponding author

Publication: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-reviewed

Abstract

Extraterrestrial rovers with a general-purpose robotic arm have many potential applications in lunar and planetary exploration. Introducing autonomy into such systems is desirable for increasing the time that rovers can spend gathering scientific data and collecting samples. This work investigates the applicability of deep reinforcement learning for vision-based robotic grasping of objects on the Moon. A novel simulation environment with procedurally-generated datasets is created to train agents under challenging conditions in unstructured scenes with uneven terrain and harsh illumination. A model-free off-policy actor-critic algorithm is then employed for end-to-end learning of a policy that directly maps compact octree observations to continuous actions in Cartesian space. Experimental evaluation indicates that 3D data representations enable more effective learning of manipulation skills when compared to traditionally used image-based observations. Domain randomization improves the generalization of learned policies to novel scenes with previously unseen objects and different illumination conditions. To this end, we demonstrate zero-shot sim-to-real transfer by evaluating trained agents on a real robot in a Moon-analogue facility.

Original language	English
Title of host publication	2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Publisher	IEEE
Publication date	20 Oct 2022
DOI	https://doi.org/10.48550/arXiv.2208.00818
Status	Published - 20 Oct 2022
Event	2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) - Kyoto, Japan. Duration: 23 Oct 2022 → 27 Oct 2022. https://iros2022.org

Conference

Conference	2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Country/Territory	Japan
City	Kyoto
Period	23/10/2022 → 27/10/2022
Internet address	https://iros2022.org

Keywords

Reinforcement Learning
Robotics
Grasping
Space Robotics
Deep Learning
Machine Learning

Access to document

10.48550/arXiv.2208.00818 (License: Other)

https://arxiv.org/pdf/2208.00818.pdf (License: Other)

Other files and links

Source code (GitHub): https://github.com/AndrejOrsula/drl_grasping

Citation formats

@inproceedings{02da444cb2e149288567bf5acb6abcae,

title = "Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning",

abstract = "Extraterrestrial rovers with a general-purpose robotic arm have many potential applications in lunar and planetary exploration. Introducing autonomy into such systems is desirable for increasing the time that rovers can spend gathering scientific data and collecting samples. This work investigates the applicability of deep reinforcement learning for vision-based robotic grasping of objects on the Moon. A novel simulation environment with procedurally-generated datasets is created to train agents under challenging conditions in unstructured scenes with uneven terrain and harsh illumination. A model-free off-policy actor-critic algorithm is then employed for end-to-end learning of a policy that directly maps compact octree observations to continuous actions in Cartesian space. Experimental evaluation indicates that 3D data representations enable more effective learning of manipulation skills when compared to traditionally used image-based observations. Domain randomization improves the generalization of learned policies to novel scenes with previously unseen objects and different illumination conditions. To this end, we demonstrate zero-shot sim-to-real transfer by evaluating trained agents on a real robot in a Moon-analogue facility.",

keywords = "Reinforcement Learning, Robotics, Grasping, Space Robotics, Deep Learning, Machine Learning",

author = "Andrej Orsula and Simon B{\o}gh and Miguel Olivares-Mendez and Carol Martinez",

year = "2022",

month = oct,

day = "20",

doi = "10.48550/arXiv.2208.00818",

language = "English",

booktitle = "2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)",

publisher = "IEEE",

address = "United States",

note = "2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IROS 2022 ; Conference date: 23-10-2022 Through 27-10-2022",

url = "https://iros2022.org",

}

Orsula, A, Bøgh, S, Olivares-Mendez, M & Martinez, C 2022, Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning. in 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23/10/2022. https://doi.org/10.48550/arXiv.2208.00818

Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning. / Orsula, Andrej; Bøgh, Simon; Olivares-Mendez, Miguel et al.
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022.

TY - GEN

T1 - Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning

AU - Orsula, Andrej

AU - Bøgh, Simon

AU - Olivares-Mendez, Miguel

AU - Martinez, Carol

PY - 2022/10/20

Y1 - 2022/10/20

N2 - Extraterrestrial rovers with a general-purpose robotic arm have many potential applications in lunar and planetary exploration. Introducing autonomy into such systems is desirable for increasing the time that rovers can spend gathering scientific data and collecting samples. This work investigates the applicability of deep reinforcement learning for vision-based robotic grasping of objects on the Moon. A novel simulation environment with procedurally-generated datasets is created to train agents under challenging conditions in unstructured scenes with uneven terrain and harsh illumination. A model-free off-policy actor-critic algorithm is then employed for end-to-end learning of a policy that directly maps compact octree observations to continuous actions in Cartesian space. Experimental evaluation indicates that 3D data representations enable more effective learning of manipulation skills when compared to traditionally used image-based observations. Domain randomization improves the generalization of learned policies to novel scenes with previously unseen objects and different illumination conditions. To this end, we demonstrate zero-shot sim-to-real transfer by evaluating trained agents on a real robot in a Moon-analogue facility.

AB - Extraterrestrial rovers with a general-purpose robotic arm have many potential applications in lunar and planetary exploration. Introducing autonomy into such systems is desirable for increasing the time that rovers can spend gathering scientific data and collecting samples. This work investigates the applicability of deep reinforcement learning for vision-based robotic grasping of objects on the Moon. A novel simulation environment with procedurally-generated datasets is created to train agents under challenging conditions in unstructured scenes with uneven terrain and harsh illumination. A model-free off-policy actor-critic algorithm is then employed for end-to-end learning of a policy that directly maps compact octree observations to continuous actions in Cartesian space. Experimental evaluation indicates that 3D data representations enable more effective learning of manipulation skills when compared to traditionally used image-based observations. Domain randomization improves the generalization of learned policies to novel scenes with previously unseen objects and different illumination conditions. To this end, we demonstrate zero-shot sim-to-real transfer by evaluating trained agents on a real robot in a Moon-analogue facility.

KW - Reinforcement Learning

KW - Robotics

KW - Grasping

KW - Space Robotics

KW - Deep Learning

KW - Machine Learning

UR - https://github.com/AndrejOrsula/drl_grasping

U2 - 10.48550/arXiv.2208.00818

DO - 10.48550/arXiv.2208.00818

M3 - Article in proceeding

BT - 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

PB - IEEE

T2 - 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Y2 - 23 October 2022 through 27 October 2022

ER -
