Deep Reinforcement Learning for Robot Batching Optimization and Flow Control

Max Hildebrand; Rasmus Skovgaard Andersen; Simon Bøgh

doi:10.1016/j.promfg.2020.10.203


Research output: Contribution to journal › Conference article in Journal › Research › peer-review

9 Citations (Scopus)

59 Downloads (Pure)

Abstract

Robot batching is an optimization problem found in many industrial applications. Current state-of-the-art approaches combine heuristic-based parameters with statistical analysis. This requires many tunable parameters, which in turn creates challenges when delivering systems to new customers. We challenge the current state of the art in statistical approaches by presenting a novel application of a policy gradient method for a Deep Reinforcement Learning (DRL/RL) agent. We have developed a Unity simulation framework of an existing robot-batching cell, on which an RL agent is able to train successfully and obtain a policy for performing robot batching, using a tabula rasa approach. The trained agent is capable of packaging 47.86% of 1218 total batches within the prescribed tolerances, with a positive give-away of 8.76%. The application of DRL to robot batching is, to the authors' knowledge, the first of its kind.

Original language	English
Journal	Procedia Manufacturing
Volume	51
Pages (from-to)	1462-1468
Number of pages	7
ISSN	2351-9789
DOIs	https://doi.org/10.1016/j.promfg.2020.10.203
Publication status	Published - Nov 2020
Event	30th International Conference on Flexible Automation and Intelligent Manufacturing, Athens, Greece, 15 Jun 2021 → 18 Jun 2021, https://www.faimconference.org/

Conference

Conference	30th International Conference on Flexible Automation and Intelligent Manufacturing
Country/Territory	Greece
City	Athens
Period	15/06/2021 → 18/06/2021
Internet address	https://www.faimconference.org/

Keywords

Reinforcement Learning
Deep Reinforcement Learning
Artificial Intelligence
Robotics
Smart Manufacturing
Proximal Policy Optimization
Deep Learning

Access to Document

10.1016/j.promfg.2020.10.203 (Licence: CC BY-NC-ND 4.0)

Open Access article: Final published version, 723 KB (Licence: CC BY-NC-ND 4.0)


Cite this

@inproceedings{a8b2b2777ac248148f0ec63b309e9bf3,
title = "Deep Reinforcement Learning for Robot Batching Optimization and Flow Control",
abstract = "Robot batching is an optimization problem found in many industrial applications. Current state-of-the-art approaches combine heuristic-based parameters with statistical analysis. This requires many tunable parameters, which in turn creates challenges when delivering systems to new customers. We challenge the current state of the art in statistical approaches by presenting a novel application of a policy gradient method for a Deep Reinforcement Learning (DRL/RL) agent. We have developed a Unity simulation framework of an existing robot-batching cell, on which an RL agent is able to train successfully and obtain a policy for performing robot batching, using a tabula rasa approach. The trained agent is capable of packaging 47.86% of 1218 total batches within the prescribed tolerances, with a positive give-away of 8.76%. The application of DRL to robot batching is, to the authors' knowledge, the first of its kind.",
keywords = "Reinforcement Learning, Deep Reinforcement Learning, Artificial Intelligence, Robotics, Smart Manufacturing, Proximal Policy Optimization, Deep Learning",
author = "Hildebrand, Max and Andersen, {Rasmus Skovgaard} and B{\o}gh, Simon",
year = "2020",
month = nov,
doi = "10.1016/j.promfg.2020.10.203",
language = "English",
volume = "51",
pages = "1462--1468",
journal = "Procedia Manufacturing",
issn = "2351-9789",
publisher = "Elsevier",
note = "30th International Conference on Flexible Automation and Intelligent Manufacturing, FAIM 2021 ; Conference date: 15-06-2021 Through 18-06-2021",
url = "https://www.faimconference.org/",
}

TY  - GEN
T1  - Deep Reinforcement Learning for Robot Batching Optimization and Flow Control
AU  - Hildebrand, Max
AU  - Andersen, Rasmus Skovgaard
AU  - Bøgh, Simon
PY  - 2020/11
Y1  - 2020/11
N2  - Robot batching is an optimization problem found in many industrial applications. Current state-of-the-art approaches combine heuristic-based parameters with statistical analysis. This requires many tunable parameters, which in turn creates challenges when delivering systems to new customers. We challenge the current state of the art in statistical approaches by presenting a novel application of a policy gradient method for a Deep Reinforcement Learning (DRL/RL) agent. We have developed a Unity simulation framework of an existing robot-batching cell, on which an RL agent is able to train successfully and obtain a policy for performing robot batching, using a tabula rasa approach. The trained agent is capable of packaging 47.86% of 1218 total batches within the prescribed tolerances, with a positive give-away of 8.76%. The application of DRL to robot batching is, to the authors' knowledge, the first of its kind.
AB  - Robot batching is an optimization problem found in many industrial applications. Current state-of-the-art approaches combine heuristic-based parameters with statistical analysis. This requires many tunable parameters, which in turn creates challenges when delivering systems to new customers. We challenge the current state of the art in statistical approaches by presenting a novel application of a policy gradient method for a Deep Reinforcement Learning (DRL/RL) agent. We have developed a Unity simulation framework of an existing robot-batching cell, on which an RL agent is able to train successfully and obtain a policy for performing robot batching, using a tabula rasa approach. The trained agent is capable of packaging 47.86% of 1218 total batches within the prescribed tolerances, with a positive give-away of 8.76%. The application of DRL to robot batching is, to the authors' knowledge, the first of its kind.
KW  - Reinforcement Learning
KW  - Deep Reinforcement Learning
KW  - Artificial Intelligence
KW  - Robotics
KW  - Smart Manufacturing
KW  - Proximal Policy Optimization
KW  - Deep Learning
U2  - 10.1016/j.promfg.2020.10.203
DO  - 10.1016/j.promfg.2020.10.203
M3  - Conference article in Journal
SN  - 2351-9789
VL  - 51
SP  - 1462
EP  - 1468
JO  - Procedia Manufacturing
JF  - Procedia Manufacturing
T2  - 30th International Conference on Flexible Automation and Intelligent Manufacturing
Y2  - 15 June 2021 through 18 June 2021
ER  -
