A Speech-enabled Virtual Assistant for Efficient Human-Robot Interaction in Industrial Environments

Chen LI*, Dimitrios Chrysostomou, Hongji Yang

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

1 Citation (Scopus)
27 Downloads (Pure)

Abstract

This paper presents a natural language-enabled virtual assistant (VA), named Max, developed to support flexible and scalable human–robot interactions (HRI) with industrial robots. Regardless of the numerous natural language interfaces already proposed for intuitive HRI on the industrial shop floor, most of those interfaces remain tightly bound with a specific robotic system. Besides, the lack of a natural and efficient human–robot communication protocol hinders the user experience. Therefore three key elements characterize the proposed framework. First, a Client–Server style architecture is introduced so Max can provide a centralized solution for managing and controlling various types of robots deployed on the shop floor. Second, inspired by human–human communication, two conversation strategies, lexical-semantic and general diversion strategies, are used to guide Max's response generation. These conversation strategies were embedded to improve the operator's engagement with the manufacturing tasks. Third, we fine-tuned the state-of-the-art (SOTA) pre-trained model, Bidirectional Encoder Representations from Transformers (BERT), to support a highly accurate prediction of requested intents from the operator and robot services. Multiple experiments were conducted using the latest iteration of our autonomous industrial mobile manipulator, “Little Helper (LH)”, to validate Max's performance in a real manufacturing environment.
Original languageEnglish
Article number111818
JournalJournal of Systems and Software
Volume205
ISSN0164-1212
DOIs
Publication statusPublished - Nov 2023

Bibliographical note

Publisher Copyright:
© 2023 The Author(s)

Keywords

  • Client–server systems
  • Human–robot interaction
  • Interactive systems
  • Natural language processing

Fingerprint

Dive into the research topics of 'A Speech-enabled Virtual Assistant for Efficient Human-Robot Interaction in Industrial Environments'. Together they form a unique fingerprint.

Cite this