CIRP Annals - Manufacturing Technology: Christos Gkournelos, Christos Konstantinou, Sotiris Makris
CIRP Annals - Manufacturing Technology: Christos Gkournelos, Christos Konstantinou, Sotiris Makris
A R T I C L E I N F O A B S T R A C T
Article history: The complexity in the collaboration between humans and robots in smart manufacturing remains a signifi-
Available online 22 May 2024 cant challenge. This paper introduces an LLM-based manufacturing execution system enhancing Human-
Robot Collaboration (HRC) in smart manufacturing. By leveraging Large Language Models (LLMs), the system
Keywords: provides a natural language interface for operators, integrates with Digital Twins for real-time data, and
Large language models
employs behavior-based control for robots. This integration facilitates intuitive interactions and rapid system
Human robot collaboration
Manufacturing systems
programming, addressing communication complexities in HRC. The effectiveness of this approach is validated
through two HRC assembly case studies, demonstrating significant improvements in collaboration and effi-
ciency.
© 2024 The Author(s). Published by Elsevier Ltd on behalf of CIRP. This is an open access article under the CC
BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
https://doi.org/10.1016/j.cirp.2024.04.002
0007-8506/© 2024 The Author(s). Published by Elsevier Ltd on behalf of CIRP. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/)
10 C. Gkournelos et al. / CIRP Annals - Manufacturing Technology 73 (2024) 9 12
The paper is organized as follows. The Section 2 presents the sys- instrumental in minimizing hallucination issues. By defining a
tematic methodology and the implementation details of the system. sequence of reasoning steps, this method narrows the response
In Section 3, the application of the proposed system in two industrial space, thereby focusing on targeted, relevant outputs.
case studies is presented. Section 4 presents the evaluation of the sys- The Interaction Agent is primarily engaged in handling the conver-
tem while in Section 5 the conclusions and future developments are sation with the users. It employs a ’plan-and-execute’ strategy and is
reported. equipped with a conversational memory buffer, enhancing its ability
to manage and sustain user interactions. To meet the natural interac-
2. LLM-based execution system for HRC tion requirement, a series of tests were conducted to select the most
suitable LLM for this agent. GPT-3.5 emerged as the preferred choice,
The proposed LLM-based manufacturing execution system is attributed to its high accuracy in generating responses and optimal
designed to integrate and enhance HRC in smart manufacturing envi- response time. This selected model has been fine-tuned using CoT,
ronments. The system is characterized by several key features: enabling it to effectively discern whether an input should be redir-
ected to the Manufacturing Agent or addressed as a general query.
1. System knowledge: Access to manufacturing proprietary knowl- The Manufacturing Agent addresses questions related to the HRC
edge bases and data sources. assembly deployed in the system. A custom LLM, HRC-GPT, based on
2. Online information: Integration with the Digital Twin (DT) of the the gpt-4 0613 model [13], was specifically trained for this purpose.
system allowing the access of dynamically updated data. This fea- This agent is responsible for generating comprehensive production
ture is mandatory when there is a need to obtain and use the run- schedules or modifying existing tasks in the running assembly sched-
time state of the system. ule. HRC-GPT possesses intrinsic knowledge of the production sys-
3. Scalability: Modular architecture for allowing easy reconfigura- tem, a capability enhanced through the integration of manufacturing
tion based on application requirements. The training of the AI data via vector augmentation and prompting techniques, as outlined
components is performed independently to easily add new capa- in Section 2.2. Endowed with "manufacturing reasoning," HRC-GPT is
bilities without compromising the performance of the existing adept at creating accurate assembly schedules and interfacing with
ones. the respective modules that manage resource control. This function-
4. Explainability: A transparent process that can be monitored by ality underscores its pivotal role in the HRC process, enabling precise
humans allowing a level of control in the observations and deci- and efficient coordination of manufacturing activities.
sions.
2.2. Modules
The framework of the proposed system is illustrated in Fig. 1. The
proposed LLM-based approach for HRC is based on an extendable set The methodology proposed in this paper includes two main mod-
of modules, each serving a distinct function within the HRC environ- ules that serve as bridges to the physical world: the Human Interac-
ment. There are two primary categories of modules: a) Agents and b) tion Module and the Robot Behavior Planning module.
Modules. The Agents are neural modules including LLMs, fine-tuned The Human Interaction module is designed to facilitate natural
LMs, or any other AI enabled component with the objective to classify communication between human operators and the system. It is
an action based on inputs. Modules are symbolic tools that imple- equipped with Natural Language Processing (NLP) capabilities,
ment action functions called by Agents. These modules are used for including both speech-to-text and text-to-speech functionalities.
connecting external APIs, such as databases, robot controllers, etc. This module plays a dual role in the LLM-based system: firstly, as
an input interface, enabling human operators to interact seamlessly
with the system through natural language; and secondly, as an out-
put interface, effectively communicating system-generated informa-
tion back to the human users. Its design focuses on ensuring that
interactions are intuitive and user-friendly, thereby enhancing the
overall efficiency and effectiveness of the HRC process.
In managing robotic resources, the system adheres to the Behav-
ior Trees (BTs) methodology [13]. This approach is particularly advan-
tageous across various applications due to its structured manner of
representing and controlling autonomous robots. The Robot Behavior
Planning module plays a critical role in this process. It is tasked with
receiving the BTs generated by the Manufacturing Agent and subse-
quently relaying them for execution. This methodology empowers
the LLM system to produce precise outputs that are not only theoreti-
cally sound but also practically executable in the physical world.
Through this integration, the LLM system transcends traditional
boundaries, effectively bridging the gap between high-level planning
and real-world implementation.
Fig. 1. The proposed LLM-based approach for HRC execution system. 2.3. Manufacturing data and training
The proposed system consists of one formating Agent with NLP The training of the LLM for the specific HRC knowledge integra-
capabilities, two language model (LM) Agents (Interaction Agent tion into our assembly system employs a comprehensive approach,
and Manufacturing Agent), and two action Modules (Robot Behavior utilizing vectorized representations of diverse data sources. As
Planning, Human Interaction). depicted in Fig. 2, the input manufacturing data is organized in sev-
eral structured files. This data encompasses several critical compo-
2.1. Agents nents: process planning details, comprehensive resource
information, geometric layout data, and a collection of primitive
Within the proposed LLM-based execution system for HRC, agents behaviors designed for robotic execution.
are integral neural modules tasked with system observation and Resource data provides an extensive inventory of tools and equip-
interaction. The system incorporates two specialized agents: the ment, outlining their constraints and operational limits. These data
"Interaction Agent" and the "Manufacturing Agent." Both agents are are considered crucial for facilitating improved resource allocation
fine-tuned utilizing the chain-of-thought (CoT) [12] method, which is aiding the planning of task scheduling. Geometric layout data,
C. Gkournelos et al. / CIRP Annals - Manufacturing Technology 73 (2024) 9 12 11
Table 1
Number of tasks in the industrial use cases.
E-motor 16 15 14 1
Air compressor 14 14 14 6
extracted from CAD and Digital Twin sources, precisely map the posi-
tions and orientations of all components within the manufacturing
cell, ensuring spatial accuracy in planning. The process plan data
delineates the sequence of assembly and disassembly tasks, while
the primitive behaviors represent a suite of fundamental actions exe-
cutable by robots. Fig. 3. HRC schedule generation from a simple language instruction.
To effectively incorporate this data into the LLM, it undergoes
transformation into a vectorized format using an embedding model. 3.2. Dynamic reconfiguration
Each data segment is processed to generate a semantic vector. This
vectorization is performed using the text-embedding-ada-002 model The second use case for the LLM-based execution system involves
[13], which produces a 1536-dimensional output matrix for each its role in facilitating dynamic interaction with the human operator
data segment. This matrix is used as a foundation for the creation of during execution phases. Central to this interaction is an innovative
the HRC-GPT. The custom trained model employs a cosine similarity interface, named CollabAI, which is deployed on Augmented Reality
metric [14] to retrieve relevant data segments, which are conceptual- (AR) glasses and serves as a critical communication link between the
ized as neighboring vectors in the vector space. The system then gen- operator and the system (Fig. 4). This AR interface is designed to sup-
erates a prompt that combines this retrieved data with the internal port both voice and text modalities. The LLM system provides real-
memory summary. This method in collation with the CoT prompt time guidance and feedback. This level of interaction enhances the
engineering enables the generation of contextually relevant and pre- operator’s situational awareness and decision-making capabilities
cise responses for the HRC system. and ensures that the manufacturing process remains flexible and
responsive to real-time changes.
3. Case study
Intermediate dataset is enriched with additional information on capabilities. The proposed solution showcased an end to end (from
each task’s process time by the resources. It allows for the analysis of programming to execution) application of an LLM system in an actual
two KPIs: Resource Utilization and Assembly Cycle Time. HRC assembly station. The deployment and evaluation of this
Complex dataset: Incorporating all the information related to the approach using real industrial data have highlighted the significant
assembly process, this is the complete dataset of each case. It enables potential of this system in practical scenarios.
the generation of schedules considering up to five criteria: Resource Future work will go deeper into refining training methodologies
Utilization, Cycle Time, Non-adding Value Activities Time, Ergonom- and prompt engineering strategies to improve the system’s perfor-
ics, and Safety. mance in complex reasoning involving multiple parameters.
The experimental methodology was structured around these three Additionally, another goal is a dataset creation using diverse
dataset categories. Each use case was associated with a group of data- manufacturing tasks, serving as a benchmark to assess and compare
sets, and an additional combined case was created by merging data methodologies, further advancing in the field.
from both the electric motor and the air compressor assembly cases.
A diverse set of input queries were crafted for the experiments. Declaration of competing interest
Initially, one query was designed for each dataset case and type. Sub-
sequently, the general GPT-4 model was utilized to generate multiple The authors declare that they have no known competing financial
variations of the initial query, validating performance across different interests or personal relationships that could have appeared to influ-
phrasings. For the combined case, the queries from both the electric ence the work reported in this paper.
motor and air compressor assembly were employed. Starting from 30
distinct input queries (5 for simple, 10 for intermediate, and 15 for
CRediT authorship contribution statement
complex), our augmentation technique expanded this to a set of
1000 input queries, with 500 for each case (50 for simple, 100 for
Christos Gkournelos: Conceptualization, Formal analysis, Soft-
intermediate, and 350 for complex).
ware, Writing original draft, Writing review & editing. Christos
The output evaluation involved comparing the HRC-GPT’s
Konstantinou: Software, Validation, Writing original draft, Writing
responses to the outputs of the AI-task planning tool [15], which
review & editing. Sotiris Makris: Conceptualization, Funding
served as a ground truth. For each input query, the task planning tool
acquisition, Project administration, Writing review & editing.
was configured accordingly. For the evaluation three metrics were
employed:
Acknowledgments
Format Accuracy (FA): This metric evaluates the HRC-GPT’s
responsiveness to deliver a structurally correct output. This research has been supported by the “CONVERGING Social
Executability (EX): This assesses the system’s ability to deliver industrial collaborative environments integrating AI, Big Data and
functionally correct schedule, ready for execution. "Correct" here Robotics for smart manufacturing (Grant Agreement: 101058521)”.
refers to the accurate assignment of resources to tasks they can
perform. Supplementary materials
Goal Accuracy (GA): This measures the correlation between the
expected output schedule and the generated one, particularly in Supplementary material associated with this article can be found
terms of KPI compliance. in the online version at doi:10.1016/j.cirp.2024.04.002.