Please use this identifier to cite or link to this item: http://cmuir.cmu.ac.th/jspui/handle/6653943832/72757
Title: Integration of the A2C Algorithm for Production Scheduling in a Two-Stage Hybrid Flow Shop Environment
Authors: Falk T. Gerpott
Sebastian Lang
Tobias Reggelin
Hartmut Zadek
Poti Chaopaisarn
Sakgasem Ramingwong
Authors: Falk T. Gerpott
Sebastian Lang
Tobias Reggelin
Hartmut Zadek
Poti Chaopaisarn
Sakgasem Ramingwong
Keywords: Computer Science
Issue Date: 1-Jan-2022
Abstract: The paper introduces an approach to apply reinforcement learning (RL) for production scheduling in a two-stage hybrid flow shop (THFS) production system. The Advantage-Actor Critic (A2C) method is used to train multiple agents to minimize the total tardiness and makespan of a production program. The two-stage hybrid flow shop scheduling problem is a NP-hard combinatorial optimization problem that describes a production system with two stages, each consisting of a set of parallel machines. Our concept combines a Discrete-Event Simulation with a pre-implemented RL algorithm using Stable Baselines3. Since similar research often lacks concrete implementation information, the configuration of the OpenAI Gym interface and the agent-environment interaction is presented.
URI: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85127829514&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/72757
ISSN: 18770509
Appears in Collections:CMUL: Journal Articles

Files in This Item:
There are no files associated with this item.


Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.