| International Journal of Computer Applications |
| Foundation of Computer Science (FCS), NY, USA |
| Volume 187 - Number 110 |
| Year of Publication: 2026 |
| Authors: Duc Hoang Nguyen |
10.5120/ijcafd4b2cbcdd0d
|
Duc Hoang Nguyen . Adaptive SKU Allocation in Mattress Manufacturing via Multi-Agent Reinforcement Learning under Dynamic Loading Conditions. International Journal of Computer Applications. 187, 110 ( May 2026), 26-33. DOI=10.5120/ijcafd4b2cbcdd0d
Dynamic SKU allocation at loading stations plays a critical role in throughput, line balance, and Overall Equipment Effectiveness (OEE) in mattress assembly lines. Conventional static dispatching rules and equilibrium-based approaches perform adequately under stable conditions but deteriorate when key parameters, such as SKU loading time, vary. This study proposes a decentralized multi-agent reinforcement learning (MARL) approach for adaptive SKU allocation, formulated as a cooperative Dec-POMDP. Each station acts as an agent, making bidding decisions based on local observations, including buffer levels, in-transit items, and station OEE. A softmax-based allocation mechanism and a combined local–global reward are used to encourage both high throughput and balanced operations. A co-simulation framework integrates a Tecnomatix Plant Simulation model with a Python-based MARL system trained using MAPPO. Under nominal conditions, the MARL policy achieves performance comparable to a baseline equilibrium rule in terms of mean OEE and throughput, while maintaining low variability across stations. However, under moderate (30s→45s) and severe (30s→60s) loading-time disruptions, static methods show clear degradation, including reduced OEE, higher variance, and pronounced line imbalance. In contrast, the MARL approach maintains higher OEE and throughput, while improving system balance. These results highlight the effectiveness of decentralized MARL in improving robustness of cyber-physical production systems under dynamic disturbances.