Reinforcement learning for the supply chain dynamics problem with production capacity constraints and non-stationary demand | IEEE Conference Publication | IEEE Xplore