Surriani, Atikah and Maghfiroh, Hari and Wahyunggoro, Oyas and Cahyadi, Adha Imam and Fajrin, Hanifah Rahmi (2025) Discount Factor Parametrization for Deep Reinforcement Learning for Inverted Pendulum Swing-up Control. Buletin Ilmiah Sarjana Teknik Elektro, 7 (1). 56 - 67. ISSN 26857936
Discount_Factor_Parametrization_for_Deep_Reinforce.pdf - Published Version
Restricted to Registered users only
Download (1MB) | Request a copy
Abstract
This study explores the application of deep reinforcement learning (DRL) to solve the control problem of a single swing-up inverted pendulum. The primary focus is on investigating the impact of discount factor parameterization within the DRL framework. Specifically, the Deep Deterministic Policy Gradient (DDPG) algorithm is employed due to its effectiveness in handling continuous action spaces. A range of discount factor values is tested to evaluate their influence on training performance and stability. The results indicate that a discount factor of 0.99 yields the best overall performance, enabling the DDPG agent to successfully learn a stable swing-up strategy and maximize cumulative rewards. These findings highlight the critical role of the discount factor in DRL-based control systems and offer insights for optimizing learning performance in similar nonlinear control problems.
| Item Type: | Article |
|---|---|
| Additional Information: | Cited by: 0; All Open Access; Gold Open Access |
| Uncontrolled Keywords: | Discount Factor; Single Swing-up Inverted Pendulum; Deep Reinforcement Learning (DRL); Deep Deterministic Policy Gradient (DDPG) |
| Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering |
| Divisions: | Faculty of Engineering > Electrical and Information Technology Department |
| Depositing User: | Rita Yulianti Yulianti |
| Date Deposited: | 02 Jun 2026 03:10 |
| Last Modified: | 02 Jun 2026 03:10 |
| URI: | https://ir.lib.ugm.ac.id/id/eprint/24665 |
