abstract
- © 2022 IEEE. This paper presents a novel control strategy based on Reinforcement Learning to improve the attitude control performance of quadcopters. The agent is trained with Proximal Policy Optimization through a reward function and interaction with the environment. The control algorithm obtained from this training process is simulated and tested against proportional-integral-derivative (PID) control, the most common attitude control algorithm used in drone races. The resulting control policies were comparable to the PID baseline and, in some cases, outperformed it in terms of noise rejection and robustness to external disturbances.
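
For orientation, the proportional-integral-derivative baseline mentioned in the abstract can be sketched as a single-axis rate controller. This is only a minimal illustration, not the controller evaluated in the paper: the `PID` class, the `simulate_roll` helper, the gains, the inertia value, and the one-axis rigid-body model are all assumptions chosen to keep the example small.

```python
from dataclasses import dataclass


@dataclass
class PID:
    """Textbook discrete PID controller (hypothetical helper, not from the paper)."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: float = 0.0

    def step(self, setpoint: float, measurement: float, dt: float) -> float:
        # Error between the desired and the measured angular rate.
        error = setpoint - measurement
        # Accumulate the integral term and estimate the derivative by finite difference.
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def simulate_roll(pid: PID, target_rate: float = 1.0, inertia: float = 0.01,
                  dt: float = 0.001, steps: int = 2000) -> float:
    """Drive an assumed one-axis model (inertia * d(rate)/dt = torque) toward target_rate."""
    rate = 0.0
    for _ in range(steps):
        torque = pid.step(target_rate, rate, dt)
        rate += (torque / inertia) * dt  # forward-Euler integration
    return rate


if __name__ == "__main__":
    # Illustrative gains only; real flight controllers are tuned per airframe.
    final_rate = simulate_roll(PID(kp=0.1, ki=0.5, kd=0.0))
    print(f"roll rate after 2 s: {final_rate:.4f} rad/s")
```

A learned policy of the kind the abstract describes would take the place of `pid.step`, mapping the observed state to a control command, while the surrounding simulation loop stays the same.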