Abstract
This article addresses the design problem of fractional order PID (FOPID) controllers in environments with disturbances and uncertainties for fixed-wing unmanned aerial vehicles (UAVs). While traditional FOPID controllers offer stable tracking, their performance degrades under dynamic external and internal interferences. To overcome this limitation, we propose a novel hybrid intelligent control strategy, termed PPO-based FOPID, which synergistically combines the stability of control theory with the adaptive learning capabilities of reinforcement learning. In this framework, an FOPID controller provides a baseline control policy, ensuring stability and tracking efficiency. Concurrently, a proximal policy optimization agent acts as a supplementary learning controller, continuously optimizing control commands via an actor-critic mechanism to actively counteract disturbances and uncertainties. The proposed strategy is validated on the attitude control system of a fixed-wing UAV. Simulation results validate the proposed controller’s superior performance. In experiments against conventional PID, FOPID, and an advanced fuzzy PID plus PID hybrid strategy, our method consistently achieves the lowest tracking error. It improves the root mean square error by 3.1\(-\)4.3% under external disturbances and up to 8.1% under parametric uncertainties, demonstrating significantly enhanced robustness and adaptability for high-precision UAV control in complex scenarios.















Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study.
References
Messaoudi K, Oubbati OS, Rachedi A, Lakas A, Bendouma T, Chaib N (2023) A survey of UAV-based data collection: challenges, solutions and future perspectives. J Netw Comput Appl 216:103670. https://doi.org/10.1016/j.jnca.2023.103670
Gu Z, Yin T, Lu Q, Park JH (2025) Resilient event-triggered formation control and secure estimation of multi-UAV systems. IEEE Trans Industr Inf 21(6):4915–4923. https://doi.org/10.1109/TII.2025.3547012
Yu Z, Li Y, Lv M, Pei B, Fu A (2024) Event-triggered adaptive fuzzy fault-tolerant attitude control for tailless flying-wing UAV with fixed-time convergence. IEEE Trans Veh Technol 73(4):4858–4869. https://doi.org/10.1109/TVT.2023.3329470
Zhang Y, Li S, Wang S, Wang X, Duan H (2023) Distributed bearing-based formation maneuver control of fixed-wing UAVs by finite-time orientation estimation. Aerosp Sci Technol 136:108241. https://doi.org/10.1016/j.ast.2023.108241
Du Z, Qu X, Shi J, Lu J (2024) Formation control of fixed-wing UAVs with communication delay. ISA Trans 146:154–164. https://doi.org/10.1016/j.isatra.2023.12.036
Mitridis D, Kapsalis S, Terzis D, Panagiotou P (2023) An evaluation of fixed-wing unmanned aerial vehicle trends and correlations with respect to NATO classification, region, EIS date and operational specifications. Aerospace 10(4):382. https://doi.org/10.3390/aerospace10040382
Guan X, Yiu K-FC, Li B, Zeng Y, Zhang R (2025) 3d trajectory optimization for fixed-wing UAV communications with full UAV dynamics. IEEE Trans Veh Technol. https://doi.org/10.1109/TVT.2025.3570729
Huang Z, Chen M (2022) Coordinated disturbance observer-based flight control of fixed-wing UAV. IEEE Trans Circuits Syst II Express Briefs 69(8):3545–3549. https://doi.org/10.1109/TCSII.2022.3165366
Huang Z, Chen M (2024) Augmented disturbance observer-based appointed-time tracking control of UAVs under exogenous disturbance. IEEE Trans Intell Veh 9(1):2822–2835. https://doi.org/10.1109/TIV.2023.3303348
Wei L, Chen M, Shi S (2024) Dynamic event-triggered consensus cost-based switching control for UAV formation with disturbances. IEEE Trans Intell Veh 9(2):3531–3543. https://doi.org/10.1109/TIV.2023.3344606
Lopez-Sanchez I, Moreno-Valenzuela J (2023) PID control of quadrotor UAVs: a survey. Annu Rev Control 56:100900. https://doi.org/10.1016/j.arcontrol.2023.100900
Chen G, Wang W, Dong J (2025) Performance-optimize adaptive robust tracking control for USV-UAV heterogeneous systems with uncertainty. IEEE Trans Veh Technol 74(5):7251–7262. https://doi.org/10.1109/TVT.2025.3526360
Yang B, Wang J, Wang J, Shu H, Li D, Zeng C, Chen Y, Zhang X, Yu T (2021) Robust fractional-order PID control of supercapacitor energy storage systems for distribution network applications: a perturbation compensation based approach. J Clean Prod 279:123362. https://doi.org/10.1016/j.jclepro.2020.123362
Xu L-X, Wang Y-L, Wang X, Peng C (2024) Distributed active disturbance rejection formation tracking control for quadrotor UAVs. IEEE Trans Cybern 54(8):4678–4689. https://doi.org/10.1109/TCYB.2023.3324752
Xie T, Xian B, Gu X, Hu J, Liu M (2024) Disturbance observer-based fixed-time tracking control for a tilt Trirotor unmanned aerial vehicle. IEEE Trans Industr Electron 71(4):3894–3903. https://doi.org/10.1109/TIE.2023.3277090
Liu K, Wang R (2021) Antisaturation command filtered backstepping control-based disturbance rejection for a quadarotor UAV. IEEE Trans Circuits Syst II Express Briefs 68(12):3577–3581. https://doi.org/10.1109/TCSII.2021.3069967
Guo K, Liu C, Zhang X, Yu X, Zhang Y, Xie L, Guo L (2024) A bio-inspired safety control system for UAVs in confined environment with disturbance. IEEE Trans Cybern 54(2):1308–1320. https://doi.org/10.1109/TCYB.2022.3217982
Ma B, Liu Z, Dang Q, Zhao W, Wang J, Cheng Y, Yuan Z (2023) Deep reinforcement learning of UAV tracking control under wind disturbances environments. IEEE Trans Instrum Meas 72:1–13. https://doi.org/10.1109/TIM.2023.3265741
Lan T, Song J, Hou Z, Chen K, He S, Wang H, Liu JJR (2025) Event-triggered fixed-time sliding mode control for Lip-reading-driven UAV: disturbance rejection using wind field optimization. IEEE Trans Autom Sci Eng 22:9090–9103. https://doi.org/10.1109/TASE.2024.3496939
Xue W, Kolaric P, Fan J, Lian B, Chai T, Lewis FL (2022) Inverse reinforcement learning in tracking control based on inverse optimal control. IEEE Trans Cybern 52(10):10570–10581. https://doi.org/10.1109/TCYB.2021.3062856
De Lellis F, Coraggio M, Russo G, Musolesi M, Bernardo M (2024) Guaranteeing control requirements via reward shaping in reinforcement learning. IEEE Trans Control Syst Technol 32(6):2102–2113. https://doi.org/10.1109/TCST.2024.3393210
Chen L, Meng F, Zhang Y (2022) MBRL-MC: an HVAC control approach via combining model-based deep reinforcement learning and model predictive control. IEEE Internet Things J 9(19):19160–19173. https://doi.org/10.1109/JIOT.2022.3164023
Shuprajhaa T, Sujit SK, Srinivasan K (2022) Reinforcement learning based adaptive PID controller design for control of linear/nonlinear unstable processes. Appl Soft Comput 128:109450. https://doi.org/10.1016/j.asoc.2022.109450
Yu Z, Zhang Y, Liu Z, Qu Y, Su C-Y (2019) Distributed adaptive fractional-order fault-tolerant cooperative control of networked unmanned aerial vehicles via fuzzy neural networks. IET Control Theory Appl 13(17):2917–2929. https://doi.org/10.1049/iet-cta.2018.6262
Xu J (2015) Flight Control Systems-Design, Prototype Systems and Semi-physical Simulation of Real Experimentation. Beijing Institute of Technology Press, Beijing
Zhang W (2009) Design of modern flight control system. Northwestern Polytechnical University Press, Xi’an
Gheisarnejad M, Khooban MH (2021) An intelligent non-integer PID controller-based deep reinforcement learning: implementation and experimental results. IEEE Trans Industr Electron 68(4):3609–3618. https://doi.org/10.1109/TIE.2020.2979561
Oustaloup A, Levron F, Mathieu B, Nanot FM (2000) Frequency-band complex noninteger differentiator: characterization and synthesis. IEEE Trans Circuits Syst I Fundam Theory Appl 47(1):25–39. https://doi.org/10.1109/81.817385
Ji Q, Wang XV, Wang L, Feng L (2022) Online reinforcement learning for the shape morphing adaptive control of 4D printed shape memory polymer. Control Eng Pract 126:105257. https://doi.org/10.1016/j.conengprac.2022.105257
Murch AM, Paw YC, Pandita R, Li Z, Balas GJ (2011) A low cost small UAV flight research facility. In: Holzapfel F, Theil S (eds) Advances in aerospace guidance, navigation and control. Springer, Berlin, pp 29–40. https://doi.org/10.1007/978-3-642-19817-5_3
Kumar A, Suhag S (2022) Whale optimization algorithm optimized fuzzy-PID plus PID hybrid controller for frequency regulation in hybrid power system. J Inst Eng (India) Ser B 103(2):633–648. https://doi.org/10.1007/s40031-021-00656-9
Acknowledgements
The work is financially supported by the National Natural Science Foundation of China (NSFC) under Grant 62373068 and U2034209.
Author information
Authors and Affiliations
Contributions
XF conceptualization, investigation, methodology, writing original draft, validation, writing review and editing, validation, software. CY conceptualization, formal analysis, writing review and editing, supervision, project administration, resources, funding acquisition. CXL conceptualization, data curation, investigation, writing review and editing, software. ZK Formal analysis, writing review and editing, supervision, software. LQ conceptualization, investigation, methodology, writing review and editing, supervision, project administration, resources, funding acquisition.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xie, F., Chai, Y., Chen, X. et al. Robust fractional-order PID controller design for fixed-wing UAVs through proximal-policy-optimization for disturbance rejection. Int. J. Mach. Learn. & Cyber. (2025). https://doi.org/10.1007/s13042-025-02801-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1007/s13042-025-02801-y


