Skip to main content
Log in

Robust fractional-order PID controller design for fixed-wing UAVs through proximal-policy-optimization for disturbance rejection

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

This article addresses the design problem of fractional order PID (FOPID) controllers in environments with disturbances and uncertainties for fixed-wing unmanned aerial vehicles (UAVs). While traditional FOPID controllers offer stable tracking, their performance degrades under dynamic external and internal interferences. To overcome this limitation, we propose a novel hybrid intelligent control strategy, termed PPO-based FOPID, which synergistically combines the stability of control theory with the adaptive learning capabilities of reinforcement learning. In this framework, an FOPID controller provides a baseline control policy, ensuring stability and tracking efficiency. Concurrently, a proximal policy optimization agent acts as a supplementary learning controller, continuously optimizing control commands via an actor-critic mechanism to actively counteract disturbances and uncertainties. The proposed strategy is validated on the attitude control system of a fixed-wing UAV. Simulation results validate the proposed controller’s superior performance. In experiments against conventional PID, FOPID, and an advanced fuzzy PID plus PID hybrid strategy, our method consistently achieves the lowest tracking error. It improves the root mean square error by 3.1\(-\)4.3% under external disturbances and up to 8.1% under parametric uncertainties, demonstrating significantly enhanced robustness and adaptability for high-precision UAV control in complex scenarios.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from €37.37 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price includes VAT (Norway)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Data availability

No datasets were generated or analysed during the current study.

References

  1. Messaoudi K, Oubbati OS, Rachedi A, Lakas A, Bendouma T, Chaib N (2023) A survey of UAV-based data collection: challenges, solutions and future perspectives. J Netw Comput Appl 216:103670. https://doi.org/10.1016/j.jnca.2023.103670

    Article  Google Scholar 

  2. Gu Z, Yin T, Lu Q, Park JH (2025) Resilient event-triggered formation control and secure estimation of multi-UAV systems. IEEE Trans Industr Inf 21(6):4915–4923. https://doi.org/10.1109/TII.2025.3547012

    Article  Google Scholar 

  3. Yu Z, Li Y, Lv M, Pei B, Fu A (2024) Event-triggered adaptive fuzzy fault-tolerant attitude control for tailless flying-wing UAV with fixed-time convergence. IEEE Trans Veh Technol 73(4):4858–4869. https://doi.org/10.1109/TVT.2023.3329470

    Article  Google Scholar 

  4. Zhang Y, Li S, Wang S, Wang X, Duan H (2023) Distributed bearing-based formation maneuver control of fixed-wing UAVs by finite-time orientation estimation. Aerosp Sci Technol 136:108241. https://doi.org/10.1016/j.ast.2023.108241

    Article  Google Scholar 

  5. Du Z, Qu X, Shi J, Lu J (2024) Formation control of fixed-wing UAVs with communication delay. ISA Trans 146:154–164. https://doi.org/10.1016/j.isatra.2023.12.036

    Article  Google Scholar 

  6. Mitridis D, Kapsalis S, Terzis D, Panagiotou P (2023) An evaluation of fixed-wing unmanned aerial vehicle trends and correlations with respect to NATO classification, region, EIS date and operational specifications. Aerospace 10(4):382. https://doi.org/10.3390/aerospace10040382

    Article  Google Scholar 

  7. Guan X, Yiu K-FC, Li B, Zeng Y, Zhang R (2025) 3d trajectory optimization for fixed-wing UAV communications with full UAV dynamics. IEEE Trans Veh Technol. https://doi.org/10.1109/TVT.2025.3570729

    Article  Google Scholar 

  8. Huang Z, Chen M (2022) Coordinated disturbance observer-based flight control of fixed-wing UAV. IEEE Trans Circuits Syst II Express Briefs 69(8):3545–3549. https://doi.org/10.1109/TCSII.2022.3165366

    Article  Google Scholar 

  9. Huang Z, Chen M (2024) Augmented disturbance observer-based appointed-time tracking control of UAVs under exogenous disturbance. IEEE Trans Intell Veh 9(1):2822–2835. https://doi.org/10.1109/TIV.2023.3303348

    Article  Google Scholar 

  10. Wei L, Chen M, Shi S (2024) Dynamic event-triggered consensus cost-based switching control for UAV formation with disturbances. IEEE Trans Intell Veh 9(2):3531–3543. https://doi.org/10.1109/TIV.2023.3344606

    Article  Google Scholar 

  11. Lopez-Sanchez I, Moreno-Valenzuela J (2023) PID control of quadrotor UAVs: a survey. Annu Rev Control 56:100900. https://doi.org/10.1016/j.arcontrol.2023.100900

    Article  MathSciNet  Google Scholar 

  12. Chen G, Wang W, Dong J (2025) Performance-optimize adaptive robust tracking control for USV-UAV heterogeneous systems with uncertainty. IEEE Trans Veh Technol 74(5):7251–7262. https://doi.org/10.1109/TVT.2025.3526360

    Article  Google Scholar 

  13. Yang B, Wang J, Wang J, Shu H, Li D, Zeng C, Chen Y, Zhang X, Yu T (2021) Robust fractional-order PID control of supercapacitor energy storage systems for distribution network applications: a perturbation compensation based approach. J Clean Prod 279:123362. https://doi.org/10.1016/j.jclepro.2020.123362

    Article  Google Scholar 

  14. Xu L-X, Wang Y-L, Wang X, Peng C (2024) Distributed active disturbance rejection formation tracking control for quadrotor UAVs. IEEE Trans Cybern 54(8):4678–4689. https://doi.org/10.1109/TCYB.2023.3324752

    Article  Google Scholar 

  15. Xie T, Xian B, Gu X, Hu J, Liu M (2024) Disturbance observer-based fixed-time tracking control for a tilt Trirotor unmanned aerial vehicle. IEEE Trans Industr Electron 71(4):3894–3903. https://doi.org/10.1109/TIE.2023.3277090

    Article  Google Scholar 

  16. Liu K, Wang R (2021) Antisaturation command filtered backstepping control-based disturbance rejection for a quadarotor UAV. IEEE Trans Circuits Syst II Express Briefs 68(12):3577–3581. https://doi.org/10.1109/TCSII.2021.3069967

    Article  Google Scholar 

  17. Guo K, Liu C, Zhang X, Yu X, Zhang Y, Xie L, Guo L (2024) A bio-inspired safety control system for UAVs in confined environment with disturbance. IEEE Trans Cybern 54(2):1308–1320. https://doi.org/10.1109/TCYB.2022.3217982

    Article  Google Scholar 

  18. Ma B, Liu Z, Dang Q, Zhao W, Wang J, Cheng Y, Yuan Z (2023) Deep reinforcement learning of UAV tracking control under wind disturbances environments. IEEE Trans Instrum Meas 72:1–13. https://doi.org/10.1109/TIM.2023.3265741

    Article  Google Scholar 

  19. Lan T, Song J, Hou Z, Chen K, He S, Wang H, Liu JJR (2025) Event-triggered fixed-time sliding mode control for Lip-reading-driven UAV: disturbance rejection using wind field optimization. IEEE Trans Autom Sci Eng 22:9090–9103. https://doi.org/10.1109/TASE.2024.3496939

    Article  Google Scholar 

  20. Xue W, Kolaric P, Fan J, Lian B, Chai T, Lewis FL (2022) Inverse reinforcement learning in tracking control based on inverse optimal control. IEEE Trans Cybern 52(10):10570–10581. https://doi.org/10.1109/TCYB.2021.3062856

    Article  Google Scholar 

  21. De Lellis F, Coraggio M, Russo G, Musolesi M, Bernardo M (2024) Guaranteeing control requirements via reward shaping in reinforcement learning. IEEE Trans Control Syst Technol 32(6):2102–2113. https://doi.org/10.1109/TCST.2024.3393210

    Article  Google Scholar 

  22. Chen L, Meng F, Zhang Y (2022) MBRL-MC: an HVAC control approach via combining model-based deep reinforcement learning and model predictive control. IEEE Internet Things J 9(19):19160–19173. https://doi.org/10.1109/JIOT.2022.3164023

    Article  Google Scholar 

  23. Shuprajhaa T, Sujit SK, Srinivasan K (2022) Reinforcement learning based adaptive PID controller design for control of linear/nonlinear unstable processes. Appl Soft Comput 128:109450. https://doi.org/10.1016/j.asoc.2022.109450

    Article  Google Scholar 

  24. Yu Z, Zhang Y, Liu Z, Qu Y, Su C-Y (2019) Distributed adaptive fractional-order fault-tolerant cooperative control of networked unmanned aerial vehicles via fuzzy neural networks. IET Control Theory Appl 13(17):2917–2929. https://doi.org/10.1049/iet-cta.2018.6262

    Article  MathSciNet  Google Scholar 

  25. Xu J (2015) Flight Control Systems-Design, Prototype Systems and Semi-physical Simulation of Real Experimentation. Beijing Institute of Technology Press, Beijing

    Google Scholar 

  26. Zhang W (2009) Design of modern flight control system. Northwestern Polytechnical University Press, Xi’an

    Google Scholar 

  27. Gheisarnejad M, Khooban MH (2021) An intelligent non-integer PID controller-based deep reinforcement learning: implementation and experimental results. IEEE Trans Industr Electron 68(4):3609–3618. https://doi.org/10.1109/TIE.2020.2979561

    Article  Google Scholar 

  28. Oustaloup A, Levron F, Mathieu B, Nanot FM (2000) Frequency-band complex noninteger differentiator: characterization and synthesis. IEEE Trans Circuits Syst I Fundam Theory Appl 47(1):25–39. https://doi.org/10.1109/81.817385

    Article  Google Scholar 

  29. Ji Q, Wang XV, Wang L, Feng L (2022) Online reinforcement learning for the shape morphing adaptive control of 4D printed shape memory polymer. Control Eng Pract 126:105257. https://doi.org/10.1016/j.conengprac.2022.105257

    Article  Google Scholar 

  30. Murch AM, Paw YC, Pandita R, Li Z, Balas GJ (2011) A low cost small UAV flight research facility. In: Holzapfel F, Theil S (eds) Advances in aerospace guidance, navigation and control. Springer, Berlin, pp 29–40. https://doi.org/10.1007/978-3-642-19817-5_3

    Chapter  Google Scholar 

  31. Kumar A, Suhag S (2022) Whale optimization algorithm optimized fuzzy-PID plus PID hybrid controller for frequency regulation in hybrid power system. J Inst Eng (India) Ser B 103(2):633–648. https://doi.org/10.1007/s40031-021-00656-9

    Article  Google Scholar 

Download references

Acknowledgements

The work is financially supported by the National Natural Science Foundation of China (NSFC) under Grant 62373068 and U2034209.

Author information

Authors and Affiliations

Contributions

XF conceptualization, investigation, methodology, writing original draft, validation, writing review and editing, validation, software. CY conceptualization, formal analysis, writing review and editing, supervision, project administration, resources, funding acquisition. CXL conceptualization, data curation, investigation, writing review and editing, software. ZK Formal analysis, writing review and editing, supervision, software. LQ conceptualization, investigation, methodology, writing review and editing, supervision, project administration, resources, funding acquisition.

Corresponding author

Correspondence to Qie Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xie, F., Chai, Y., Chen, X. et al. Robust fractional-order PID controller design for fixed-wing UAVs through proximal-policy-optimization for disturbance rejection. Int. J. Mach. Learn. & Cyber. (2025). https://doi.org/10.1007/s13042-025-02801-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1007/s13042-025-02801-y

Keywords

Profiles

  1. Qie Liu