A Deep Q-Learning Framework for Modeling and Nonlinear Adaptive Control of Autonomous Space Rocket Maneuvers

Xiaoan  Zhan

doi:10.71222/cr6cmx88

Authors

Xiaoan Zhan New York University, Brooklyn, NY 11201, USA Author

DOI:

https://doi.org/10.71222/cr6cmx88

Keywords:

deep learning, Q-learning, adaptive control, aerospace robotics

Abstract

In this paper, we develop a mathematical model and a deep Q-learning-assisted adaptive controller for autonomous space-rocket maneuvers. Our approach focuses on executing a multi-phase trajectory alignment without relying on external markers or inter-rocket communication. The proposed feedback mechanism uses onboard sensors to track the rocket's current position and orientation, applying standard robotics nomenclature for translational and rotational states. We formulate a general kinematic description of the rocket system, treating the maneuver as a tracking problem with respect to polynomial-based reference paths, which are generated online for each phase. A deep Q-learning module refines the control policy in real time by exploring actions that optimize the rocket's flight profile under uncertainties, such as unknown velocities. Simulation results verify that the integrated controller is capable of accurately following the desired trajectories and can robustly adapt to dynamic variations, illustrating the effectiveness of our proposed method for autonomous rocket guidance.

References

1. L. Casalino, and D. Pastrone, "Optimal design and control of hybrid rockets for access to space," In 41st AIAA/ASME/SAE/ASEE Joint Propulsion Conference & Exhibit, July, 2005, p. 3547. doi: 10.2514/6.2005-3547

2. E. N. Johnson, A. J. Calise, and J. E. Corban, "Adaptive guidance and control for autonomous launch vehicles," In 2001 IEEE Aerospace Conference Proceedings (Cat. No. 01TH8542), March, 2001, pp. 2669-2682.

3. X. Liu, "Fuel-optimal rocket landing with aerodynamic controls," Journal of Guidance, Control, and Dynamics, vol. 42, no. 1, pp. 65-77, 2019. doi: 10.2514/1.g003537

4. B. D. Fried, and J. M. Richardson, "Optimum rocket trajectories," Journal of Applied Physics, vol. 27, no. 8, pp. 955-961, 1956. doi: 10.1063/1.1722521

5. K. Kondo, I. Kolmanovsky, Y. Yoshimura, M. Bando, S. Nagasaki, and T. Hanada, "Nonlinear model predictive detumbling of small satellites with a single-axis magnetorquer," Journal of Guidance, Control, and Dynamics, vol. 44, no. 6, pp. 1211-1218, 2021. doi: 10.2514/1.g005877

6. L. Zhao, Z. Shi, and Y. Zhu, "Acceleration autopilot for a guided spinning rocket via adaptive output feedback," Aerospace Science and Technology, vol. 77, pp. 573-584, 2018. doi: 10.1016/j.ast.2018.04.012

7. G. Colasurdo, D. Pastrone, and L. Casalino, "Optimal performance of a dual-fuel single-stage rocket," Journal of spacecraft and rockets, vol. 35, no. 5, pp. 667-671, 1998. doi: 10.2514/2.3383

8. M. Wang, and H. N. Wu, "Autonomous game control for spacecraft rendezvous via adaptive perception and interaction," IEEE Transactions on Aerospace and Electronic Systems, vol. 59, no. 3, pp. 3188-3200, 2022.

9. D. F. Lawden, "Minimal rocket trajectories," Journal of the American Rocket Society, vol. 23, no. 6, pp. 360-367, 1953.

10. C. Riano-Rios, R. Bevilacqua, and W. E. Dixon, "Adaptive control for differential drag-based rendezvous maneuvers with an unknown target," Acta Astronautica, vol. 181, pp. 733-740, 2021. doi: 10.1016/j.actaastro.2020.03.011

11. L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," Journal of artificial intelligence research, vol. 4, pp. 237-285, 1996. doi: 10.1613/jair.301

12. L. Gao, K. Aubert, D. Saldana, C. Danielson, and R. Fierro, "Decentralized adaptive aerospace transportation of unknown loads using a team of robots," In International Symposium on Distributed Autonomous Robotic Systems, October, 2024, pp. 28-41.

13. R. S. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," In Machine learning proceedings 1990, 1990, pp. 216-224. doi: 10.1016/b978-1-55860-141-3.50030-4

14. P. A. Ioannou, and J. Sun, "Robust adaptive control (Vol. 1, pp. 75-76)," Upper Saddle River, NJ: PTR Prentice-Hall, 1996.

15. R. Kumar, and H. J. Kelley, "Singular optimal atmospheric rocket trajectories," Journal of Guidance, Control, and Dynamics, vol. 11, no. 4, pp. 305-312, 1988. doi: 10.2514/3.20312

A Deep Q-Learning Framework for Modeling and Nonlinear Adaptive Control of Autonomous Space Rocket Maneuvers

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

ISSN

Make a Submission

Indexing & Abstracting