Gao, Shengyi. “Optimization of Large Models for Efficient Inference: Algorithm, Compiler, and System Co-Design”. Journal of Computer, Signal, and System Research 2, no. 6 (November 25, 2025): 109–120. Accessed January 30, 2026. https://www.gbspress.com/index.php/JCSSR/article/view/503.