Gao, Shengyi. “Optimization of Large Models for Efficient Inference: Algorithm, Compiler, and System Co-Design”. Journal of Computer, Signal, and System Research, vol. 2, no. 6, Nov. 2025, pp. 109-20, https://doi.org/10.71222/078xh379.