Gao, S. (2025). Optimization of Large Models for Efficient Inference: Algorithm, Compiler, and System Co-Design. Journal of Computer, Signal, and System Research, 2(6), 109-120. https://doi.org/10.71222/078xh379