Research

Publications and Preprints

(\(\dagger\) indicates the research group leader and advising.)

Reinforcement Learning

Machine Learning for Combinatorial Optimization

VLM/LLM Training

  • The Primacy of Magnitude in Low-Rank Adaptation
    Zicheng Zhang, Haoran Li, Yifeng Zhang, Guoqiang Gong, Jiaxing Wang, Pengzhang Liu, Qixia Jiang, Junxing Hu.
    (NeurIPS 2025 Spotlight, < 3%) 39th Annual Conference on Neural Information Processing Systems.

  • TANDEM: Bi-Level Data Mixture Optimization with Twin Networks
    Jiaxing Wang, Deping Xiang, Jin Xu, Mingyang Yi, Guoqiang Gong, Zicheng Zhang, Haoran Li, Pengzhang Liu, Zhen Chen, Ke Zhang, Ju Fan, Qixia Jiang.
    (NeurIPS 2025) 39th Annual Conference on Neural Information Processing Systems.