Yuhui Xu

Research Scientist, Salesforce AI Research.

yuhui.jpeg

I am a research scientist with Salesforce AI Research. I was part of the MIN LAB, advised by Prof. Hongkai Xiong and Prof. Weiyao Lin. I was a visiting student of CCVL LAB, advised by Prof. Alan Yuille. Prior to SJTU, I obtained my B.S. degree in Chien-Shiung Wu College from Southeast University in 2016.

News

May 15, 2025 One paper accepted to ACL 2025 (Main Conference, Oral)
May 01, 2025 One paper accepted to ICML 2025
Jan 23, 2025 One paper accepted to ICLR2025 Spotlight
Jan 23, 2025 One paper accepted to WWW25
May 16, 2024 One paper accepted to ACL 2024 (Main Conference)

Selected Publications

  1. frac-frame.png
    Fractured Chain-of-Thought Reasoning
    Baohao Liao* ,  Hanze Dong* ,  Yuhui Xu* , and 4 more authors
    arXiv preprint arXiv:2505.12992, 2025
    * = equal contribution
  2. e1.png
    Scalable Chain of Thoughts via Elastic Reasoning
    Yuhui Xu ,  Hanze Dong ,  Lei Wang , and 3 more authors
    arXiv preprint arXiv:2505.05315, 2025
  3. llmqfa.png
    One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments
    Ke Yi* ,  Yuhui Xu* ,  Heng Chang , and 4 more authors
    The 63rd Annual Meeting of the Association for Computational Linguistics (Oral), 2025
    * = equal contribution
  4. rsd.png
    Reward-Guided Speculative Decoding for Efficient LLM Reasoning
    Baohao Liao* ,  Yuhui Xu* ,  Hanze Dong* , and 5 more authors
    International Conference on Machine Learning, 2025
    * = equal contribution
  5. think.png
    ThinK: Thinner Key Cache by Query-Driven Pruning
    Yuhui Xu ,  Zhanming Jie ,  Hanze Dong , and 6 more authors
    International Conference on Learning Representation (Spotlight), 2025
  6. spp.png
    SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
    Xudong Lu* ,  Aojun Zhou* ,  Yuhui Xu* , and 3 more authors
    International Conference on Machine Learning, 2024
    * = equal contribution
  7. qalora.png
    QA-LoRA: Quantization-aware low-rank adaptation of large language models
    Yuhui Xu ,  Lingxi Xie ,  Xiaotao Gu , and 6 more authors
    International Conference on Learning Representation, 2024
  8. pc-darts.png
    PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search
    Yuhui Xu ,  Lingxi Xie ,  Xiaopeng Zhang , and 4 more authors
    International Conference on Learning Representation (Spotlight), 2020