Yuhui Xu

Ph.D., SJTU

I am a research scientist at Salesforce AI Research. At SJTU, I was part of the MIN LAB, advised by Prof. Hongkai Xiong and Prof. Weiyao Lin. I was also a visiting student at the CCVL LAB, advised by Prof. Alan Yuille. Prior to SJTU, I obtained my B.S. degree from Chien-Shiung Wu College, Southeast University, in 2016.

Research Interests: Neural Network Compression, Neural Architecture Search, Large Language Models

Preprints

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li
Preprint [PDF][Code]

Latency-Aware Differentiable Neural Architecture Search

Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Bowen Shi, Qi Tian, Hongkai Xiong
Preprint [PDF]

Conference Papers

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

ICLR 2024
Yuhui Xu, Lingxi Xie, Xiaotao Gu, Xin Chen, Heng Chang, Hengheng Zhang, Zhengsu Chen, Xiaopeng Zhang, Qi Tian
accepted by International Conference on Learning Representations (ICLR 2024) [PDF][Code]

Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks

AAAI 2021
Xin Chen, Lingxi Xie, Jun Wu, Longhui Wei, Yuhui Xu, Qi Tian
accepted by the 35th AAAI Conference on Artificial Intelligence (AAAI 2021) [PDF]

TRP: Trained Rank Pruning for Efficient Deep Neural Networks

IJCAI 2020
Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong
Accepted by International Joint Conference on Artificial Intelligence (IJCAI 2020), Yokohama, Japan, July 2020 [PDF][Code]

PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search

ICLR 2020
Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong
accepted by International Conference on Learning Representations (ICLR 2020) [PDF][Code]

Trained Rank Pruning for Efficient Deep Neural Networks

NIPS-EMC2
Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong
Accepted by NIPS-EMC2 Workshop 2019 [Web][PDF][Code]

DNQ: Dynamic Network Quantization

DCC 2019
Yuhui Xu, Shuai Zhang, Yingyong Qi, Jiaxian Guo, Weiyao Lin, Hongkai Xiong
Accepted by IEEE DCC 2019 [PDF]

Deep Neural Network Compression with Single and Multiple Level Quantization

AAAI 2018
Yuhui Xu, Yongzhuang Wang, Aojun Zhou, Weiyao Lin, Hongkai Xiong
accepted by the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018) [PDF][Code][AAAI Digital Library]

Journals

BNET: Batch Normalization with Enhanced Linear Transformation

TPAMI
Yuhui Xu, Lingxi Xie, Cihang Xie, Wenrui Dai, Jieru Mei, Siyuan Qiao, Wei Shen, Hongkai Xiong, Alan Yuille
accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence [PDF][Code]

Partially-Connected Neural Architecture Search for Reduced Computational Redundancy

TPAMI
Yuhui Xu, Lingxi Xie, Wenrui Dai, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong
accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence [IEEE Xplore]

Iterative Deep Neural Network Quantization with Lipschitz Constraint

TMM
Yuhui Xu, Wenrui Dai, Yingyong Qi, Junni Zou, Hongkai Xiong
accepted by IEEE Transactions on Multimedia [IEEE Xplore]