Return to home page

Selected Publications

Please refer to the Google Scholar profile for more up-to-date papers.

envlm_png Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models
Zhihe Lu, Jiawang Bai, Xin Li, Zeyu Xiao, Xinchao Wang
ICML, 2024
code

The first investigation of ensemble learning for VLMs.

graph_png GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph
Xin Li*, Dongze Lian*, Zhihe Lu*, Jiawang Bai, Zhibo Chen, Xinchao Wang
* Equal Contribution

NeurIPS, 2023
code

Introduce knowledge graph for tuning vision and language models.

fda_png Frequency-enhanced Data Augmentation for Vision-and-Language Navigation
Keji He, Chenyang Si, Zhihe Lu, Yan Huang, Liang Wang, Xinchao Wang
NeurIPS, 2023
code

A work to explore the significance of high-frequency information for enhanced Vision-and-Language Navigation.

pcn_png Uncertainty-aware Source-free Domain Adaptive Semantic Segmentation
Zhihe Lu, Da Li, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
TIP, 2023

An uncertainty-aware solution for SFDASS.

pcn_png Prediction Calibration for Generalized Few-shot Semantic Segmentation
Zhihe Lu, Sen He, Da Li, Yi-Zhe Song, Tao Xiang
TIP, 2023

Investigating the feature-prediction covariance based Transformer for calibrating the biases in GFSS.

taskres_png Task Residual for Tuning Vision-Language Models
Tao Yu*, Zhihe Lu*, Xin Jin, Zhibo Chen, Xinchao Wang
* Equal Contribution

CVPR, 2023
code

A simple yet effective tuning method for vision and language models.

cwt_png Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer
Zhihe Lu, Sen He, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang
ICCV, 2021
code

A novel training pipeline for few-shot segmentation with classifier weight transformer.

star_png Stochastic Classifiers for Unsupervised Domain Adaptation
Zhihe Lu, Yongxin Yang, Xiatian Zhu, Cong Liu, Yi-Zhe Song, Tao Xiang
CVPR, 2020
code

A novel way to use infinite number of classifiers without extra parameters to identify misaligned regions.

gafp_png Conditional Expression Synthesis with Face Parsing Transformation
Zhihe Lu, Tanhao Hu, Lingxiao Song, Zhaoxiang Zhang, Ran He
ACM MM, 2018

A Couple-Agent Face Parsing based Generative Adversarial Network (CAFP-GAN) that unites the knowledge of facial semantic regions and controllable expression signals.

g2_png Geometry Guided Adversarial Facial Expression Synthesis
Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan
ACM MM, 2018

A Geometry-Guided Generative Adversarial Network (G2-GAN) was proposed for photorealistic and identity-preserving facial expression synthesis.

face_survey_png Recent Progress of Face Image Synthesis
Zhihe Lu, Zhihang Li, Jie Cao, Ran He, Zhenan Sun
ACPR, 2017

A very early survey for face image synthesis.