
I am a research fellow at the Centre for Augmented Reasoning, Australian Institute for Machine Learning (AIML) at the University of Adelaide, working with Prof. Anton Van Den Hengel. My current research focuses on video generative models.
I obtained my PhD from the Australian National University in 2024, advised by Prof. Stephen Gould and Dr. Damien Teney (Idiap Research Institute | AIML). My PhD research centred on multi-modal learning, in particular vision-and-language, where I focused on a task termed composed image retrieval.
Research Summary
I am currently working on:
- video generative models.
Previously, I have worked on:
- multi-modal learning, in particular, vision-and-language tasks;
- weakly-supervised learning.
I maintain my publication record on Google Scholar. Source code for all first-author projects is available on GitHub.
News
Jul 2024 We have one paper accepted to ECCV 2024. Congratulations to Changsheng!
May 2024 I have joined AIML as a postdoctoral research fellow.
Apr 2024 I have been awarded my PhD by ANU. A big thank you to my supervisors, my parents and all my friends for their kind support throughout this journey.

Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction. (New)
Zheyuan Liu, Junyan Wang, Zicheng Duan, Cristian Rodriguez-Opazo, Anton Van Den Hengel.
2025.
bib | arXiv | code
@article{liu2025frame,
title={Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction},
author={Liu, Zheyuan and Wang, Junyan and Duan, Zicheng and Rodriguez-Opazo, Cristian and Hengel, Anton van den},
journal={arXiv preprint arXiv:2503.12953},
year={2025}
}

OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection.
Changsheng Lu, Zheyuan Liu, Piotr Koniusz.
European Conference on Computer Vision (ECCV), 2024.
bib | pdf | arXiv | code
@inproceedings{lu2025openkd,
title={OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection},
author={Lu, Changsheng and Liu, Zheyuan and Koniusz, Piotr},
booktitle={European Conference on Computer Vision},
pages={148--165},
year={2025},
organization={Springer}
}

Retrieving Images through Bi-modal Visual and Language Queries.
Zheyuan Liu.
Thesis (PhD), Australian National University, 2024.
bib | ANU archive | pdf
@phdthesis{liu2024retrieving,
title={Retrieving Images through Bi-modal Visual and Language Queries},
author={Liu, Zheyuan},
school={The Australian National University},
year={2024}
}

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder.
Zheyuan Liu, Weixuan Sun, Damien Teney, and Stephen Gould.
Transactions on Machine Learning Research (TMLR), 2024.
bib | pdf | arXiv | code
@article{liu2024candidate,
title={Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder},
author={Zheyuan Liu and Weixuan Sun and Damien Teney and Stephen Gould},
journal={Transactions on Machine Learning Research},
issn={2835-8856},
year={2024},
url={https://openreview.net/forum?id=fJAwemcvpL},
note={}
}

Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning.
Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, and Stephen Gould.
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.
bib | pdf | arXiv | code
@InProceedings{Liu_2024_WACV,
author={Liu, Zheyuan and Sun, Weixuan and Hong, Yicong and Teney, Damien and Gould, Stephen},
title={Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning},
booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
month={January},
year={2024},
pages={5753-5762}
}

All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation.
Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, and Nick Barnes.
IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2023.
bib | pdf | arXiv | code
@InProceedings{Sun_2023_ICCV,
author={Sun, Weixuan and Zhang, Yanhao and Qin, Zhen and Liu, Zheyuan and Cheng, Lin and Wang, Fanyi and Zhong, Yiran and Barnes, Nick},
title={All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month={October},
year={2023},
pages={826-837}
}

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning.
Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, and Nick Barnes.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
bib | pdf | arXiv | code
@InProceedings{Sun_2023_CVPR,
author={Sun, Weixuan and Zhang, Jiayi and Wang, Jianyuan and Liu, Zheyuan and Zhong, Yiran and Feng, Tianpeng and Guo, Yandong and Zhang, Yanhao and Barnes, Nick},
title={Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month={June},
year={2023},
pages={6420-6429}
}

Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models.
Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, and Stephen Gould.
IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
bib | pdf | arXiv | dataset | code
@InProceedings{Liu_2021_ICCV,
author={Liu, Zheyuan and Rodriguez-Opazo, Cristian and Teney, Damien and Gould, Stephen},
title={Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month={October},
year={2021},
pages={2125-2134}
}