
I am a Senior Applied Scientist at Oracle Health AI (OHAI), applying generative AI to the medical domain.
Previously, I am a research fellow at the Centre for Augmented Reasoning, Australian Institute for Machine Learning (AIML) at the University of Adelaide, working with Prof. Anton Van Den Hengel. My research has focused on video generative models.
I obtained my PhD from the Australian National University in 2024, advised by Prof. Stephen Gould and Dr. Damien Teney (Idiap Research Institute | AIML). My PhD surrounds multi-modal learning, in particular, vision-and-language, where I focused on a task termed composed image retrieval.
Prior to that, I obtained my bachelor degree in Electronics Engineering (R&D) with first-class honours from the Australian National University in 2018.
Research Summary
I am currently working on:
- large language model;
- video generative / video editing models.
Previously, I have researched on:
- multi-modal learning, in particular, vision-and-language tasks;
- weakly-supervised learning.
I maintain my publication record at Google Scholar. Source code for all first-author projects are available at GitHub.
LinkedIn profile. OpenReview profile.
I am actively seeking talented students for research projects, if interested please contact me via the email address above.
News
Sep 2025 I have joined Oracle Health AI as a Senior Applied Scientist.
May 2024 I have joined AIML as a postdoctoral research fellow.
Apr 2024 I have received my award of PhD from ANU. A big thank you to my supervisors, my parents and all my friends for their kind support through this journey.

FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models.Coming soon
Yan Gao, Massimo Roberto Scamarcia, Javier Fernandez-Marques, Mohammad Naseri, Chong Shen Ng, Dimitris Stripelis, Zexi Li, Tao Shen, Jiamu Bai, Daoyuan Chen, Zikai Zhang, Rui Hu, InSeo Song, Lee KangYoon, Hong Jia, Ting Dang, Junyan Wang, Zheyuan Liu, Daniel Janes Beutel, Lingjuan Lyu, Nicholas D Lane.
To appear in Neural Information Processing Systems (NeurIPS),
2025.
bib | arXiv
coming soon

Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction.New
Zheyuan Liu, Junyan Wang, Zicheng Duan, Cristian Rodriguez-Opazo, Anton Van Den Hengel.
2025.
bib | arXiv | code
@article{liu2025frame,
title={Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction},
author={Liu, Zheyuan and Wang, Junyan and Duan, Zicheng and Rodriguez-Opazo, Cristian and Hengel, Anton van den},
journal={arXiv preprint arXiv:2503.12953},
year={2025}
}

OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection.
Changsheng Lu, Zheyuan Liu, Piotr Koniusz.
European Conference on Computer Vision (ECCV), 2024.
bib | pdf | arXiv | code
@inproceedings{lu2025openkd,
title={OpenKD: Opening Prompt Diversity for Zero-and Few-shot Keypoint Detection},
author={Lu, Changsheng and Liu, Zheyuan and Koniusz, Piotr},
booktitle={European Conference on Computer Vision},
pages={148--165},
year={2025},
organization={Springer}
}

Retrieving Images through Bi-modal Visual and Language Queries.
Zheyuan Liu.
Thesis (PhD), Australian National University, 2024.
bib | ANU archive | pdf
@phdthesis{liu2024retrieving,
title={Retrieving Images through Bi-modal Visual and Language Queries},
author={Liu, Zheyuan},
school={The Australian National University},
year={2024}
}

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder.
Zheyuan Liu, Weixuan Sun, Damien Teney, and Stephen Gould.
Transactions on Machine Learning Research (TMLR), 2024.
bib | pdf | arXiv | code
@article{liu2024candidate,
title={Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder},
author={Zheyuan Liu and Weixuan Sun and Damien Teney and Stephen Gould},
journal={Transactions on Machine Learning Research},
issn={2835-8856},
year={2024},
url={https://openreview.net/forum?id=fJAwemcvpL},
note={}
}

Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning.
Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, and Stephen Gould.
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.
bib | pdf | arXiv | code
@InProceedings{Liu_2024_WACV,
author={Liu, Zheyuan and Sun, Weixuan and Hong, Yicong and Teney, Damien and Gould, Stephen},
title={Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning},
booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
month={January},
year={2024},
pages={5753-5762}
}

All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation.
Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, and Nick Barnes.
IEEE/CVF International Conference on Computer Vision (ICCV) Workshops. 2023.
bib | pdf | arXiv | code
@InProceedings{Sun_2023_ICCV,
author={Sun, Weixuan and Zhang, Yanhao and Qin, Zhen and Liu, Zheyuan and Cheng, Lin and Wang, Fanyi and Zhong, Yiran and Barnes, Nick},
title={All-pairs Consistency Learning forWeakly Supervised Semantic Segmentation},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month={October},
year={2023},
pages={826-837}
}

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning.
Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, and Nick Barnes.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2023.
bib | pdf | arXiv | code
@InProceedings{Sun_2023_CVPR,
author={Sun, Weixuan and Zhang, Jiayi and Wang, Jianyuan and Liu, Zheyuan and Zhong, Yiran and Feng, Tianpeng and Guo, Yandong and Zhang, Yanhao and Barnes, Nick},
title={Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month={June},
year={2023},
pages={6420-6429}
}

Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models.
Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, and Stephen Gould.
IEEE/CVF International Conference on Computer Vision (ICCV). 2021.
bib | pdf | arXiv | dataset | code
@InProceedings{Liu_2021_ICCV,
author={Liu, Zheyuan and Rodriguez-Opazo, Cristian and Teney, Damien and Gould, Stephen},
title ={Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month={October},
year={2021},
pages={2125-2134}
}
Related Links
CIRR Dataset | Image download [main] | Test-split Server [main] [mirror1]