刘 哲源

Zheyuan (David) Liu

profile picture

Research Summary

I am currently working on:

  • video generative models.

Previously, I have researched on:

  • multi-modal learning, in particular, vision-and-language tasks;
  • weakly-supervised learning.

I maintain my publication record at Google Scholar. Source code for all first-author projects are available at GitHub.




News

Jul 2024 We have one paper accepted by ECCV 2024, congratulations to Changsheng!

May 2024 I have joined AIML as a postdoctoral research fellow.

Apr 2024 I have received my award of PhD from ANU. A big thank you to my supervisors, my parents and all my friends for their kind support through this journey.


Recent Research
frame_wise_conditioning_img

Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction.New
Zheyuan Liu, Junyan Wang, Zicheng Duan, Cristian Rodriguez-Opazo, Anton Van Den Hengel. 2025.

bib | arXiv | code

  @article{liu2025frame,
    title={Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction},
    author={Liu, Zheyuan and Wang, Junyan and Duan, Zicheng and Rodriguez-Opazo, Cristian and Hengel, Anton van den},
    journal={arXiv preprint arXiv:2503.12953},
    year={2025}
  }

eccv2024_0_colab_img

OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection.
Changsheng Lu, Zheyuan Liu, Piotr Koniusz. European Conference on Computer Vision (ECCV), 2024.

bib | pdf | arXiv | code

  @inproceedings{lu2025openkd,
  title={OpenKD: Opening Prompt Diversity for Zero-and Few-shot Keypoint Detection},
  author={Lu, Changsheng and Liu, Zheyuan and Koniusz, Piotr},
  booktitle={European Conference on Computer Vision},
  pages={148--165},
  year={2025},
  organization={Springer}
}

anu_logo_img

Retrieving Images through Bi-modal Visual and Language Queries.
Zheyuan Liu. Thesis (PhD), Australian National University, 2024.

bib | ANU archive | pdf

  @phdthesis{liu2024retrieving,
  title={Retrieving Images through Bi-modal Visual and Language Queries},
  author={Liu, Zheyuan},
  school={The Australian National University},
  year={2024}
}

tmlr2024_0_img

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder.
Zheyuan Liu, Weixuan Sun, Damien Teney, and Stephen Gould. Transactions on Machine Learning Research (TMLR), 2024.

bib | pdf | arXiv | code

  @article{liu2024candidate,
    title={Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder},
    author={Zheyuan Liu and Weixuan Sun and Damien Teney and Stephen Gould},
    journal={Transactions on Machine Learning Research},
    issn={2835-8856},
    year={2024},
    url={https://openreview.net/forum?id=fJAwemcvpL},
    note={}
  }

wacv2024_0_img

Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning.
Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, and Stephen Gould. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.

bib | pdf | arXiv | code

  @InProceedings{Liu_2024_WACV,
    author={Liu, Zheyuan and Sun, Weixuan and Hong, Yicong and Teney, Damien and Gould, Stephen},
    title={Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning},
    booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month={January},
    year={2024},
    pages={5753-5762}
  }

iccv2023_0_colab_img

All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation.
Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, and Nick Barnes. IEEE/CVF International Conference on Computer Vision (ICCV) Workshops. 2023.

bib | pdf | arXiv | code

  @InProceedings{Sun_2023_ICCV,
    author={Sun, Weixuan and Zhang, Yanhao and Qin, Zhen and Liu, Zheyuan and Cheng, Lin and Wang, Fanyi and Zhong, Yiran and Barnes, Nick},
    title={All-pairs Consistency Learning forWeakly Supervised Semantic Segmentation},
    booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
    month={October},
    year={2023},
    pages={826-837}
  }

cvpr2023_0_colab_img

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning.
Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, and Nick Barnes. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2023.

bib | pdf | arXiv | code

  @InProceedings{Sun_2023_CVPR,
    author={Sun, Weixuan and Zhang, Jiayi and Wang, Jianyuan and Liu, Zheyuan and Zhong, Yiran and Feng, Tianpeng and Guo, Yandong and Zhang, Yanhao and Barnes, Nick},
    title={Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning},
    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month={June},
    year={2023},
    pages={6420-6429}
  }

iccv2021_0_img

Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models.
Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, and Stephen Gould. IEEE/CVF International Conference on Computer Vision (ICCV). 2021.

bib | pdf | arXiv | dataset | code

  @InProceedings{Liu_2021_ICCV,
    author={Liu, Zheyuan and Rodriguez-Opazo, Cristian and Teney, Damien and Gould, Stephen},
    title ={Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models},
    booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month={October},
    year={2021},
    pages={2125-2134}
  }