Miss Xinyu Zhang

Australian Institute for Machine Learning

Division of Research and Innovation


Dr Xinyu Zhang is currently a Lecturer at the School of Computer Science, University of Auckland, and an Adjunct Lecturer at the Australian Institute for Machine Learning (AIML), University of Adelaide.
 
She was a Research Fellow at AIML, working closely with A/Prof. Lingqiao Liu and Prof. Anton van den Hengel. Before that, she was a Senior Research Scientist at Baidu Inc., working closely with Chief Scientist Jingdong Wang. She was also a Research Fellow at AIML working on the Neonatal Medical Project. She earned her Ph.D. from Tongji University and was a visiting Ph.D. student at the University of Adelaide, under the supervision of Prof. Chunhua Shen, Prof. Javen Qinfeng Shi, Prof. Anton van den Hengel and Prof. Mingyu You. Dr Zhang has broad interests in computer vision and metric learning. Her current research centers on image/video generation, self-supervised/unsupervised learning, human-centric AI and multimodal retrieval.
Google Scholar: https://scholar.google.com/citations?user=PSzJxD8AAAAJ&hl=en
 
Area Chair:
CVPR, ICCV, WACV
Conference Reviewer:
ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI, IJCAI
Journal Reviewer:
TPAMI, IJCV, TMM, PR, Neurocomputing

Date Position Institution name
2025 - ongoing Lecturer University of Auckland
2024 - ongoing Adjunct Lecturer University of Adelaide
2024 - 2025 Research Fellow University of Adelaide
2021 - 2024 Senior Research Scientist Baidu (China)

Date Institution name Country Title
Tongji University China PhD
University of Adelaide Australia Visiting PhD student

Year Citation
2024 Zhang, J., Wang, M., Jiang, H., Zhang, X., Yan, C., & Zeng, D. (2024). STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints. IEEE Transactions on Multimedia, 26, 4445-4457.
DOI Scopus16 WoS17
2023 Yin, J., Zhang, X., Ma, Z., Guo, J., & Liu, Y. (2023). A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification. IEEE Transactions on Image Processing, 32, 2309-2321.
DOI Scopus58 WoS47 Europe PMC1
2023 Zhang, X., Chen, J., Yuan, J., Chen, Q., Wang, J., Wang, X., . . . Wang, J. (2023). CAE v2: Context autoencoder with CLIP latent alignment. Transactions on Machine Learning Research.
2020 Zhang, X., Zhang, R., Cao, J., Gong, D., You, M., & Shen, C. (2020). Part-Guided Attention Learning for Vehicle Instance Retrieval. IEEE Transactions on Intelligent Transportation Systems, 23(4), 1-13.
DOI Scopus50 WoS42
2018 You, M., Zhang, Y., Shen, C., & Zhang, X. (2018). An Extended Filtered Channel Framework for Pedestrian Detection. IEEE Transactions on Intelligent Transportation Systems, 19(5), 1640-1651.
DOI Scopus22 WoS19

Year Citation
2025 Zhang, X., Gong, D., Duan, Z., Van Den Hengel, A., & Liu, L. (2025). Let Your Video Listen to Your Music! - Beat-Aligned, Content-Preserving Video Editing with Arbitrary Music. In MM 2025: Proceedings of the 33rd ACM International Conference on Multimedia (pp. 12140-12149). ACM.
DOI
2024 Yuan, J., Zhang, X., Zhou, H., Wang, J., Qiu, Z., Shao, Z., . . . Wang, J. (2024). HAP: Structure-aware masked image modeling for human-centric perception. In NeurIPS Proceedings. Online: NeurIPS.
2024 Sun, Y., Chen, J., Zhang, S., Zhang, X., Chen, Q., Zhang, G., . . . Li, Z. (2024). VRP-SAM: SAM with Visual Reference Prompt. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 23565-23574). Seattle, WA: IEEE Computer Society.
DOI WoS33
2023 Shao, Z., Zhang, X., Ding, C., Wang, J., & Wang, J. (2023). Unified Pre-training with Pseudo Texts for Text-To-Image Person Re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV, 2023) (pp. 1-11). Online: IEEE.
DOI WoS38
2022 Li, D., Wang, Z., Wang, J., Zhang, X., Ding, E., Wang, J., & Zhang, Z. (2022). Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (pp. 1067-1073). Vienna: International Joint Conferences on Artificial Intelligence Organization.
DOI WoS4
2022 Zhang, X., Li, D., Wang, Z., Wang, J., Ding, E., Shi, J. Q., . . . Wang, J. (2022). Implicit Sample Extension for Unsupervised Person Re-Identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 7359-7368). Online: IEEE.
DOI Scopus143 WoS117
2022 Shao, Z., Zhang, X., Fang, M., Lin, Z., Wang, J., & Ding, C. (2022). Learning Granularity-Unified Representations for Text-to-Image Person Re-identification. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 5566-5574). Online: ACM.
DOI WoS120
2022 Xi, T., Sun, Y., Yu, D., Li, B., Peng, N., Zhang, G., . . . Wang, J. (2022). UFO: Unified Feature Optimization. In European Conference on Computer Vision Vol. 13686 (pp. 472-488). Tel Aviv: Springer Science and Business Media Deutschland GmbH.
DOI WoS7
2021 Zhang, X., Wang, X., Bian, J. -W., Shen, C., & You, M. (2021). Diverse Knowledge Distillation for End-to-End Person Search. In Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence Vol. 35 (pp. 3412-3420). Virtual, Online: Association for the Advancement of Artificial Intelligence.
DOI Scopus41 WoS29
2019 Zhang, X., Cao, J., Shen, C., & You, M. (2019). Self-training with progressive augmentation for unsupervised cross-domain person re-identification. In Proceedings of the IEEE International Conference on Computer Vision Vol. 2019-October (pp. 8221-8230). Online: IEEE.
DOI Scopus261 WoS237

Year Citation
2024 Yang, L., Zhang, X., Li, X., Chen, J., Yao, K., Zhang, G., . . . Yang, J. (2024). Add-SD: Rational Generation without Manual Reference.
2024 You, Z., Zhang, X., Guo, H., Wang, J., & Li, C. (2024). Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?
2024 Liao, M., Lu, H., Zhang, X., Wan, F., Wang, T., Zhao, Y., . . . Wang, J. (2024). Evaluation of Text-to-Video Generation Models: A Dynamics Perspective.
2024 Chen, Q., Su, X., Zhang, X., Wang, J., Chen, J., Shen, Y., . . . Wang, J. (2024). LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection.
2024 Wang, Y., Su, X., Chen, Q., Zhang, X., Xi, T., Yao, K., . . . Wang, J. (2024). OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer.

Date Role Research Topic Program Degree Type Student Load Student Name
2025 External Supervisor Low-supervision Learning via Knowledge Transfer from Pretrained Models Doctor of Philosophy Doctorate Full Time Mr Zicheng Duan
2025 External Supervisor Toward Multi-Agent 3D Dynamic Scene Generation: A Framework for Complex Interactions in Shared Virtual Environments Master of Philosophy Master Full Time Mr Zhiyuan Zhang
