Miss Xinyu Zhang
Australian Institute for Machine Learning
Division of Research and Innovation
Dr Xinyu Zhang is currently a Lecturer at the School of Computer Science, University of Auckland, and an Adjunct Lecturer at the Australian Institute for Machine Learning (AIML), University of Adelaide.
She was a Research Fellow at AIML, working closely with A/Prof. Lingqiao Liu and Prof. Anton van den Hengel. Before that, she was a Senior Research Scientist at Baidu Inc., working closely with Chief Scientist Jingdong Wang. She was also a Research Fellow at AIML working on the Neonatal Medical Project. She earned her Ph.D. from Tongji University and was a visiting Ph.D. student at the University of Adelaide, under the supervision of Prof. Chunhua Shen, Prof. Javen Qinfeng Shi, Prof. Anton van den Hengel and Prof. Mingyu You. Dr Zhang has broad interests in computer vision and metric learning. Her current research centers on image / video generation, self-supervised / unsupervised learning, human-centric AI and multimodal retrieval.
Google Scholar: https://scholar.google.com/citations?user=PSzJxD8AAAAJ&hl=en
Area Chair:
CVPR, ICCV, WACV
Conference Reviewer:
ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI, IJCAI
Journal Reviewer:
TPAMI, IJCV, TMM, PR, Neurocomputing
| Date | Position | Institution name |
|---|---|---|
| 2025 - ongoing | Lecturer | University of Auckland |
| 2024 - ongoing | Adjunct Lecturer | University of Adelaide |
| 2024 - 2025 | Research Fellow | University of Adelaide |
| 2021 - 2024 | Senior Research Scientist | Baidu (China) |
| Date | Institution name | Country | Title |
|---|---|---|---|
| | Tongji University | China | PhD |
| | University of Adelaide | Australia | Visiting PhD student |
| Year | Citation |
|---|---|
| 2024 | Zhang, J., Wang, M., Jiang, H., Zhang, X., Yan, C., & Zeng, D. (2024). STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints. IEEE Transactions on Multimedia, 26, 4445-4457. Scopus16 WoS17 |
| 2023 | Yin, J., Zhang, X., Ma, Z., Guo, J., & Liu, Y. (2023). A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification. IEEE Transactions on Image Processing, 32, 2309-2321. Scopus58 WoS47 Europe PMC1 |
| 2023 | Zhang, X., Chen, J., Yuan, J., Chen, Q., Wang, J., Wang, X., . . . Wang, J. (2023). CAE v2: Context autoencoder with CLIP latent alignment. Transactions on Machine Learning Research. |
| 2020 | Zhang, X., Zhang, R., Cao, J., Gong, D., You, M., & Shen, C. (2020). Part-Guided Attention Learning for Vehicle Instance Retrieval. IEEE Transactions on Intelligent Transportation Systems, 23(4), 1-13. Scopus50 WoS42 |
| 2018 | You, M., Zhang, Y., Shen, C., & Zhang, X. (2018). An Extended Filtered Channel Framework for Pedestrian Detection. IEEE Transactions on Intelligent Transportation Systems, 19(5), 1640-1651. Scopus22 WoS19 |
| Year | Citation |
|---|---|
| 2025 | Zhang, X., Gong, D., Duan, Z., Van Den Hengel, A., & Liu, L. (2025). Let Your Video Listen to Your Music! - Beat-Aligned, Content-Preserving Video Editing with Arbitrary Music. In MM 2025 - Proceedings of the 33rd ACM International Conference on Multimedia, Co-located with MM 2025 (pp. 12140-12149). ACM. DOI |
| 2024 | Yuan, J., Zhang, X., Zhou, H., Wang, J., Qiu, Z., Shao, Z., . . . Wang, J. (2024). Hap: Structure-aware masked image modeling for human-centric perception. In NeurIPS Proceedings. Online: NeurIPS. |
| 2024 | Sun, Y., Chen, J., Zhang, S., Zhang, X., Chen, Q., Zhang, G., . . . Li, Z. (2024). VRP-SAM: SAM with Visual Reference Prompt. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 23565-23574). Seattle, WA: IEEE Computer Society. DOI WoS33 |
| 2023 | Shao, Z., Zhang, X., Ding, C., Wang, J., & Wang, J. (2023). Unified Pre-training with Pseudo Texts for Text-To-Image Person Re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV, 2023) (pp. 1-11). Online: IEEE. DOI WoS38 |
| 2022 | Li, D., Wang, Z., Wang, J., Zhang, X., Ding, E., Wang, J., & Zhang, Z. (2022). Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (pp. 1067-1073). Vienna: International Joint Conferences on Artificial Intelligence Organization. DOI WoS4 |
| 2022 | Zhang, X., Li, D., Wang, Z., Wang, J., Ding, E., Shi, J. Q., . . . Wang, J. (2022). Implicit Sample Extension for Unsupervised Person Re-Identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 7359-7368). Online: IEEE. DOI Scopus143 WoS117 |
| 2022 | Shao, Z., Zhang, X., Fang, M., Lin, Z., Wang, J., & Ding, C. (2022). Learning Granularity-Unified Representations for Text-to-Image Person Re-identification. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 5566-5574). Online: ACM. DOI WoS120 |
| 2022 | Xi, T., Sun, Y., Yu, D., Li, B., Peng, N., Zhang, G., . . . Wang, J. (2022). UFO: Unified Feature Optimization. In European Conference on Computer Vision Vol. 13686 (pp. 472-488). Tel Aviv: Springer Science and Business Media Deutschland GmbH. DOI WoS7 |
| 2021 | Zhang, X., Wang, X., Bian, J. -W., Shen, C., & You, M. (2021). Diverse Knowledge Distillation for End-to-End Person Search. In Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence Vol. 35 (pp. 3412-3420). Virtual, Online: Association for the Advancement of Artificial Intelligence. DOI Scopus41 WoS29 |
| 2019 | Zhang, X., Cao, J., Shen, C., & You, M. (2019). Self-training with progressive augmentation for unsupervised cross-domain person re-identification. In Proceedings of the IEEE International Conference on Computer Vision Vol. 2019-October (pp. 8221-8230). Online: IEEE. DOI Scopus261 WoS237 |
| Year | Citation |
|---|---|
| 2024 | Yang, L., Zhang, X., Li, X., Chen, J., Yao, K., Zhang, G., . . . Yang, J. (2024). Add-SD: Rational Generation without Manual Reference. |
| 2024 | You, Z., Zhang, X., Guo, H., Wang, J., & Li, C. (2024). Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers? |
| 2024 | Liao, M., Lu, H., Zhang, X., Wan, F., Wang, T., Zhao, Y., . . . Wang, J. (2024). Evaluation of Text-to-Video Generation Models: A Dynamics Perspective. |
| 2024 | Chen, Q., Su, X., Zhang, X., Wang, J., Chen, J., Shen, Y., . . . Wang, J. (2024). LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection. |
| 2024 | Wang, Y., Su, X., Chen, Q., Zhang, X., Xi, T., Yao, K., . . . Wang, J. (2024). OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer. |
| Date | Role | Research Topic | Program | Degree Type | Student Load | Student Name |
|---|---|---|---|---|---|---|
| 2025 | External Supervisor | Low-supervision Learning via Knowledge Transfer from Pretrained Models | Doctor of Philosophy | Doctorate | Full Time | Mr Zicheng Duan |
| 2025 | External Supervisor | Toward Multi-Agent 3D Dynamic Scene Generation: A Framework for Complex Interactions in Shared Virtual Environments | Master of Philosophy | Master | Full Time | Mr Zhiyuan Zhang |