Miss Xinyu Zhang

Office of Engineering and Information Technology

College of Engineering and Information Technology

Dr Xinyu Zhang is currently a Lecturer at the School of Computer Science, University of Auckland, and also an Adjunct Lecturer at Australian Institute for Machine Learning (AIML), the University of Adelaide.

She was a Research Fellow at AIML, working closely with A/Prof. Lingqiao Liu and Prof. Anton van den Hengel. Before that, She was a Senior Research Scientist in Baidu Inc., working closely with Chief Scientist Jingdong Wang. She was also a Research Fellow at AIML working on the Neonatal Medical Project. She earned her Ph.D from Tongji University and was a visiting Ph.D student at the University of Adelaide, under the supervision of Prof. Chunhua Shen, Prof. Javen Qinfeng Shi, Prof. Anton van den Hengel and Prof. Mingyu You. Dr Zhang has broad interests in computer vision and metric learning. Her current research topics centers on image / video generation, self-supervised / un-supervised learning, human-centric AI and multimodal retrieval.Google Scholar: https://scholar.google.com/citations?user=PSzJxD8AAAAJ&hl=en

Area Chair:
CVPR, ICCV, WACV
Conference Reviewer:ICLR, NeurIPS, CVPR, ICCV, ECCV, AAAI, IJCAIJournal Reviewer:TPAMI, IJCV, TMM, PR, Neurocomputing

Date	Position	Institution name
2025 - ongoing	Lecturer	University of Auckland
2024 - ongoing	Adjunct Lecturer	University of Adelaide
2024 - 2025	Research Fellow	University of Adelaide
2021 - 2024	Senior Research Scientist	Baidu (China)

Date	Institution name	Country	Title
	Tongji University	China	PhD
	University of Adelaide	Australia	Visiting PhD student

Year	Citation
2026	Lu, H., Zhang, X., Tian, Z., Wu, X., Zuo, W., & Wang, J. (2026). I2V-Adapter: Fast adapting image pre-trained models for video correspondence. Pattern Recognition, 177, 113228. DOI
2024	Zhang, J., Wang, M., Jiang, H., Zhang, X., Yan, C., & Zeng, D. (2024). STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints. IEEE Transactions on Multimedia, 26, 4445-4457. DOI Scopus19 WoS19
2023	Yin, J., Zhang, X., Ma, Z., Guo, J., & Liu, Y. (2023). A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification.. IEEE transactions on image processing : a publication of the IEEE Signal Processing Society, 32, 2309-2321. DOI Scopus61 WoS51 Europe PMC1
2023	Zhang, X., Chen, J., Yuan, J., Chen, Q., Wang, J., Wang, X., . . . Wang, J. (2023). CAE v2: Context autoencoder with CLIP latent alignment. Transactions on Machine Learning Research.
2020	Zhang, X., Zhang, R., Cao, J., Gong, D., You, M., & Shen, C. (2020). Part-Guided Attention Learning for Vehicle Instance Retrieval. IEEE Transactions on Intelligent Transportation Systems, 23(4), 1-13. DOI Scopus52 WoS43
2018	You, M., Zhang, Y., Shen, C., & Zhang, X. (2018). An Extended Filtered Channel Framework for Pedestrian Detection. IEEE Transactions on Intelligent Transportation Systems, 19(5), 1640-1651. DOI Scopus22 WoS19

Year	Citation
2025	Zhang, X., Gong, D., Duan, Z., Van Den Hengel, A., & Liu, L. (2025). Let Your Video Listen to Your Music! - Beat-Aligned, Content-Preserving Video Editing with Arbitrary Music. In Mm 2025 Proceedings of the 33rd ACM International Conference on Multimedia Co Located with mm 2025 (pp. 12140-12149). IRELAND, Dublin: ASSOC COMPUTING MACHINERY. DOI
2024	Yuan, J., Zhang, X., Zhou, H., Wang, J., Qiu, Z., Shao, Z., . . . Wang, J. (2024). Hap: Structure-aware masked image modeling for human-centric perception. In NeurIPS Proceedings. Online: NeurIPS.
2024	Sun, Y., Chen, J., Zhang, S., Zhang, X., Chen, Q., Zhang, G., . . . Li, Z. (2024). VRP-SAM: SAM with Visual Reference Prompt. In 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (pp. 23565-23574). WA, Seattle: IEEE COMPUTER SOC. DOI WoS42
2023	Shao, Z., Zhang, X., Ding, C., Wang, J., & Wang, J. (2023). Unified Pre-training with Pseudo Texts for Text-To-Image Person Re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV, 2023) (pp. 1-11). Online: IEEE. DOI WoS46
2022	Li, D., Wang, Z., Wang, J., Zhang, X., Ding, E., Wang, J., & Zhang, Z. (2022). Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (pp. 1067-1073). Vienna: International Joint Conferences on Artificial Intelligence Organization. DOI WoS4
2022	Zhang, X., Li, D., Wang, Z., Wang, J., Ding, E., Shi, J. Q., . . . Wang, J. (2022). Implicit Sample Extension for Unsupervised Person Re-Identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 7359-7368). Online: IEEE. DOI Scopus148 WoS123
2022	Shao, Z., Zhang, X., Fang, M., Lin, Z., Wang, J., & Ding, C. (2022). Learning Granularity-Unified Representations for Text-to-Image Person Re-identification. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 5566-5574). Online: ACM. DOI WoS140
2022	Xi, T., Sun, Y., Yu, D., Li, B., Peng, N., Zhang, G., . . . Wang, J. (2022). UFO: Unified Feature Optimization. In European Conference on Computer Vision Vol. 13686 (pp. 472-488). Tel Aviv: Springer Science and Business Media Deutschland GmbH. DOI WoS7
2021	Zhang, X., Wang, X., Bian, J. -W., Shen, C., & You, M. (2021). Diverse Knowledge Distillation for End-to-End Person Search. In THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE Vol. 35 (pp. 3412-3420). Virtual, Online: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. DOI Scopus42 WoS32
2019	Zhang, X., Cao, J., Shen, C., & You, M. (2019). Self-training with progressive augmentation for unsupervised cross-domain person re-identification. In Proceedings of the IEEE International Conference on Computer Vision Vol. 2019-October (pp. 8221-8230). online: IEEE. DOI Scopus262 WoS238

Year	Citation
2024	Yang, L., Zhang, X., Li, X., Chen, J., Yao, K., Zhang, G., . . . Yang, J. (2024). Add-SD: Rational Generation without Manual Reference.
2024	You, Z., Zhang, X., Guo, H., Wang, J., & Li, C. (2024). Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?.
2024	Liao, M., Lu, H., Zhang, X., Wan, F., Wang, T., Zhao, Y., . . . Wang, J. (2024). Evaluation of Text-to-Video Generation Models: A Dynamics Perspective.
2024	Chen, Q., Su, X., Zhang, X., Wang, J., Chen, J., Shen, Y., . . . Wang, J. (2024). LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection.
2024	Wang, Y., Su, X., Chen, Q., Zhang, X., Xi, T., Yao, K., . . . Wang, J. (2024). OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer.

Date	Role	Research Topic	Program	Degree Type	Student Load	Student Name
2025	External Supervisor	Low-supervision Learning via Knowledge Transfer from Pretrained Models	Doctor of Philosophy	Doctorate	Full Time	Mr Zicheng Duan
2025	External Supervisor	Toward Multi-Agent 3D Dynamic Scene Generation: A Framework for Complex Interactions in Shared Virtual Environments	Master of Philosophy	Master	Full Time	Mr Zhiyuan Zhang
2025	External Supervisor	Toward Multi-Agent 3D Dynamic Scene Generation: A Framework for Complex Interactions in Shared Virtual Environments	Master of Philosophy	Master	Full Time	Mr Zhiyuan Zhang
2025	External Supervisor	Low-supervision Learning via Knowledge Transfer from Pretrained Models	Doctor of Philosophy	Doctorate	Full Time	Mr Zicheng Duan

Email: xinyu.zhang02@adelaide.edu.au

Miss Xinyu Zhang

Miss Xinyu Zhang

Connect With Me

External Profiles

Other Links

Miss Xinyu Zhang

Miss Xinyu Zhang

Appointments

Education

Journals

Conference Papers

Preprint

Current Higher Degree by Research Supervision (Adelaide University)

Connect With Me

External Profiles

Other Links