Xinyu Wang

Lecturer

School of Computer and Mathematical Sciences

Faculty of Sciences, Engineering and Technology

Eligible to supervise Masters and PhD (as Co-Supervisor) - email supervisor to discuss availability.


Dr. Xinyu Wang is currently a Lecturer at the School of Computer and Mathematical Sciences and also a researcher with the Australian Institute for Machine Learning (AIML). He received his PhD from the University of Adelaide under the supervision of Prof. Chunhua Shen, and subsequently worked as a Research Fellow at AIML for two years, collaborating with A/Prof. Qi Wu. Xinyu has been actively publishing research papers in prestigious conferences and journals within the field of Artificial Intelligence, such as CVPR, ACL, and ICLR. He and his colleagues received the Best Paper Award at ACL 2024 for their pioneering work on using AI models to decipher ancient languages.Xinyu's current research interests span a broad range within multimodal machine learning, with a recent focus on Large Multimodal Models (LMMs). His work emphasizes improving the efficiency and accessibility of LMMs, alongside exploring their interdisciplinary applications in fields such as document analysis, paleography, game theory and computational social sciences.

I am NOT recruiting summer research students, as these 6-week projects are too short to conduct in-depth research. My name was unexpectedly listed in the supervisors’ list, so please do not reach out for such engagements.

However, if you are interested in undertaking a longer-term research project with me, such as an Honours thesis or an MSc research component, please feel free to email me to discuss the details. Typically, I expect prospective students to have a high GPA (above 6.8) and strong programming skills, particularly in Python.

  • Journals

    Year Citation
    2025 Lin, C. -T., Ng, C. C., Tan, Z. Q., Nah, W. J., Wang, X., Kew, J. L., . . . Zach, C. (2025). Text in the dark: Extremely low-light text image enhancement. Signal Processing: Image Communication, 130, 117222.
    DOI
    2024 Wang, P., Zhang, K., Wang, X., Han, S., Liu, Y., Wan, J., . . . Liu, Y. (2024). An open dataset for oracle bone character recognition and decipherment.. Sci Data, 11(1), 976.
    DOI Scopus6 WoS4 Europe PMC1
    2024 Ng, C. C., Lin, C. T., Tan, Z. Q., Wang, X., Kew, J. L., Chan, C. S., & Zach, C. (2024). When IC meets text: Towards a rich annotated integrated circuit text dataset. Pattern Recognition, 147, 110124.
    DOI Scopus1
    2023 Li, Z., Wang, X., Liu, Y., Jin, L., Huang, Y., & Ding, K. (2023). Improving Handwritten Mathematical Expression Recognition Via Similar Symbol Distinguishing. IEEE Transactions on Multimedia, 26, 90-102.
    DOI Scopus6 WoS5
    2023 Liu, Y., Zhang, J., Peng, D., Huang, M., Wang, X., Tang, J., . . . Jin, L. (2023). SPTS v2: Single-Point Scene Text Spotting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(12), 1-15.
    DOI Scopus28 WoS28 Europe PMC2
    2021 Liu, Y., He, T., Chen, H., Wang, X., Luo, C., Zhang, S., . . . Jin, L. (2021). Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection. International Journal of Computer Vision, 129(6), 1972-1992.
    DOI Scopus26 WoS24
    2020 Wang, X., Shen, C., Li, H., & Xu, S. (2020). Human Detection Aided by Deeply Learned Semantic Masks. IEEE Transactions on Circuits and Systems for Video Technology, 30(8), 2663-2673.
    DOI Scopus13 WoS11
    2019 Li, H., Wang, X., Shen, F., Li, Y., Porikli, F., & Wang, M. (2019). Real-Time Deep Tracking via Corrective Domain Adaptation. IEEE Transactions on Circuits and Systems for Video Technology, 29(9), 2600-2612.
    DOI Scopus17 WoS16
  • Conference Papers

    Year Citation
    2025 Wang, X., Zhuang, B., & Wu, Q. (2025). ARE LARGE VISION LANGUAGE MODELS GOOD GAME PLAYERS?. In 13th International Conference on Learning Representations Iclr 2025 (pp. 24502-24539).
    Scopus1
    2024 Guan, H., Yang, H., Wang, X., Han, S., Liu, Y., Jin, L., . . . Liu, Y. (2024). Deciphering Oracle Bone Language with Diffusion Models. In L. W. Ku, A. Martins, & V. Srikumar (Eds.), Proceedings of the Annual Meeting of the Association for Computational Linguistics Vol. 1 (pp. 15554-15567). THAILAND, Bangkok: ASSOC COMPUTATIONAL LINGUISTICS-ACL.
    DOI Scopus5 WoS2
    2024 Wang, P., Zhang, K., Wang, X., Han, S., Liu, Y., Jin, L., . . . Liu, Y. (2024). Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14804 LNCS (pp. 169-187). Athens Greece: Springer Nature Switzerland.
    DOI Scopus2 WoS2
    2024 Wang, X., Zhuang, B., & Wu, Q. (2024). ModaVerse: Efficiently Transforming Modalities with LLMs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 26596-26606). WA, Seattle: IEEE COMPUTER SOC.
    DOI Scopus6 WoS2
    2022 Peng, D., Wang, X., Liu, Y., Zhang, J., Huang, M., Lai, S., . . . Jin, L. (2022). SPTS: Single-Point Text Spotting. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 10 pages). Online: ACM.
    DOI Scopus50
    2021 Ng, C. C., Nazaruddin, A. K. B., Lee, Y. K., Wang, X., Liu, Y., Chan, C. S., . . . Fan, L. (2021). ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment. In J. Llados, D. Lopresti, & S. Uchida (Eds.), Proceedings of the ... International Conference on Document Analysis and Recognition / sponsored by the IAPR TC-11 and TC-10, in cooperation with the IEEE Computer Society and IGS. International Conference on Document Analysis and Recog... Vol. 12824 LNCS (pp. 663-677). Switzerland: Springer.
    DOI Scopus4
    2020 Wang, X., Liu, Y., Shen, C., Ng, C. C., Luo, C., Jin, L., . . . Wang, L. (2020). On the general value of evidence, and bilingual scene-text visual question answering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) (pp. 10123-10132). online: IEEE.
    DOI Scopus84 WoS85
    2018 Wang, X., Li, H., Li, Y., Porikli, F., & Wang, M. (2018). Deep tracking with objectness. In IEEE International Conference on Image Processing, ICIP Vol. 2017-September (pp. 660-664). New York, NY, USA: IEEE.
    DOI Scopus6
    2017 Wang, X., Li, H., Li, Y., Shen, F., & Porikli, F. (2017). Robust and real-time deep tracking via multi-scale domain adaptation. In Proceedings - IEEE International Conference on Multimedia and Expo (ICME) (pp. 1338-1343). Hong Kong, China: IEEE.
    DOI Scopus15
  • UoA Start-up Grant - Sole Investigator (2025-2027)
  • UoA Early Career Seed Funding - Sole Investigator (2025)
2025
  • Semester 1 & 2 CompSci 2008/3020 Topics/Advanced Topics in Computer Science
  • Trimester 3 CompSci 7327 Concepts in Artificial Intelligence and Machine Learning
  • Trimester 3 CompSci 7318 Deep Learning Fundamentals
2024
  • Trimester 3 CompSci 7318 Deep Learning Fundamentals
  • Semester 1 CompSci 3007/7059 Artificial Intelligence
  • Current Higher Degree by Research Supervision (University of Adelaide)

    Date Role Research Topic Program Degree Type Student Load Student Name
    2025 Co-Supervisor Efficient Video Foundation Model Doctor of Philosophy Doctorate Full Time Mr Feng Chen
    2025 Co-Supervisor Vision-and-Language in the Wild Doctor of Philosophy Doctorate Full Time Mr Zheng Yu
    2025 Co-Supervisor Foundation Models for Embodied Navigation Doctor of Philosophy Doctorate Full Time Mr Xiangyu Shi
  • Position: Lecturer
  • Email: xinyu.wang02@adelaide.edu.au
  • Campus: North Terrace
  • Building: Ingkarni Wardli, floor Level Five
  • Room: 5.48D
  • Org Unit: School of Computer and Mathematical Sciences

Connect With Me
External Profiles