Xinyu Wang

Lecturer

School of Computer and Mathematical Sciences

Faculty of Sciences, Engineering and Technology

Eligible to supervise Masters and PhD (as Co-Supervisor) - email supervisor to discuss availability.


Dr. Xinyu Wang is currently a Lecturer at the School of Computer and Mathematical Sciences. He received his PhD from the University of Adelaide under the supervision of Prof. Chunhua Shen, and subsequently worked as a Research Fellow at the Australian Institute for Machine Learning (AIML) for two years, collaborating with A/Prof. Qi Wu. Xinyu has been actively publishing research papers in prestigious conferences and journals within the field of Artificial Intelligence, such as CVPR, ACL, ICLR, ACMMM, TPAMI, IJCV, TMM, TCSVT, and PR. He and his colleagues received the Best Paper Award at ACL 2024, a top conference in Natural Language Processing, for their pioneering work on using AI models to decipher ancient languages.

Xinyu's current research interests span a broad range within deep learning, with a recent focus on Large Multimodal Models (LMMs). His work emphasizes improving the efficiency and accessibility of LMMs, alongside exploring their interdisciplinary applications in fields such as paleography and computational social sciences.

My current research primarily focuses on the intersection of computer vision and natural language processing, with a current emphasis on Large Multimodal Models (LMMs). I am particularly interested in multi-modal reasoning and exploring its applications in interdisciplinary fields.

Recently, my work has been expanding into using deep learning techniques within computational social sciences and game theory. This includes projects on deciphering ancient scripts and studying the behavior of LMMs in multi-agent cooperation scenarios.


I am happy to supervise 1-2 talented Honours/Master’s students each semester for their research projects (4015A/B, 7205A/B, etc.), provided you meet the following criteria:

  • High GPA
  • Strong programming skills, especially in Python
  • Completed AI-related courses, such as Deep Learning Foundations, Applied Natural Language Processing, Using Machine Learning Tools, Computer Vision, etc.
  • Familiarity with deep learning frameworks, such as PyTorch, TensorFlow, etc.

If you are interested, please feel free to contact me with your latest CV, academic transcripts, and a brief statement on the research topic you would like to work with me.

 

  • Journals

    Year Citation
    2025 Lin, C. -T., Ng, C. C., Tan, Z. Q., Nah, W. J., Wang, X., Kew, J. L., . . . Zach, C. (2025). Text in the dark: Extremely low-light text image enhancement. Signal Processing: Image Communication, 130, 117222.
    DOI
    2024 Wang, P., Zhang, K., Wang, X., Han, S., Liu, Y., Wan, J., . . . Liu, Y. (2024). An open dataset for oracle bone character recognition and decipherment.. Sci Data, 11(1), 976.
    DOI Scopus3 Europe PMC1
    2024 Ng, C. C., Lin, C. T., Tan, Z. Q., Wang, X., Kew, J. L., Chan, C. S., & Zach, C. (2024). When IC meets text: Towards a rich annotated integrated circuit text dataset. Pattern Recognition, 147, 110124.
    DOI Scopus1
    2023 Li, Z., Wang, X., Liu, Y., Jin, L., Huang, Y., & Ding, K. (2023). Improving Handwritten Mathematical Expression Recognition Via Similar Symbol Distinguishing. IEEE Transactions on Multimedia, 26, 90-102.
    DOI Scopus5
    2023 Liu, Y., Zhang, J., Peng, D., Huang, M., Wang, X., Tang, J., . . . Jin, L. (2023). SPTS v2: Single-Point Scene Text Spotting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(12), 1-15.
    DOI Scopus26 WoS1 Europe PMC2
    2021 Liu, Y., He, T., Chen, H., Wang, X., Luo, C., Zhang, S., . . . Jin, L. (2021). Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection. International Journal of Computer Vision, 129(6), 1972-1992.
    DOI Scopus25 WoS6
    2020 Wang, X., Shen, C., Li, H., & Xu, S. (2020). Human Detection Aided by Deeply Learned Semantic Masks. IEEE Transactions on Circuits and Systems for Video Technology, 30(8), 2663-2673.
    DOI Scopus13 WoS6
    2019 Li, H., Wang, X., Shen, F., Li, Y., Porikli, F., & Wang, M. (2019). Real-Time Deep Tracking via Corrective Domain Adaptation. IEEE Transactions on Circuits and Systems for Video Technology, 29(9), 2600-2612.
    DOI Scopus17 WoS10
  • Conference Papers

    Year Citation
    2024 Guan, H., Yang, H., Wang, X., Han, S., Liu, Y., Jin, L., . . . Liu, Y. (2024). Deciphering Oracle Bone Language with Diffusion Models. In L. W. Ku, A. Martins, & V. Srikumar (Eds.), Proceedings of the Annual Meeting of the Association for Computational Linguistics Vol. 1 (pp. 15554-15567). THAILAND, Bangkok: ASSOC COMPUTATIONAL LINGUISTICS-ACL.
    DOI Scopus3
    2024 Wang, P., Zhang, K., Wang, X., Han, S., Liu, Y., Jin, L., . . . Liu, Y. (2024). Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14804 LNCS (pp. 169-187). Athens Greece: Springer Nature Switzerland.
    DOI Scopus2
    2024 Wang, X., Zhuang, B., & Wu, Q. (2024). ModaVerse: Efficiently Transforming Modalities with LLMs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 26596-26606). WA, Seattle: IEEE COMPUTER SOC.
    DOI Scopus5
    2022 Peng, D., Wang, X., Liu, Y., Zhang, J., Huang, M., Lai, S., . . . Jin, L. (2022). SPTS: Single-Point Text Spotting. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 10 pages). Online: ACM.
    DOI Scopus48
    2021 Ng, C. C., Nazaruddin, A. K. B., Lee, Y. K., Wang, X., Liu, Y., Chan, C. S., . . . Fan, L. (2021). ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment. In J. Llados, D. Lopresti, & S. Uchida (Eds.), Proceedings of the ... International Conference on Document Analysis and Recognition / sponsored by the IAPR TC-11 and TC-10, in cooperation with the IEEE Computer Society and IGS. International Conference on Document Analysis and Recog... Vol. 12824 LNCS (pp. 663-677). Switzerland: Springer.
    DOI Scopus4
    2020 Wang, X., Liu, Y., Shen, C., Ng, C. C., Luo, C., Jin, L., . . . Wang, L. (2020). On the general value of evidence, and bilingual scene-text visual question answering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) (pp. 10123-10132). online: IEEE.
    DOI Scopus80
    2018 Wang, X., Li, H., Li, Y., Porikli, F., & Wang, M. (2018). Deep tracking with objectness. In IEEE International Conference on Image Processing, ICIP Vol. 2017-September (pp. 660-664). New York, NY, USA: IEEE.
    DOI Scopus6
    2017 Wang, X., Li, H., Li, Y., Shen, F., & Porikli, F. (2017). Robust and real-time deep tracking via multi-scale domain adaptation. In Proceedings - IEEE International Conference on Multimedia and Expo (ICME) (pp. 1338-1343). Hong Kong, China: IEEE.
    DOI Scopus15
  • UoA Start-up Grant - Sole Investigator (2025-2027)
  • UoA Early Career Seed Funding - Sole Investigator (2025)
2025
  • Semester 1 & 2 CompSci 2008/3020 Topics/Advanced Topics in Computer Science
  • Trimester 3 CompSci 7327 Concepts in Artificial Intelligence and Machine Learning
  • Trimester 3 CompSci 7318 Deep Learning Fundamentals
2024
  • Trimester 3 CompSci 7318 Deep Learning Fundamentals
  • Semester 1 CompSci 3007/7059 Artificial Intelligence
  • Position: Lecturer
  • Email: xinyu.wang02@adelaide.edu.au
  • Campus: North Terrace
  • Building: Ingkarni Wardli, floor Level Five
  • Room: 5.48D
  • Org Unit: School of Computer and Mathematical Sciences

Connect With Me
External Profiles