Mr Yuanhong Chen
Research Fellow
School of Computer Science and Information Technology
College of Engineering and Information Technology
Dr. Yuanhong Chen is a postdoctoral researcher at the Australian Institute for Machine Learning, specialising in computer vision, multimodal learning, and generative models. He holds a PhD in Computer Science and works on bridging medical image analysis and audio-visual perception with deep learning. His current research explores integrating large language models for cross-modal understanding and reasoning.
My previous research focused on multimodal learning, audio-visual perception, and spatial audio generation. My current research explores the integration of large language models for cross-modal understanding and reasoning, with applications in audio-visual learning and generative modelling.
| Date | Institution name | Country | Title |
|---|---|---|---|
| 2021 - 2024 | University of Adelaide | Australia | PhD |
| Year | Citation |
|---|---|
| 2025 | Chen, Y., Shimada, K., Simon, C., Ikemiya, Y., Shibuya, T., & Mitsufuji, Y. (2025). CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation. |
| 2024 | Chen, Y., Wang, C., Liu, Y., Wang, H., & Carneiro, G. (2024). CPM: Class-conditional Prompting Machine for Audio-visual Segmentation. |
| 2023 | Chen, Y., Liu, Y., Wang, H., Liu, F., Wang, C., Frazer, H., & Carneiro, G. (2023). Unraveling Instance Associations: A Closer Look for Audio-Visual Segmentation. |
| 2023 | Chen, Y., Liu, Y., Wang, C., Elliott, M., Kwok, C. F., Pena-Solorzano, C., . . . Carneiro, G. (2023). BRAIxDet: Learning to Detect Malignant Breast Lesion with Incomplete Annotations. |
| 2022 | Chen, Y., Liu, F., Wang, H., Wang, C., Tian, Y., Liu, Y., & Carneiro, G. (2022). BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification. |
| 2022 | Tian, Y., Pang, G., Liu, Y., Wang, C., Chen, Y., Liu, F., . . . Carneiro, G. (2022). Unsupervised Anomaly Detection in Medical Images with a Memory-augmented Multi-level Cross-attentional Masked Autoencoder.. |