Xin Yu

Australian Institute for Machine Learning - Projects

Faculty of Sciences, Engineering and Technology

I am Xin Yu, an Associate Professor at the University of Adelaide (Dec 2025 – present). My research focuses on computer vision and machine learning, with a particular interest in enabling technologies that improve accessibility and understanding through visual intelligence. I received a PhD in Computer Science from the Australian National University and a PhD in Communication and Information Engineering from Tsinghua University. I am currently a Visiting Faculty Researcher at Google (2024–present) and lead a Visual Intelligence Group at the Australian Institute for Machine Learning (AIML).

I am the recipient of several prestigious awards, including the Australian Research Council (ARC) Discovery Early Career Researcher Award (DECRA, 2023–2025), the Google Research Scholar Program Award (2021), and the Google Inclusion Research Award (2023). I was also honoured with the Queensland Young Tall Poppy Science Award from the Australian Institute of Policy and Science (AIPS) and received the CORE Outstanding Research Contribution Award 2026 from the Computing Research and Education Association of Australasia (CORE).

Language Competencies

Language	Competency
English	Can read, write, speak, understand spoken and peer review

Education

Date	Institution name	Country	Title
2015 - 2019	Australian National University	Australia	PhD
2009 - 2015	Tsinghua University	China	PhD

Research Interests

Computer Vision; Knowledge Representation and Machine Learning; Machine learning; Data engineering and data science

Journals

Year	Citation
2025	Du, X., Sun, H., Lu, M., Zhu, T., & Yu, X. (2025). DreamCar: Leveraging Car-Specific Prior for In-the-Wild 3D Car Reconstruction. IEEE Robotics and Automation Letters, 10(2), 1840-1847. DOI Scopus1
2025	Ma, Y., Wang, S., Ding, Y., Ma, B., Lv, T., Fan, C., . . . Yu, X. (2025). TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles. IEEE Transactions on Multimedia, 27, 6335-6346. DOI
2025	Wu, H., Zhao, M., Hu, Z., Fan, C., Li, L., Chen, W., . . . Yu, X. (2025). ICE: Interactive 3D Game Character Facial Editing via Dialogue. IEEE Transactions on Multimedia, 27, 3210-3223. DOI
2025	Jiang, W., Zhao, D., Wang, C., Yu, X., Arun, P. V., Asano, Y., . . . Zhou, H. (2025). Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network. Knowledge-Based Systems, 330, 114595. DOI
2025	Li, Z., Liu, S., Yu, X., Bhavya, K., Cao, J., Diffenderfer, J. D., . . . Pascucci, V. (2025). "Understanding Robustness Lottery": A Geometric Visual Comparative Analysis of Neural Network Pruning Approaches. IEEE Transactions on Visualization and Computer Graphics, 31(9), 6337-6352. DOI
2024	Liu, C., Li, P., Zhang, H., Li, L., Huang, Z., Wang, D., & Yu, X. (2024). BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge. IEEE Transactions on Multimedia, 26, 10015-10028. DOI Scopus14
2024	Zhang, W., Li, L., Ding, Y., Chen, W., Deng, Z., & Yu, X. (2024). Detecting Facial Action Units From Global-Local Fine-Grained Expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34(2), 983-994. DOI Scopus10
2024	Chen, D. Y., Di, X., Yu, X., & Biswal, B. B. (2024). The significance and limited influence of cerebrovascular reactivity on age and sex effects in task-and resting-state brain activity. Cerebral Cortex, 34(2). DOI Scopus3
2024	Hu, Z., Tang, J., Li, L., Hou, J., Xin, H., Yu, X., & Bu, J. (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35(1). DOI Scopus3
2024	Fu, H., Yu, X., Li, L., & Zhang, L. (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 26, 9304-9315. DOI Scopus3
2024	Rao, Q., Yu, X., Li, G., & Zhu, L. (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238, 103864. DOI Scopus5
2024	Qi, X., Liu, C., Li, L., Hou, J., Xin, H., & Yu, X. (2024). EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation. IEEE Transactions on Multimedia, 26, 10420-10430. DOI Scopus10
2024	Zhao, Y., Liu, B., Zhu, T., Ding, M., Yu, X., & Zhou, W. (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585, 127593. DOI Scopus21
2024	Song, X., Liu, C., Zheng, Y., Feng, Z., Li, L., Zhou, K., & Yu, X. (2024). HairStyle Editing via Parametric Controllable Strokes. IEEE Transactions on Visualization and Computer Graphics, 30(7), 3857-3870. DOI Scopus4
2024	Xu, Q., Chen, H., Du, H., Zhang, H., Łukasik, S., Zhu, T., & Yu, X. (2024). M3A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249, 104205. DOI Scopus5
2024	Choi, S., Hike, D., Pohmann, R., Avdievich, N., Gomez-Cid, L., Man, W., . . . Yu, X. (2024). Alpha-180 spin-echo-based line-scanning method for high-resolution laminar-specific fMRI in animals. Imaging Neuroscience, 2, imag-2-00120. DOI Europe PMC1
2024	Jiang, Y., Pais-Roldán, P., Pohmann, R., & Yu, X. (2024). High Spatiotemporal Resolution Radial Encoding Single-Vessel fMRI. Advanced Science, 11(26), e2309218. DOI Scopus1 Europe PMC4
2024	Sheng, H., Shen, X., Du, H., Zhang, H., Huang, Z., & Yu, X. (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4(4), 877-887. DOI
2024	Yu, X., Yu, X., Yang, Q., Tang, Y., Gao, R., Bao, S., . . . Landman, B. A. (2024). Deep conditional generative model for longitudinal single-slice abdominal computed tomography harmonization. Journal of Medical Imaging, 11(2), 024008. DOI Europe PMC1
2024	Choi, S. H., Im, G. H., Choi, S., Yu, X., Bandettini, P. A., Menon, R. S., & Kim, S. G. (2024). No replication of direct neuronal activity–related (DIANA) fMRI in anesthetized mice. Science Advances, 10(13), eadl0999. DOI Scopus7 Europe PMC12
2024	Plagwitz, L., Choi, S., Yu, X., Segelcke, D., Lambers, H., Pogatzki-Zahn, E., . . . Pradier, B. (2024). Data-driven time series analysis of sensory cortical processing using high-resolution fMRI across different studies. Biomedical Signal Processing and Control, 93, 106136. DOI Scopus1
2024	Wang, S., Ma, Y., Ding, Y., Hu, Z., Fan, C., Lv, T., . . . Yu, X. (2024). StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(6), 4331-4347. DOI Scopus8 Europe PMC1
2024	Chen, Y., Fernandez, Z., Scheel, N., Gifani, M., Zhu, D. C., Counts, S. E., . . . Qian, C. (2024). Novel inductively coupled ear-bars (ICEs) to enhance restored fMRI signal from susceptibility compensation in rats. Cerebral Cortex, 34(1), bhad479. DOI Scopus1 Europe PMC1
2024	Guan, S., Yu, X., Huang, W., Fang, G., & Lu, H. (2024). DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition. IEEE Transactions on Image Processing, 33, 395-407. DOI Scopus11 Europe PMC2
2024	Du, X., Yu, X., Liu, J., Dai, B., & Xu, F. (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600, 128129. DOI Scopus6
2024	Hike, D., Choi, S., Zhang, B., Jiang, Y., Liu, X., Pohmann, R., . . . Yu, X. (2024). Implementation of 2D Line scanning Method. Aperture Neuro, 4. DOI
2024	Zhou, S., Zhu, T., Ye, D., Yu, X., & Zhou, W. (2024). Boosting Model Inversion Attacks With Adversarial Examples. IEEE Transactions on Dependable and Secure Computing, 21(3), 1451-1468. DOI Scopus15
2024	Zhao, M., Qi, X., Hu, Z., Li, L., Zhang, Y., Huang, Z., & Yu, X. (2024). Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations. IEEE Transactions on Multimedia, 26, 5939-5950. DOI Scopus4
2023	Mao, Y., Wan, Z., Dai, Y., & Yu, X. (2023). Deep Idempotent Network for Efficient Single Image Blind Deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33(1), 172-185. DOI Scopus32
2023	Bao, S., Cui, C., Li, J., Tang, Y., Lee, H. H., Deng, R., . . . Huo, Y. (2023). Topological-Preserving Membrane Skeleton Segmentation in Multiplex Immunofluorescence Imaging. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12471, 124710B. DOI Scopus1
2023	Zeng, H., Zhang, W., Fan, C., Lv, T., Wang, S., Zhang, Z., . . . Yu, X. (2023). FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37(3), 3367-3375. DOI Scopus3
2023	Sheng, H., Yu, X., Wang, F., Khan, M. W., Weng, H., Shariflou, S., & Golzan, S. M. (2023). Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations. Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBS, 2023, 1-4. DOI Scopus3 Europe PMC1
2023	Xu, Y., Zhou, C., Yu, X., & Yang, Y. (2023). Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection. IEEE Transactions on Image Processing, 32, 1992-2002. DOI Scopus9 Europe PMC2
2023	Yu, X., Tang, Y., Yang, Q., Lee, H. H., Gao, R., Bao, S., . . . Landman, B. A. (2023). Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12464, 1246423. DOI Scopus3 Europe PMC2
2023	Yang, Q., Yu, X., Lee, H. H., Cai, L. Y., Xu, K., Bao, S., . . . Landman, B. A. (2023). Single slice thigh CT muscle group segmentation with domain adaptation and self-training. Journal of Medical Imaging, 10(4), 044001. DOI Scopus2 Europe PMC1
2023	Shi, Y., Yu, X., Liu, L., Campbell, D., Koniusz, P., & Li, H. (2023). Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(3), 2682-2697. DOI Scopus38 Europe PMC2
2023	Yu, X., Yang, Q., Zhou, Y., Cai, L. Y., Gao, R., Lee, H. H., . . . Tang, Y. (2023). UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation. Medical Image Analysis, 90, 102939. DOI Scopus64 Europe PMC29
2023	Ma, Y., Wang, S., Hu, Z., Fan, C., Lv, T., Ding, Y., . . . Yu, X. (2023). StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37(2), 1896-1904. DOI Scopus45
2023	Ramadass, K., Yu, X., Cai, L. Y., Tang, Y., Bao, S., Kerley, C., . . . Landman, B. A. (2023). Deep whole brain segmentation of 7T structural MRI. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12464, 124642O. DOI
2022	Fan, H., Yu, X., Yang, Y., & Kankanhalli, M. (2022). Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 9918-9930. DOI Scopus40 Europe PMC2
2022	Shi, Y., Campbell, D., Yu, X., & Li, H. (2022). Geometry-Guided Street-View Panorama Synthesis From Satellite Imagery. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 10009-10022. DOI Scopus34 Europe PMC1
2022	Chen, X., Jiang, Y., Choi, S., Pohmann, R., Scheffler, K., Kleinfeld, D., & Yu, X. (2022). Erratum: Assessment of single-vessel cerebral blood velocity by phase contrast fMRI (PLoS Biol (2021) 19:9 (e3000923) DOI: 10.1371/journal.pbio.3000923). PLOS Biology, 20(12), e3001951. DOI
2022	Ma, F., Wu, Y., Yu, X., & Yang, Y. (2022). Learning With Noisy Labels via Self-Reweighting From Class Centroids. IEEE Transactions on Neural Networks and Learning Systems, 33(11), 6275-6285. DOI Scopus35 Europe PMC3
2022	Choi, S., Zeng, H., Chen, Y., Sobczak, F., Qian, C., & Yu, X. (2022). Laminar-specific functional connectivity mapping with multi-slice line-scanning fMRI. Cerebral Cortex, 32(20), 4492-4501. DOI Scopus11 Europe PMC14
2022	Zhou, X. A., Jiang, Y., Napadow, V., & Yu, X. (2022). Challenges and Perspectives of Mapping Locus Coeruleus Activity in the Rodent with High-Resolution fMRI. Brain Sciences, 12(8), 1085. DOI
2022	Zeng, H., Jiang, Y., Beer-Hammer, S., & Yu, X. (2022). Awake Mouse fMRI and Pupillary Recordings in the Ultra-High Magnetic Field. Frontiers in Neuroscience, 16, 886709. DOI Scopus11 Europe PMC18
2022	Pan, L., Hartley, R., Scheerlinck, C., Liu, M., Yu, X., & Dai, Y. (2022). High Frame Rate Video Reconstruction Based on an Event Camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(5), 2519-2533. DOI Scopus68 Europe PMC9
2022	Xu, Y., Yu, X., Zhang, J., Zhu, L., & Wang, D. (2022). Weakly Supervised RGB-D Salient Object Detection with Prediction Consistency Training and Active Scribble Boosting. IEEE Transactions on Image Processing, 31, 2148-2161. DOI Scopus48 Europe PMC4
2022	Zheng, Y., Yu, X., Liu, M., & Zhang, S. (2022). Single-Image Deraining via Recurrent Residual Multiscale Networks. IEEE Transactions on Neural Networks and Learning Systems, 33(3), 1310-1323. DOI Scopus26 Europe PMC6
2022	Zhang, Y., Yu, X., Lu, X., & Liu, P. (2022). Pro-UIGAN: Progressive Face Hallucination From Occluded Thumbnails. IEEE Transactions on Image Processing, 31, 3236-3250. DOI Scopus17 Europe PMC2
2022	Zhang, Y., Tsang, I. W., Luo, Y., Hu, C., Lu, X., & Yu, X. (2022). Recursive Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(8), 4321-4338. DOI Scopus20 Europe PMC3
2022	Klugah-Brown, B., Yu, Y., Hu, P., Agoalikum, E., Liu, C., Liu, X., . . . Biswal, B. (2022). Effect of surgical mask on fMRI signals during task and rest. Communications Biology, 5(1), 1004. DOI Scopus7 Europe PMC6
2022	Wang, S., Li, L., Ding, Y., & Yu, X. (2022). One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning. Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022, 36(3), 2531-2539. DOI Scopus92
2022	Chen, Y., Wang, Q., Choi, S., Zeng, H., Takahashi, K., Qian, C., & Yu, X. (2022). Focal fMRI signal enhancement with implantable inductively coupled detectors. Neuroimage, 247, 118793. DOI Scopus5 Europe PMC11
2022	Fan, H., Zhuo, T., Yu, X., Yang, Y., & Kankanhalli, M. (2022). Understanding Atomic Hand-Object Interaction With Human Intention. IEEE Transactions on Circuits and Systems for Video Technology, 32(1), 275-285. DOI Scopus24
2022	Han, C., Yu, X., Gao, C., Sang, N., & Yang, Y. (2022). Single image based 3D human pose estimation via uncertainty learning. Pattern Recognition, 132, 108934. DOI Scopus29
2021	Ding, Y., Yu, X., & Yang, Y. (2021). Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 2A(2), 1246-1254. DOI Scopus31
2021	Li, L., Wang, S., Zhang, Z., Ding, Y., Zheng, Y., Yu, X., & Fan, C. (2021). Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 3A(3), 1911-1920. DOI Scopus55
2021	Mächler, P., Broggini, T., Mateo, C., Thunemann, M., Fomin-Thunemann, N., Doran, P. R., . . . Devor, A. (2021). A suite of neurophotonic tools to underpin the contribution of internal brain states in fMRI. Current Opinion in Biomedical Engineering, 18, 100273. DOI Scopus5 Europe PMC7
2021	Raimondo, L., Knapen, T., Oliveira, Í. A. F., Yu, X., Dumoulin, S. O., van der Zwaag, W., & Siero, J. C. W. (2021). A line through the brain: implementation of human line-scanning at 7T for ultra-high spatiotemporal resolution fMRI. Journal of Cerebral Blood Flow and Metabolism, 41(11), 2831-2843. DOI Scopus27 Europe PMC26
2021	Chen, X., Jiang, Y., Choi, S., Pohmann, R., Scheffler, K., Kleinfeld, D., & Yu, X. (2021). Assessment of single-vessel cerebral blood velocity by phase contrast fMRI. PLOS Biology, 19(9), e3000923. DOI Scopus18 Europe PMC18
2021	Sobczak, F., Pais-Roldán, P., Takahashi, K., & Yu, X. (2021). Decoding the brain state-dependent relationship between pupil dynamics and resting state fMRI signal fluctuation. eLife, 10, e68980. DOI Scopus13 Europe PMC13
2021	Quan, R., Wu, Y., Yu, X., & Yang, Y. (2021). Progressive transfer learning for face anti-spoofing. IEEE Transactions on Image Processing, 30, 3946-3955. DOI Scopus64 Europe PMC7
2021	Sobczak, F., He, Y., Sejnowski, T. J., & Yu, X. (2021). Predicting the fMRI Signal Fluctuation with Recurrent Neural Networks Trained on Vascular Network Dynamics. Cerebral Cortex, 31(2), 826-844. DOI Scopus9 Europe PMC10
2021	Zhang, Y., Tsang, I. W., Li, J., Liu, P., Lu, X., & Yu, X. (2021). Face Hallucination with Finishing Touches. IEEE Transactions on Image Processing, 30, 1728-1743. DOI Scopus30 Europe PMC4
2021	Pais-Roldán, P., Mateo, C., Pan, W. J., Acland, B., Kleinfeld, D., Snyder, L. H., . . . Keilholz, S. (2021). Contribution of animal models toward understanding resting state functional connectivity. Neuroimage, 245, 118630. DOI Scopus26 Europe PMC30
2021	Xu, Y., Zhou, C., Yu, X., Xiao, B., & Yang, Y. (2021). Pyramidal Multiple Instance Detection Network with Mask Guided Self-Correction for Weakly Supervised Object Detection. IEEE Transactions on Image Processing, 30, 3029-3040. DOI Scopus45 Europe PMC7
2020	Qian, W., Yu, X., & Qian, C. (2020). Wireless Powered Encoding and Broadcasting of Frequency Modulated Detection Signals. IEEE Access, 8, 200450-200460. DOI Europe PMC1
2020	He, Y., Wang, M., & Yu, X. (2020). High spatiotemporal vessel-specific hemodynamic mapping with multi-echo single-vessel fMRI. Journal of Cerebral Blood Flow and Metabolism, 40(10), 2098-2114. DOI Scopus8 Europe PMC11
2020	Yu, X., Fernando, B., Hartley, R., & Porikli, F. (2020). Semantic face hallucination: Super-resolving very low-resolution face images with supplementary attributes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(11), 2926-2943. DOI Scopus43 Europe PMC4
2020	Drew, P. J., Mateo, C., Turner, K. L., Yu, X., & Kleinfeld, D. (2020). Ultra-slow Oscillations in fMRI and Resting-State Connectivity: Neuronal and Vascular Contributions and Technical Confounds. Neuron, 107(5), 782-804. DOI Scopus109 Europe PMC116
2020	Chen, Y., Sobczak, F., Pais-Roldan, P., Schwarz, C., Koretsky, A. P., & Yu, X. (2020). Mapping the Brain-Wide Network Effects by Optogenetic Activation of the Corpus Callosum. Cerebral Cortex, 30(11), 5885-5898. DOI Scopus21 Europe PMC21
2020	Choi, S., Takahashi, K., Jiang, Y., Köhler, S., Zeng, H., Wang, Q., . . . Yu, X. (2020). Real-time fMRI brain mapping in animals. Journal of Visualized Experiments, 2020(163), 1-13. DOI Scopus2 Europe PMC2
2020	Yu, X., Shiri, F., Ghanem, B., & Porikli, F. (2020). Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(9), 2148-2164. DOI Scopus33 Europe PMC6
2020	Qian, W., Yu, X., & Qian, C. (2020). Wireless Reconfigurable RF Detector Array for Focal and Multiregional Signal Enhancement. IEEE Access, 8, 136594-136604. DOI Europe PMC6
2020	Pais-Roldán, P., Takahashi, K., Sobczak, F., Chen, Y., Zhao, X., Zeng, H., . . . Yu, X. (2020). Indexing brain state-dependent pupil dynamics with simultaneous fMRI and optical fiber calcium recording. Proceedings of the National Academy of Sciences of the United States of America, 117(12), 6875-6882. DOI Scopus50 Europe PMC60
2020	Ovsepian, S. V., Jiang, Y., Sardella, T. C. P., Malekzadeh-Najafabadi, J., Burton, N. C., Yu, X., & Ntziachristos, V. (2020). Visualizing cortical response to optogenetic stimulation and sensory inputs using multispectral handheld optoacoustic imaging. Photoacoustics, 17, 100153. DOI Scopus7 Europe PMC9
2020	Handwerker, J., Pérez-Rodas, M., Beyerlein, M., Vincent, F., Beck, A., Freytag, N., . . . Scheffler, K. (2020). A CMOS NMR needle for probing brain physiology with high spatial and temporal resolution. Nature Methods, 17(1), 64-67. DOI Scopus36 Europe PMC15
2020	Yu, X., Porikli, F., Fernando, B., & Hartley, R. (2020). Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks. International Journal of Computer Vision, 128(2), 500-526. DOI Scopus35
2020	Wang, Z., Yu, X., Lu, M., Wang, Q., Qian, C., & Xu, F. (2020). Single image portrait relighting via explicit multiple reflectance channel modeling. ACM Transactions on Graphics, 39(6), 1-13. DOI Scopus82
2019	Yan, H., Yu, X., Zhang, Y., Zhang, S., Zhao, X., & Zhang, L. (2019). Single Image Depth Estimation with Normal Guided Scale Invariant Deep Convolutional Fields. IEEE Transactions on Circuits and Systems for Video Technology, 29(1), 80-92. DOI Scopus28
2019	Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2019). Identity-Preserving Face Recovery from Stylized Portraits. International Journal of Computer Vision, 127(6-7), 863-883. DOI Scopus20
2019	Chen, X., Sobczak, F., Chen, Y., Jiang, Y., Qian, C., Lu, Z., . . . Yu, X. (2019). Mapping optogenetically-driven single-vessel fMRI with concurrent neuronal calcium recordings in the rat hippocampus. Nature Communications, 10(1), 5239. DOI Scopus37 Europe PMC45
2019	Chen, Y., Pais-Roldan, P., Chen, X., Frosz, M. H., & Yu, X. (2019). MRI-guided robotic arm drives optogenetic fMRI with concurrent Ca2+ recording. Nature Communications, 10(1), 2536. DOI Scopus26 Europe PMC24
2019	Pais-Roldán, P., Edlow, B. L., Jiang, Y., Stelzer, J., Zou, M., & Yu, X. (2019). Multimodal assessment of recovery from coma in a rat model of diffuse brainstem tegmentum injury. Neuroimage, 189, 615-630. DOI Scopus26 Europe PMC28
2018	Pais-Roldán, P., Biswal, B., Scheffler, K., & Yu, X. (2018). Identifying respiration-related aliasing artifacts in the rodent resting-state fMRI. Frontiers in Neuroscience, 12(NOV), 788. DOI Scopus22 Europe PMC26
2018	Yu, X., & Porikli, F. (2018). Imagining the Unimaginable Faces by Deconvolutional Networks. IEEE Transactions on Image Processing, 27(6), 2747-2761. DOI Scopus31 Europe PMC7
2018	Wang, M., He, Y., Sejnowski, T. J., & Yu, X. (2018). Brain-state dependent astrocytic Ca2+ signals are coupled to both positive and negative BOLD-fMRI signals. Proceedings of the National Academy of Sciences of the United States of America, 115(7), E1647-E1656. DOI Scopus83 Europe PMC84
2018	He, Y., Wang, M., Chen, X., Pohmann, R., Polimeni, J. R., Scheffler, K., . . . Yu, X. (2018). Ultra-Slow Single-Vessel BOLD and CBV-Based fMRI Spatiotemporal Dynamics and Their Correlation with Neuronal Intracellular Calcium Signals. Neuron, 97(4), 925-939.e5. DOI Scopus96 Europe PMC102
2018	Li, L., Zhang, S., Yu, X., & Zhang, L. (2018). PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching. IEEE Transactions on Circuits and Systems for Video Technology, 28(3), 679-692. DOI Scopus81
2017	Li, L., Yu, X., Zhang, S., Zhao, X., & Zhang, L. (2017). 3D Cost aggregation with multiple minimum spanning trees for stereo matching. Applied Optics, 56(12), 3411-3420. DOI Scopus80 Europe PMC8
2017	Chung, S., Jeong, J. H., Ko, S., Yu, X., Kim, Y. H., Isaac, J. T. R., & Koretsky, A. P. (2017). Peripheral Sensory Deprivation Restores Critical-Period-like Plasticity to Adult Somatosensory Thalamocortical Inputs. Cell Reports, 19(13), 2707-2717. DOI Scopus26 Europe PMC24
2016	Pais-Roldán, P., Singh, A. P., Schulz, H., & Yu, X. (2016). High magnetic field induced otolith fusion in the zebrafish larvae. Scientific Reports, 6. DOI Scopus15 Europe PMC6
2016	Yu, X., He, Y., Wang, M., Merkle, H., Dodd, S. J., Silva, A. C., & Koretsky, A. P. (2016). Sensory and optogenetically driven single-vessel fMRI. Nature Methods, 13(4), 337-340. DOI Scopus84 Europe PMC93
2015	Sui, Y., Zhao, X., Zhang, S., Yu, X., Zhao, S., & Zhang, L. (2015). Self-expressive tracking. Pattern Recognition, 48(9), 2872-2884. DOI Scopus12
2015	Zhang, S., Sui, Y., Zhao, S., Yu, X., & Zhang, L. (2015). Multi-local-task learning with global regularization for object tracking. Pattern Recognition, 48(12), 3881-3894. DOI Scopus23
2015	Zhang, S., Sui, Y., Yu, X., Zhao, S., & Zhang, L. (2015). Hybrid support vector machines for robust object tracking. Pattern Recognition, 48(8), 2474-2488. DOI Scopus32
2015	Zhang, S., Yu, X., Sui, Y., Zhao, S., & Zhang, L. (2015). Object tracking with multi-view support vector machines. IEEE Transactions on Multimedia, 17(3), 265-278. DOI Scopus113
2015	Yu, X., Zhang, S., Zhao, X., & Zhang, L. (2015). Removing blur kernel noise via a hybrid lp norm. Journal of Electronic Imaging, 24(1). DOI Scopus4
2014	Yu, X., Xu, F., Zhang, S., & Zhang, L. (2014). Efficient patch-wise non-uniform deblurring for a single image. IEEE Transactions on Multimedia, 16(6), 1510-1524. DOI Scopus50
2014	Qian, C., Yu, X., Pothayee, N., Dodd, S., Bouraoud, N., Star, R., . . . Koretsky, A. (2014). Live nephron imaging by MRI. American Journal of Physiology Renal Physiology, 307(10), F1162-F1168. DOI Scopus17 Europe PMC17
2014	Yu, X., Zhao, X., Sui, Y., & Zhang, L. (2014). Handling noise in single image defocus map estimation by using directional filters. Optics Letters, 39(21), 6281-6284. DOI Scopus4 Europe PMC2
2014	Yu, X., & Koretsky, A. P. (2014). Interhemispheric plasticity protects the deafferented somatosensory cortex from functional takeover after nerve injury. Brain Connectivity, 4(9), 709-717. DOI Scopus16 Europe PMC14
2014	Yu, X., Qian, C., Chen, D. Y., Dodd, S. J., & Koretsky, A. P. (2014). Deciphering laminar-specific neural inputs with line-scanning fMRI. Nature Methods, 11(1), 55-58. DOI Scopus137 Europe PMC139
2013	Qian, C., Yu, X., Chen, D. Y., Dodd, S., Bouraoud, N., Pothayee, N., . . . Koretsky, A. (2013). Wireless amplified nuclear MR detector (WAND) for high-spatial-resolution MR imaging of internal organs: Preclinical demonstration in a rodent model. Radiology, 268(1), 228-236. DOI Scopus37 Europe PMC35
2012	Yu, X., Chung, S., Chen, D. Y., Wang, S., Dodd, S. J., Walters, J. R., . . . Koretsky, A. P. (2012). Thalamocortical Inputs Show Post-Critical-Period Plasticity. Neuron, 74(4), 731-742. DOI Scopus64 Europe PMC66
2012	Yu, X., Glen, D., Wang, S., Dodd, S., Hirano, Y., Saad, Z., . . . Koretsky, A. P. (2012). Direct imaging of macrovascular and microvascular contributions to BOLD fMRI in layers IV-V of the rat whisker-barrel cortex. Neuroimage, 59(2), 1451-1460. DOI Scopus87 Europe PMC90
2011	Zhao, X., Yu, X., Sun, L., Hu, K., Wang, G., & Zhang, L. (2011). Non-rigid object tracking as salient region segmentation and association. IEICE Transactions on Information and Systems, E94-D(4), 934-937. DOI Scopus2
2010	Yu, X., Wang, S., Chen, D. Y., Dodd, S., Goloshevsky, A., & Koretsky, A. P. (2010). 3D mapping of somatotopic reorganization with small animal functional MRI. Neuroimage, 49(2), 1667-1676. DOI Scopus35 Europe PMC29
2022	Tang, T., Du, H., Yu, X., & Yang, Y. (2022). Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion. Proceedings of the AAAI Conference on Artificial Intelligence, 36(5), 5422-5430. DOI
2017	Yu, X., & Porikli, F. (2017). Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). DOI

Book Chapters

Year	Citation
2025	Zhang, H., Xu, J., Tang, T., Sun, H., Yu, X., Huang, Z., & Yu, K. (2025). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. In Lecture Notes in Computer Science (Vol. 15142 LNCS, pp. 1-19). Springer Nature Switzerland. DOI
2024	Yu, Q., Du, H., & Yu, X. (2024). A New Perspective of Weakly Supervised 3D Instance Segmentation via Bounding Boxes. In Lecture Notes in Computer Science (Vol. 14471 LNAI, pp. 103-114). Springer Nature Singapore. DOI Scopus1
2024	Sheng, H., Yu, X., Li, X., & Golzan, M. (2024). Context-Based Masking for Spontaneous Venous Pulsations Detection. In Lecture Notes in Computer Science (Vol. 14471 LNAI, pp. 520-532). Springer Nature Singapore. DOI Scopus1
2024	Xu, Q., Du, H., Chen, H., Liu, B., & Yu, X. (2024). MMOOC: A Multimodal Misinformation Dataset for Out-of-Context News Analysis. In Lecture Notes in Computer Science (Vol. 14897 LNCS, pp. 444-459). Springer Nature Singapore. DOI Scopus4
2024	Du, H., Huang, Z., Chapman, S., & Yu, X. (2024). Toward a Unified Framework for RGB and RGB-D Visual Navigation. In Lecture Notes in Computer Science (Vol. 14472 LNAI, pp. 363-375). Springer Nature Singapore. DOI Scopus2
2024	Yuan, B., Wang, Z., & Yu, X. (2024). Towards Reliable and Efficient Vegetation Segmentation for Australian Wheat Data Analysis. In Lecture Notes in Computer Science (pp. 119-135). Springer Nature Switzerland. DOI
2023	Cai, J., Nguyen, K. N., Shrestha, N., Good, A., Tu, R., Yu, X., . . . Serra, T. (2023). Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions. In Lecture Notes in Computer Science (Vol. 13884 LNCS, pp. 200-218). Springer Nature Switzerland. DOI Scopus3
2023	Shi, Y., Yu, X., Wang, S., & Li, H. (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. In Lecture Notes in Computer Science (Vol. 13841 LNCS, pp. 123-141). Springer Nature Switzerland. DOI Scopus12
2023	Wang, M., Lin, B., Guo, X., Li, L., Zhu, Z., Sun, J., . . . Yu, X. (2023). GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework. In Lecture Notes in Computer Science (Vol. 13844 LNCS, pp. 711-727). Springer Nature Switzerland. DOI Scopus5
2023	Zhou, X. A., Jiang, Y., Man, W., & Yu, X. (2023). Multimodal methods to help interpret resting-state fMRI. In Advances in Resting-State Functional MRI (pp. 207-235). Elsevier. DOI
2023	Fu, H., Liu, C., Qi, X., Lin, B., Li, L., Zhang, L., & Yu, X. (2023). Sign Spotting via Multi-modal Fusion and Testing Time Transferring. In Lecture Notes in Computer Science (Vol. 13808 LNCS, pp. 271-287). Springer Nature Switzerland. DOI Scopus4
2023	Lee, H. H., Liu, Q., Bao, S., Yang, Q., Yu, X., Cai, L. Y., . . . Landman, B. A. (2023). Scaling up 3D Kernels with Bayesian Frequency Re-parameterization for Medical Image Segmentation. In Lecture Notes in Computer Science (Vol. 14223 LNCS, pp. 632-641). Springer Nature Switzerland. DOI Scopus9
2022	Zhu, F., Yang, Z., Yu, X., Yang, Y., & Wei, Y. (2022). Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation. In Lecture Notes in Computer Science (Vol. 13689 LNCS, pp. 524-540). Springer Nature Switzerland. DOI Scopus4
2022	Zeng, H., Yu, X., Miao, J., & Yang, Y. (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. In Lecture Notes in Computer Science (Vol. 13662 LNCS, pp. 1-17). Springer Nature Switzerland. DOI Scopus7
2021	Liu, J., & Yu, X. (2021). Few-shot Weighted Style Matching for Glaucoma Detection. In Lecture Notes in Computer Science (Vol. 13069 LNAI, pp. 289-300). Springer International Publishing. DOI Scopus1
2020	Liu, J., Zou, Z., Ye, X., Tan, X., Ding, E., Xu, F., & Yu, X. (2020). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. In Lecture Notes in Computer Science (Vol. 12536 LNCS, pp. 707-714). Springer International Publishing. DOI Scopus8
2020	Du, H., Yu, X., & Zheng, L. (2020). Learning Object Relation Graph and Tentative Policy for Visual Navigation. In Lecture Notes in Computer Science (Vol. 12352 LNCS, pp. 19-34). Springer International Publishing. DOI Scopus103
2018	Yu, X., Fernando, B., Ghanem, B., Porikli, F., & Hartley, R. (2018). Face super-resolution guided by facial component heatmaps. In Lecture Notes in Computer Science (Vol. 11213 LNCS, pp. 219-235). Springer International Publishing. DOI Scopus48
2017	Yu, X. (2017). When Photons Meet Protons: Optogenetics, Calcium Signal Detection, and fMRI in Small Animals. In Small Animal Imaging (pp. 773-791). Springer International Publishing. DOI
2016	Yu, X., & Porikli, F. (2016). Ultra-resolving face images by discriminative generative networks. In Lecture Notes in Computer Science (Vol. 9909 LNCS, pp. 318-333). DOI Scopus248

Conference Papers

Year	Citation
2025	Hu, Z., Zhang, Y., Liu, C., Li, L., Peng, S., Zhou, X., . . . Yu, X. (2025). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15131 LNCS (pp. 223-239). Springer Nature Switzerland. DOI
2025	He, M., Zhang, J., & Yu, X. (2025). Transferable Attacks for Semantic Segmentation. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15449 LNCS (pp. 372-388). Springer Nature Singapore. DOI
2025	Liu, C., Li, P., Yang, L., Wang, D., Li, L., & Yu, X. (2025). Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 28922-28931). IEEE. DOI
2025	Wang, S., Zhang, H., Shen, X., Wang, D., & Yu, X. (2025). Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 22975-22984). IEEE. DOI
2025	Liu, C., Zhang, W., Qiu, F., Li, L., Wang, D., & Yu, X. (2025). Affective Behaviour Analysis via Progressive Learning. In Lecture Notes in Computer Science Vol. 15637 LNCS (pp. 366-379). Springer Nature Switzerland. DOI
2025	Xu, Q., Cao, R., Shen, X., Du, H., Wang, S., & Yu, X. (2025). M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12289-12300). IEEE. DOI
2025	Wang, S., Chen, W., Zhang, W., Zhao, M., Li, L., Zhang, R., . . . Yu, X. (2025). EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 5581-5591). IEEE. DOI
2025	Liu, C., Qiu, F., Zhang, W., Li, L., Wang, D., & Yu, X. (2025). Compound Expression Recognition via Curriculum Learning. In Lecture Notes in Computer Science Vol. 15637 LNCS (pp. 282-293). Springer Nature Switzerland. DOI
2025	Liu, C., Yang, L., Li, P., Wang, D., Li, L., & Yu, X. (2025). Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3131-3141). IEEE. DOI
2025	Shen, X., Du, H., Xu, M., Liu, M., & Yu, X. (2025). Cross-View Isolated Sign Language Recognition Challenge: Design, Results and Future Research. In WWW Companion 2025: Companion Proceedings of the ACM Web Conference 2025 (pp. 2444-2447). ACM. DOI
2025	Xu, Q., Du, H., Łukasik, S., Zhu, T., Wang, S., & Yu, X. (2025). MDAM3: A Misinformation Detection and Analysis Framework for Multitype Multimodal Media. In WWW 2025: Proceedings of the ACM Web Conference (pp. 5285-5296). ACM. DOI
2025	Guo, T., Du, H., Huo, H., Liu, B., & Yu, X. (2025). Who is Being Impersonated? Deepfake Audio Detection and Impersonated Identification via Extraction of Id-Specific Features. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15255 LNCS (pp. 301-320). Springer Nature Singapore. DOI Scopus1
2025	Ying, J., Shen, X., & Yu, X. (2025). Vision-Based Abnormal Action Dataset for Recognising Body Motion Disorders. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15443 LNAI (pp. 443-455). Springer Nature Singapore. DOI Scopus1
2025	Zhang, B., Cao, Z., Du, H., Yu, X., Li, X., Liu, J., & Wang, S. (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 4957-4967). IEEE. DOI
2025	Cao, Z., Zhang, B., Du, H., Yu, X., Li, X., & Wang, S. (2025). FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 9226-9236). IEEE. DOI
2024	Yu, X., Tang, Y., Yang, Q., Lee, H. H., Bao, S., Huo, Y., & Landman, B. A. (2024). Enhancing Hierarchical Transformers for Whole Brain Segmentation with Intracranial Measurements Integration. In B. S. Gimi, & A. Krol (Eds.), Proceedings of SPIE, the International Society for Optical Engineering Vol. 12930 (pp. 129300K). United States: SPIE. DOI Europe PMC1
2024	Qiu, F., Zhang, W., Liu, C., Li, L., Du, H., Guo, T., & Yu, X. (2024). Language-guided Multi-modal Emotional Mimicry Intensity Estimation. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4742-4751). IEEE. DOI Scopus3
2024	Tang, J., Li, L., Qi, X., Chen, Y., Fan, C., & Yu, X. (2024). AS-NeRF: Learning Auxiliary Sampling for Generalizable Novel View Synthesis from Sparse Views. In Proceedings IEEE International Conference on Multimedia and Expo (pp. 1-6). IEEE. DOI
2024	Qiu, F., Du, H., Zhang, W., Liu, C., Li, L., Guo, T., & Yu, X. (2024). Learning Transferable Compound Expressions from Masked AutoEncoder Pretraining. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4733-4741). IEEE. DOI Scopus2
2024	Zhang, W., Qiu, F., Liu, C., Li, L., Du, H., Guo, T., & Yu, X. (2024). An Effective Ensemble Learning Framework for Affective Behaviour Analysis. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4761-4772). IEEE. DOI Scopus6
2024	Wei, T., Chen, Z., Huang, Z., & Yu, X. (2024). Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline. In MM 2024: Proceedings of the 32nd ACM International Conference on Multimedia (pp. 1593-1601). ACM. DOI Scopus3
2024	Qiu, F., Zhang, W., Liu, C., An, R., Li, L., Ding, Y., . . . Yu, X. (2024). FreeAvatar: Robust 3D Facial Animation Transfer by Learning an Expression Foundation Model. In Proceedings SIGGRAPH Asia 2024 Conference Papers SA 2024 (pp. 1-11). ACM. DOI Scopus3
2024	Wei, T., Chen, Z., & Yu, X. (2024). Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild. In Proceedings of the 6th ACM International Conference on Multimedia in Asia MMAsia 2024 (pp. 1-3). ACM. DOI
2024	Hu, Z., Zhao, M., Zhao, C., Liang, X., Li, L., Zhao, Z., . . . Yu, X. (2024). EfficientDreamer: High-Fidelity and Stable 3D Creation via Orthogonal-view Diffusion Priors. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4949-4958). IEEE. DOI Scopus8
2024	Shiri, F., Guo, X. Y., Far, M. G., Yu, X., Haffari, G., & Li, Y. F. (2024). An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models. In EMNLP 2024: 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 21440-21455). Association for Computational Linguistics. DOI Scopus1
2024	Yu, Q., Du, H., Liu, C., & Yu, X. (2024). When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision. In Proceedings 2024 IEEE Winter Conference on Applications of Computer Vision WACV 2024 (pp. 3707-3716). IEEE. DOI Scopus4
2024	Dong, G., Wang, H., Sun, J., & Wang, X. (2024). Evaluating and Mitigating Linguistic Discrimination in Large Language Models: Perspectives on Safety Equity and Knowledge Equity. In Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (pp. 348-356). International Joint Conferences on Artificial Intelligence Organization. DOI
2024	Liu, C., Li, P. P., Yu, Q., Sheng, H., Wang, D., Li, L., & Yu, X. (2024). Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 22712-22722). IEEE. DOI Scopus1
2024	Chen, H., Zhu, T., Yu, X., & Zhou, W. (2024). Machine Unlearning via Null Space Calibration. In IJCAI International Joint Conference on Artificial Intelligence (pp. 358-366). Scopus5
2024	Shen, X., Du, H., Sheng, H., Wang, S., Chen, H., Chen, H., . . . Yu, X. (2024). MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset. In Advances in Neural Information Processing Systems Vol. 37. Scopus1
2024	Chen, H., Liu, Y., Ma, Y., Zheng, N., & Yu, X. (2024). TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning. In Advances in Neural Information Processing Systems Vol. 37. Scopus1
2024	Lim, J. S., Chen, Z., Baktashmotlagh, M., Chen, Z., Yu, X., Huang, Z., & Luo, Y. (2024). DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection. In Advances in Neural Information Processing Systems Vol. 37. Scopus2
2024	Wu, Y., Meng, Y., Hu, Z., Li, L., Wu, H., Zhou, K., . . . Yu, X. (2024). Text-Guided 3D Face Synthesis - From Generation to Editing. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1260-1269). IEEE. DOI Scopus7
2023	Du, H., Li, L., Huang, Z., & Yu, X. (2023). Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 2563-2573). IEEE. DOI Scopus24
2023	Shen, C., Lin, B., Zhang, S., Yu, X., Huang, G. Q., & Yu, S. (2023). Gait Recognition with Mask-based Regularization. In 2023 IEEE International Joint Conference on Biometrics IJCB 2023 (pp. 1-10). IEEE. DOI Scopus5
2023	Zhang, Y., Wang, Z., Luo, Y., Yu, X., & Huang, Z. (2023). Learning Efficient Unsupervised Satellite Image-based Building Damage Detection. In Proceedings IEEE International Conference on Data Mining ICDM (pp. 1547-1552). IEEE. DOI Scopus2
2023	Wu, H., Hu, Z., Li, L., Zhang, Y., Fan, C., & Yu, X. (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 4295-4304). IEEE. DOI Scopus33
2023	Tang, J., Li, L., Hou, J., Xin, H., & Yu, X. (2023). A Divide-and-conquer Solution to 3D Human Motion Estimation from Raw MoCap Data. In 2023 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW) (pp. 767-768). IEEE. DOI
2023	Zhao, Y., Liu, B., Ding, M., Liu, B., Zhu, T., & Yu, X. (2023). Proactive Deepfake Defence via Identity Watermarking. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 4591-4600). IEEE. DOI
2023	Du, H., Yu, X., Hussain, F., Armin, M. A., Petersson, L., & Li, W. (2023). Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 4260-4269). IEEE. DOI
2023	Rao, Q., Yu, X., Navasardyan, S., & Shi, H. (2023). Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 5395-5404). IEEE. DOI
2023	Qi, X., Liu, C., Sun, M., Li, L., Fan, C., & Yu, X. (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 4616-4626). IEEE. DOI Scopus15
2023	Wang, M., Guo, X., Lin, B., Yang, T., Zhu, Z., Li, L., . . . Yu, X. (2023). DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition. In Proceedings of the IEEE International Conference on Computer Vision (pp. 13378-13387). IEEE. DOI Scopus42
2023	Liu, C., Li, P. P., Qi, X., Zhang, H., Li, L., Wang, D., & Yu, X. (2023). Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics. In MM 2023: Proceedings of the 31st ACM International Conference on Multimedia (pp. 7590-7598). ACM. DOI Scopus31
2023	Khan, M. W., Sheng, H., Zhang, H., Du, H., Wang, S., Coroneo, M. T., . . . Yu, X. (2023). RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation. In Advances in Neural Information Processing Systems Vol. 36. Scopus6
2023	Liu, P., Yu, X., & Zhou, J. T. (2023). Meta Knowledge Condensation for Federated Learning. In 11th International Conference on Learning Representations ICLR 2023. Scopus7
2023	Liu, B., Liu, B., Ding, M., Zhu, T., & Yu, X. (2023). TI²Net: Temporal Identity Inconsistency Network for Deepfake Detection. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). IEEE. DOI
2023	Shen, X., Yuan, S., Sheng, H., Du, H., & Yu, X. (2023). Auslan-Daily: Australian Sign Language Translation for Daily Communication and News. In Advances in Neural Information Processing Systems Vol. 36. Scopus19
2023	Luo, Y., Chen, Z., Wang, Z., Yu, X., Huang, Z., & Baktashmotlagh, M. (2023). Exploring Active 3D Object Detection from a Generalization Perspective. In 11th International Conference on Learning Representations ICLR 2023. Scopus19
2022	Li, S., Phillips, J. M., Yu, X., Kirby, R. M., & Zhe, S. (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. In Advances in Neural Information Processing Systems Vol. 35. Scopus11
2022	Yao, G., Wu, H., Yuan, Y., Li, L., Zhou, K., & Yu, X. (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1566-1572). International Joint Conferences on Artificial Intelligence Organization. DOI Scopus4
2021	Li, P., Yu, X., & Yang, Y. (2021). Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4449-4459). IEEE. DOI Scopus2
2021	Zeng, H., Dai, Y., Yu, X., Wang, X., & Yang, Y. (2021). PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion. In Proceedings of the IEEE International Conference on Computer Vision (pp. 5580-5589). IEEE. DOI Scopus10
2021	Yu, X., Van Baar, J., & Chen, S. (2021). Joint 3D Human Shape Recovery and Pose Estimation from a Single Image with Bilayer Graph. In Proceedings 2021 International Conference on 3D Vision 3DV 2021 (pp. 505-514). IEEE. DOI Scopus5
2021	Wang, S., Li, L., Ding, Y., Fan, C., & Yu, X. (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1098-1105). International Joint Conferences on Artificial Intelligence Organization. DOI Scopus59
2021	Ding, Y., Yu, X., & Yang, Y. (2021). RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 3955-3964). IEEE. DOI Scopus135
2021	Ben-Shabat, Y., Yu, X., Saleh, F., Campbell, D., Rodriguez Opazo, C., Li, H., & Gould, S. (2021). The IKEA ASM Dataset: Understanding people assembling furniture through actions, objects and pose. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV 2021) (pp. 846-858). virtual online: IEEE. DOI Scopus93 WoS81
2021	Fan, H., Yu, X., Ding, Y., Yang, Y., & Kankanhalli, M. (2021). PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences. In ICLR 2021 9th International Conference on Learning Representations. Scopus67
2021	Du, H., Yu, X., & Zheng, L. (2021). VTNet: Visual Transformer Network for Object Goal Navigation. In ICLR 2021 9th International Conference on Learning Representations. Scopus40
2021	Tang, T., Yu, X., Dong, X., & Yang, Y. (2021). Auto-Navigator: Decoupled neural architecture search for visual navigation. In Proceedings 2021 IEEE Winter Conference on Applications of Computer Vision WACV 2021 (pp. 3742-3751). IEEE. DOI Scopus9
2021	Quan, R., Yu, X., Liang, Y., & Yang, Y. (2021). Removing Raindrops and Rain Streaks in One Go. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9143-9152). IEEE. DOI Scopus158
2021	Shi, Y., Li, H., & Yu, X. (2021). Self-Supervised Visibility Learning for Novel View Synthesis. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9670-9679). IEEE. DOI Scopus15
2021	Li, D., Xu, C., Zhang, K., Yu, X., Zhong, Y., Ren, W., . . . Li, H. (2021). ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7717-7727). IEEE. DOI Scopus56
2021	Lin, B., Zhang, S., & Yu, X. (2021). Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 14628-14636). IEEE. DOI Scopus266
2021	Yang, Z., Yu, X., & Yang, Y. (2021). DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3906-3915). IEEE. DOI Scopus45
2021	Zhuang, Z., Yu, X., & Mahony, R. (2021). End-to-end Multi-Instance Robotic Reaching from Monocular Vision. In Proceedings IEEE International Conference on Robotics and Automation Vol. 2021-May (pp. 12974-12980). IEEE. DOI Scopus1
2021	Kennedy, G., Gao, J., Zhuang, Z., Yu, X., & Mahony, R. (2021). A General Approach to State Refinement. In IEEE International Conference on Intelligent Robots and Systems (pp. 8985-8991). IEEE. DOI
2021	Zhang, J., Fan, D. P., Dai, Y., Yu, X., Zhong, Y., Barnes, N., & Shao, L. (2021). RGB-D Saliency Detection via Cascaded Mutual Information Minimization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4318-4327). IEEE. DOI Scopus109
2020	Li, D., Yu, X., Xu, C., Petersson, L., & Li, H. (2020). Transferring Cross-Domain Knowledge for Video Sign Language Recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 6204-6213). IEEE. DOI Scopus121
2020	Zhuang, Z., Yu, X., & Mahony, R. (2020). LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision. In 2020 IEEE International Conference on Robotics and Automation (ICRA) (pp. 8331-8337). IEEE. DOI
2020	Zheng, Z., Jiang, M., Wang, Z., Wang, J., Bai, Z., Zhang, X., . . . Ding, E. (2020). Going beyond real data: A robust visual representation for vehicle re-identification. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Vol. 2020-June (pp. 2550-2558). IEEE. DOI Scopus49
2020	Shi, Y., Yu, X., Campbell, D., & Li, H. (2020). Where am I looking at? Joint location and orientation estimation by cross-view matching. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4063-4071). IEEE. DOI Scopus186
2020	Zhang, J., Yu, X., Li, A., Song, P., Liu, B., & Dai, Y. (2020). Weakly-Supervised Salient Object Detection via Scribble Annotations. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12543-12552). IEEE. DOI Scopus293
2020	Yu, X., Zhuang, Z., Koniusz, P., & Li, H. (2020). 6DoF Object Pose Estimation via Differentiable Proxy Voting Regularizer. In 31st British Machine Vision Conference BMVC 2020. Scopus18
2020	Li, D., Xu, C., Yu, X., Zhang, K., Swift, B., Suominen, H., & Li, H. (2020). TSPNet: Hierarchical feature learning via temporal semantic pyramid for sign language translation. In Advances in Neural Information Processing Systems Vol. 2020-December. Scopus105
2020	Li, P., Dong, X., Yu, X., & Yang, Y. (2020). When Humans Meet Machines: Towards Efficient Segmentation Networks. In 31st British Machine Vision Conference BMVC 2020. Scopus27
2020	Li, D., Rodriguez Opazo, C., Yu, X., & Li, H. (2020). Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison. In Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (pp. 1448-1458). Snowmass Village, CO, USA: IEEE. DOI Scopus446 WoS309
2020	Zhang, Y., Tsang, I. W., Luo, Y., Hu, C. -H., Lu, X., & Yu, X. (2020). Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 7353-7362). IEEE. DOI
2019	Pan, L., Scheerlinck, C., Yu, X., Hartley, R., Liu, M., & Dai, Y. (2019). Bringing a blurry frame alive at high frame-rate with an event camera. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 6813-6822). IEEE. DOI Scopus246
2019	Yu, X., Tian, Y., Porikli, F., Hartley, R., Li, H., Heijnen, H., & Balntas, V. (2019). Unsupervised extraction of local image descriptors via relative distance ranking loss. In Proceedings 2019 International Conference on Computer Vision Workshop ICCVW 2019 (pp. 2893-2902). IEEE. DOI Scopus23
2019	Tian, Y., Yu, X., Fan, B., Wu, F., Heijnen, H., & Balntas, V. (2019). SOSNet: Second order similarity regularization for local descriptor learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 11008-11017). IEEE. DOI Scopus358
2019	Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2019). Recovering faces from portraits with auxiliary facial attributes. In Proceedings 2019 IEEE Winter Conference on Applications of Computer Vision WACV 2019 (pp. 406-415). IEEE. DOI Scopus12
2018	Yu, X., Yu, Z., & Ramalingam, S. (2018). Learning Strict Identity Mappings in Deep Residual Networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4432-4440). IEEE. DOI Scopus60
2018	Yu, X., Fernando, B., Hartley, R., & Porikli, F. (2018). Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 908-917). DOI Scopus170
2018	Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2018). Identity-preserving face recovery from portraits. In Proceedings 2018 IEEE Winter Conference on Applications of Computer Vision WACV 2018 Vol. 2018-January (pp. 102-111). IEEE. DOI Scopus16
2017	Yu, X., & Porikli, F. (2017). Hallucinating very low-resolution unaligned and noisy face images by transformative discriminative autoencoders. In Proceedings 30th IEEE Conference on Computer Vision and Pattern Recognition CVPR 2017 Vol. 2017-January (pp. 5367-5375). DOI Scopus122
2017	Shiri, F., Yu, X., Koniusz, P., & Porikli, F. (2017). Face Destylization. In DICTA 2017: 2017 International Conference on Digital Image Computing Techniques and Applications Vol. 2017-December (pp. 1-8). DOI Scopus14

Email: xin.yu@adelaide.edu.au
