Xin Yu

Australian Institute for Machine Learning - Projects

Faculty of Sciences, Engineering and Technology


I am Xin Yu, an Associate Professor at the University of Adelaide (Dec 2025 – present). My research focuses on computer vision and machine learning, with a particular interest in enabling technologies that improve accessibility and understanding through visual intelligence. I received a PhD in Computer Science from the Australian National University and a PhD in Communication and Information Engineering from Tsinghua University. I am currently a Visiting Faculty Researcher at Google (2024 – present) and lead the Visual Intelligence Group at the Australian Institute for Machine Learning (AIML).
 
I have received several prestigious awards, including the Australian Research Council (ARC) Discovery Early Career Researcher Award (DECRA, 2023–2025), the Google Research Scholar Program Award (2021), and the Google Inclusion Research Award (2023). I was also honoured with the Queensland Young Tall Poppy Science Award from the Australian Institute of Policy and Science (AIPS) and received the CORE Outstanding Research Contribution Award 2026 from the Computing Research and Education Association of Australasia (CORE).

  • Journals

    Year Citation
    2025 Du, X., Sun, H., Lu, M., Zhu, T., & Yu, X. (2025). DreamCar: Leveraging Car-Specific Prior for In-the-Wild 3D Car Reconstruction. IEEE Robotics and Automation Letters, 10(2), 1840-1847.
    DOI Scopus1
    2025 Ma, Y., Wang, S., Ding, Y., Ma, B., Lv, T., Fan, C., . . . Yu, X. (2025). TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles. IEEE Transactions on Multimedia, 27, 6335-6346.
    DOI
    2025 Wu, H., Zhao, M., Hu, Z., Fan, C., Li, L., Chen, W., . . . Yu, X. (2025). ICE: Interactive 3D Game Character Facial Editing via Dialogue. IEEE Transactions on Multimedia, 27, 3210-3223.
    DOI
    2025 Jiang, W., Zhao, D., Wang, C., Yu, X., Arun, P. V., Asano, Y., . . . Zhou, H. (2025). Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network. Knowledge Based Systems, 330, 114595.
    DOI
    2025 Li, Z., Liu, S., Yu, X., Bhavya, K., Cao, J., Diffenderfer, J. D., . . . Pascucci, V. (2025). "Understanding Robustness Lottery": A Geometric Visual Comparative Analysis of Neural Network Pruning Approaches. IEEE Transactions on Visualization and Computer Graphics, 31(9), 6337-6352.
    DOI
    2024 Liu, C., Li, P., Zhang, H., Li, L., Huang, Z., Wang, D., & Yu, X. (2024). BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge. IEEE Transactions on Multimedia, 26, 10015-10028.
    DOI Scopus14
    2024 Zhang, W., Li, L., Ding, Y., Chen, W., Deng, Z., & Yu, X. (2024). Detecting Facial Action Units From Global-Local Fine-Grained Expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34(2), 983-994.
    DOI Scopus10
    2024 Chen, D. Y., Di, X., Yu, X., & Biswal, B. B. (2024). The significance and limited influence of cerebrovascular reactivity on age and sex effects in task- and resting-state brain activity. Cerebral Cortex, 34(2).
    DOI Scopus3
    2024 Hu, Z., Tang, J., Li, L., Hou, J., Xin, H., Yu, X., & Bu, J. (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35(1).
    DOI Scopus3
    2024 Fu, H., Yu, X., Li, L., & Zhang, L. (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 26, 9304-9315.
    DOI Scopus3
    2024 Rao, Q., Yu, X., Li, G., & Zhu, L. (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238, 103864.
    DOI Scopus5
    2024 Qi, X., Liu, C., Li, L., Hou, J., Xin, H., & Yu, X. (2024). EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation. IEEE Transactions on Multimedia, 26, 10420-10430.
    DOI Scopus10
    2024 Zhao, Y., Liu, B., Zhu, T., Ding, M., Yu, X., & Zhou, W. (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585, 127593.
    DOI Scopus21
    2024 Song, X., Liu, C., Zheng, Y., Feng, Z., Li, L., Zhou, K., & Yu, X. (2024). HairStyle Editing via Parametric Controllable Strokes. IEEE Transactions on Visualization and Computer Graphics, 30(7), 3857-3870.
    DOI Scopus4
    2024 Xu, Q., Chen, H., Du, H., Zhang, H., Łukasik, S., Zhu, T., & Yu, X. (2024). M3A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249, 104205.
    DOI Scopus5
    2024 Choi, S., Hike, D., Pohmann, R., Avdievich, N., Gomez-Cid, L., Man, W., . . . Yu, X. (2024). Alpha-180 spin-echo-based line-scanning method for high-resolution laminar-specific fMRI in animals. Imaging Neuroscience (Cambridge, Mass.), 2, imag-2-00120.
    DOI Europe PMC1
    2024 Jiang, Y., Pais-Roldán, P., Pohmann, R., & Yu, X. (2024). High Spatiotemporal Resolution Radial Encoding Single-Vessel fMRI. Advanced Science, 11(26), e2309218.
    DOI Scopus1 Europe PMC4
    2024 Sheng, H., Shen, X., Du, H., Zhang, H., Huang, Z., & Yu, X. (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4(4), 877-887.
    DOI
    2024 Yu, X., Yu, X., Yang, Q., Tang, Y., Gao, R., Bao, S., . . . Landman, B. A. (2024). Deep conditional generative model for longitudinal single-slice abdominal computed tomography harmonization. Journal of Medical Imaging (Bellingham, Wash.), 11(2), 024008.
    DOI Europe PMC1
    2024 Choi, S. H., Im, G. H., Choi, S., Yu, X., Bandettini, P. A., Menon, R. S., & Kim, S. G. (2024). No replication of direct neuronal activity–related (DIANA) fMRI in anesthetized mice. Science Advances, 10(13), eadl0999.
    DOI Scopus7 Europe PMC12
    2024 Plagwitz, L., Choi, S., Yu, X., Segelcke, D., Lambers, H., Pogatzki-Zahn, E., . . . Pradier, B. (2024). Data-driven time series analysis of sensory cortical processing using high-resolution fMRI across different studies. Biomedical Signal Processing and Control, 93, 106136.
    DOI Scopus1
    2024 Wang, S., Ma, Y., Ding, Y., Hu, Z., Fan, C., Lv, T., . . . Yu, X. (2024). StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(6), 4331-4347.
    DOI Scopus8 Europe PMC1
    2024 Chen, Y., Fernandez, Z., Scheel, N., Gifani, M., Zhu, D. C., Counts, S. E., . . . Qian, C. (2024). Novel inductively coupled ear-bars (ICEs) to enhance restored fMRI signal from susceptibility compensation in rats. Cerebral Cortex, 34(1), bhad479.
    DOI Scopus1 Europe PMC1
    2024 Guan, S., Yu, X., Huang, W., Fang, G., & Lu, H. (2024). DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition. IEEE Transactions on Image Processing, 33, 395-407.
    DOI Scopus11 Europe PMC2
    2024 Du, X., Yu, X., Liu, J., Dai, B., & Xu, F. (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600, 128129.
    DOI Scopus6
    2024 Hike, D., Choi, S., Zhang, B., Jiang, Y., Liu, X., Pohmann, R., . . . Yu, X. (2024). Implementation of 2D Line scanning Method. Aperture Neuro, 4.
    DOI
    2024 Zhou, S., Zhu, T., Ye, D., Yu, X., & Zhou, W. (2024). Boosting Model Inversion Attacks With Adversarial Examples. IEEE Transactions on Dependable and Secure Computing, 21(3), 1451-1468.
    DOI Scopus15
    2024 Zhao, M., Qi, X., Hu, Z., Li, L., Zhang, Y., Huang, Z., & Yu, X. (2024). Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations. IEEE Transactions on Multimedia, 26, 5939-5950.
    DOI Scopus4
    2023 Mao, Y., Wan, Z., Dai, Y., & Yu, X. (2023). Deep Idempotent Network for Efficient Single Image Blind Deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33(1), 172-185.
    DOI Scopus32
    2023 Bao, S., Cui, C., Li, J., Tang, Y., Lee, H. H., Deng, R., . . . Huo, Y. (2023). Topological-Preserving Membrane Skeleton Segmentation in Multiplex Immunofluorescence Imaging. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12471, 124710B.
    DOI Scopus1
    2023 Zeng, H., Zhang, W., Fan, C., Lv, T., Wang, S., Zhang, Z., . . . Yu, X. (2023). FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37(3), 3367-3375.
    DOI Scopus3
    2023 Sheng, H., Yu, X., Wang, F., Khan, M. W., Weng, H., Shariflou, S., & Golzan, S. M. (2023). Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations. Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBS, 2023, 1-4.
    DOI Scopus3 Europe PMC1
    2023 Xu, Y., Zhou, C., Yu, X., & Yang, Y. (2023). Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection. IEEE Transactions on Image Processing, 32, 1992-2002.
    DOI Scopus9 Europe PMC2
    2023 Yu, X., Tang, Y., Yang, Q., Lee, H. H., Gao, R., Bao, S., . . . Landman, B. A. (2023). Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12464, 1246423.
    DOI Scopus3 Europe PMC2
    2023 Yang, Q., Yu, X., Lee, H. H., Cai, L. Y., Xu, K., Bao, S., . . . Landman, B. A. (2023). Single slice thigh CT muscle group segmentation with domain adaptation and self-training. Journal of Medical Imaging, 10(4), 044001.
    DOI Scopus2 Europe PMC1
    2023 Shi, Y., Yu, X., Liu, L., Campbell, D., Koniusz, P., & Li, H. (2023). Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(3), 2682-2697.
    DOI Scopus38 Europe PMC2
    2023 Yu, X., Yang, Q., Zhou, Y., Cai, L. Y., Gao, R., Lee, H. H., . . . Tang, Y. (2023). UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation. Medical Image Analysis, 90, 102939.
    DOI Scopus64 Europe PMC29
    2023 Ma, Y., Wang, S., Hu, Z., Fan, C., Lv, T., Ding, Y., . . . Yu, X. (2023). StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37(2), 1896-1904.
    DOI Scopus45
    2023 Ramadass, K., Yu, X., Cai, L. Y., Tang, Y., Bao, S., Kerley, C., . . . Landman, B. A. (2023). Deep whole brain segmentation of 7T structural MRI. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12464, 124642O.
    DOI
    2022 Fan, H., Yu, X., Yang, Y., & Kankanhalli, M. (2022). Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 9918-9930.
    DOI Scopus40 Europe PMC2
    2022 Shi, Y., Campbell, D., Yu, X., & Li, H. (2022). Geometry-Guided Street-View Panorama Synthesis From Satellite Imagery. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 10009-10022.
    DOI Scopus34 Europe PMC1
    2022 Chen, X., Jiang, Y., Choi, S., Pohmann, R., Scheffler, K., Kleinfeld, D., & Yu, X. (2022). Erratum: Assessment of single-vessel cerebral blood velocity by phase contrast fMRI (PLoS Biol (2021) 19:9 (e3000923) DOI: 10.1371/journal.pbio.3000923). PLOS Biology, 20(12), e3001951.
    DOI
    2022 Ma, F., Wu, Y., Yu, X., & Yang, Y. (2022). Learning With Noisy Labels via Self-Reweighting From Class Centroids. IEEE Transactions on Neural Networks and Learning Systems, 33(11), 6275-6285.
    DOI Scopus35 Europe PMC3
    2022 Choi, S., Zeng, H., Chen, Y., Sobczak, F., Qian, C., & Yu, X. (2022). Laminar-specific functional connectivity mapping with multi-slice line-scanning fMRI. Cerebral Cortex, 32(20), 4492-4501.
    DOI Scopus11 Europe PMC14
    2022 Zhou, X. A., Jiang, Y., Napadow, V., & Yu, X. (2022). Challenges and Perspectives of Mapping Locus Coeruleus Activity in the Rodent with High-Resolution fMRI. Brain Sciences, 12(8), 1085.
    DOI
    2022 Zeng, H., Jiang, Y., Beer-Hammer, S., & Yu, X. (2022). Awake Mouse fMRI and Pupillary Recordings in the Ultra-High Magnetic Field. Frontiers in Neuroscience, 16, 886709.
    DOI Scopus11 Europe PMC18
    2022 Pan, L., Hartley, R., Scheerlinck, C., Liu, M., Yu, X., & Dai, Y. (2022). High Frame Rate Video Reconstruction Based on an Event Camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(5), 2519-2533.
    DOI Scopus68 Europe PMC9
    2022 Xu, Y., Yu, X., Zhang, J., Zhu, L., & Wang, D. (2022). Weakly Supervised RGB-D Salient Object Detection with Prediction Consistency Training and Active Scribble Boosting. IEEE Transactions on Image Processing, 31, 2148-2161.
    DOI Scopus48 Europe PMC4
    2022 Zheng, Y., Yu, X., Liu, M., & Zhang, S. (2022). Single-Image Deraining via Recurrent Residual Multiscale Networks. IEEE Transactions on Neural Networks and Learning Systems, 33(3), 1310-1323.
    DOI Scopus26 Europe PMC6
    2022 Zhang, Y., Yu, X., Lu, X., & Liu, P. (2022). Pro-UIGAN: Progressive Face Hallucination From Occluded Thumbnails. IEEE Transactions on Image Processing, 31, 3236-3250.
    DOI Scopus17 Europe PMC2
    2022 Zhang, Y., Tsang, I. W., Luo, Y., Hu, C., Lu, X., & Yu, X. (2022). Recursive Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(8), 4321-4338.
    DOI Scopus20 Europe PMC3
    2022 Klugah-Brown, B., Yu, Y., Hu, P., Agoalikum, E., Liu, C., Liu, X., . . . Biswal, B. (2022). Effect of surgical mask on fMRI signals during task and rest. Communications Biology, 5(1), 1004.
    DOI Scopus7 Europe PMC6
    2022 Wang, S., Li, L., Ding, Y., & Yu, X. (2022). One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning. Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022, 36(3), 2531-2539.
    DOI Scopus92
    2022 Chen, Y., Wang, Q., Choi, S., Zeng, H., Takahashi, K., Qian, C., & Yu, X. (2022). Focal fMRI signal enhancement with implantable inductively coupled detectors. Neuroimage, 247, 118793.
    DOI Scopus5 Europe PMC11
    2022 Fan, H., Zhuo, T., Yu, X., Yang, Y., & Kankanhalli, M. (2022). Understanding Atomic Hand-Object Interaction With Human Intention. IEEE Transactions on Circuits and Systems for Video Technology, 32(1), 275-285.
    DOI Scopus24
    2022 Han, C., Yu, X., Gao, C., Sang, N., & Yang, Y. (2022). Single image based 3D human pose estimation via uncertainty learning. Pattern Recognition, 132, 108934.
    DOI Scopus29
    2021 Ding, Y., Yu, X., & Yang, Y. (2021). Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 2A(2), 1246-1254.
    DOI Scopus31
    2021 Li, L., Wang, S., Zhang, Z., Ding, Y., Zheng, Y., Yu, X., & Fan, C. (2021). Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 3A(3), 1911-1920.
    DOI Scopus55
    2021 Mächler, P., Broggini, T., Mateo, C., Thunemann, M., Fomin-Thunemann, N., Doran, P. R., . . . Devor, A. (2021). A suite of neurophotonic tools to underpin the contribution of internal brain states in fMRI. Current Opinion in Biomedical Engineering, 18, 100273.
    DOI Scopus5 Europe PMC7
    2021 Raimondo, L., Knapen, T., Oliveira, Í. A. F., Yu, X., Dumoulin, S. O., van der Zwaag, W., & Siero, J. C. W. (2021). A line through the brain: implementation of human line-scanning at 7T for ultra-high spatiotemporal resolution fMRI. Journal of Cerebral Blood Flow and Metabolism, 41(11), 2831-2843.
    DOI Scopus27 Europe PMC26
    2021 Chen, X., Jiang, Y., Choi, S., Pohmann, R., Scheffler, K., Kleinfeld, D., & Yu, X. (2021). Assessment of single-vessel cerebral blood velocity by phase contrast fMRI. PLOS Biology, 19(9), e3000923.
    DOI Scopus18 Europe PMC18
    2021 Sobczak, F., Pais-Roldán, P., Takahashi, K., & Yu, X. (2021). Decoding the brain state-dependent relationship between pupil dynamics and resting state fMRI signal fluctuation. eLife, 10, e68980.
    DOI Scopus13 Europe PMC13
    2021 Quan, R., Wu, Y., Yu, X., & Yang, Y. (2021). Progressive transfer learning for face anti-spoofing. IEEE Transactions on Image Processing, 30, 3946-3955.
    DOI Scopus64 Europe PMC7
    2021 Sobczak, F., He, Y., Sejnowski, T. J., & Yu, X. (2021). Predicting the fMRI Signal Fluctuation with Recurrent Neural Networks Trained on Vascular Network Dynamics. Cerebral Cortex, 31(2), 826-844.
    DOI Scopus9 Europe PMC10
    2021 Zhang, Y., Tsang, I. W., Li, J., Liu, P., Lu, X., & Yu, X. (2021). Face Hallucination with Finishing Touches. IEEE Transactions on Image Processing, 30, 1728-1743.
    DOI Scopus30 Europe PMC4
    2021 Pais-Roldán, P., Mateo, C., Pan, W. J., Acland, B., Kleinfeld, D., Snyder, L. H., . . . Keilholz, S. (2021). Contribution of animal models toward understanding resting state functional connectivity. Neuroimage, 245, 118630.
    DOI Scopus26 Europe PMC30
    2021 Xu, Y., Zhou, C., Yu, X., Xiao, B., & Yang, Y. (2021). Pyramidal Multiple Instance Detection Network with Mask Guided Self-Correction for Weakly Supervised Object Detection. IEEE Transactions on Image Processing, 30, 3029-3040.
    DOI Scopus45 Europe PMC7
    2020 Qian, W., Yu, X., & Qian, C. (2020). Wireless Powered Encoding and Broadcasting of Frequency Modulated Detection Signals. IEEE Access, 8, 200450-200460.
    DOI Europe PMC1
    2020 He, Y., Wang, M., & Yu, X. (2020). High spatiotemporal vessel-specific hemodynamic mapping with multi-echo single-vessel fMRI. Journal of Cerebral Blood Flow and Metabolism, 40(10), 2098-2114.
    DOI Scopus8 Europe PMC11
    2020 Yu, X., Fernando, B., Hartley, R., & Porikli, F. (2020). Semantic face hallucination: Super-resolving very low-resolution face images with supplementary attributes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(11), 2926-2943.
    DOI Scopus43 Europe PMC4
    2020 Drew, P. J., Mateo, C., Turner, K. L., Yu, X., & Kleinfeld, D. (2020). Ultra-slow Oscillations in fMRI and Resting-State Connectivity: Neuronal and Vascular Contributions and Technical Confounds. Neuron, 107(5), 782-804.
    DOI Scopus109 Europe PMC116
    2020 Chen, Y., Sobczak, F., Pais-Roldan, P., Schwarz, C., Koretsky, A. P., & Yu, X. (2020). Mapping the Brain-Wide Network Effects by Optogenetic Activation of the Corpus Callosum. Cerebral Cortex, 30(11), 5885-5898.
    DOI Scopus21 Europe PMC21
    2020 Choi, S., Takahashi, K., Jiang, Y., Köhler, S., Zeng, H., Wang, Q., . . . Yu, X. (2020). Real-time fMRI brain mapping in animals. Journal of Visualized Experiments, 2020(163), 1-13.
    DOI Scopus2 Europe PMC2
    2020 Yu, X., Shiri, F., Ghanem, B., & Porikli, F. (2020). Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(9), 2148-2164.
    DOI Scopus33 Europe PMC6
    2020 Qian, W., Yu, X., & Qian, C. (2020). Wireless Reconfigurable RF Detector Array for Focal and Multiregional Signal Enhancement. IEEE Access, 8, 136594-136604.
    DOI Europe PMC6
    2020 Pais-Roldán, P., Takahashi, K., Sobczak, F., Chen, Y., Zhao, X., Zeng, H., . . . Yu, X. (2020). Indexing brain state-dependent pupil dynamics with simultaneous fMRI and optical fiber calcium recording. Proceedings of the National Academy of Sciences of the United States of America, 117(12), 6875-6882.
    DOI Scopus50 Europe PMC60
    2020 Ovsepian, S. V., Jiang, Y., Sardella, T. C. P., Malekzadeh-Najafabadi, J., Burton, N. C., Yu, X., & Ntziachristos, V. (2020). Visualizing cortical response to optogenetic stimulation and sensory inputs using multispectral handheld optoacoustic imaging. Photoacoustics, 17, 100153.
    DOI Scopus7 Europe PMC9
    2020 Handwerker, J., Pérez-Rodas, M., Beyerlein, M., Vincent, F., Beck, A., Freytag, N., . . . Scheffler, K. (2020). A CMOS NMR needle for probing brain physiology with high spatial and temporal resolution. Nature Methods, 17(1), 64-67.
    DOI Scopus36 Europe PMC15
    2020 Yu, X., Porikli, F., Fernando, B., & Hartley, R. (2020). Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks. International Journal of Computer Vision, 128(2), 500-526.
    DOI Scopus35
    2020 Wang, Z., Yu, X., Lu, M., Wang, Q., Qian, C., & Xu, F. (2020). Single image portrait relighting via explicit multiple reflectance channel modeling. ACM Transactions on Graphics, 39(6), 1-13.
    DOI Scopus82
    2019 Yan, H., Yu, X., Zhang, Y., Zhang, S., Zhao, X., & Zhang, L. (2019). Single Image Depth Estimation with Normal Guided Scale Invariant Deep Convolutional Fields. IEEE Transactions on Circuits and Systems for Video Technology, 29(1), 80-92.
    DOI Scopus28
    2019 Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2019). Identity-Preserving Face Recovery from Stylized Portraits. International Journal of Computer Vision, 127(6-7), 863-883.
    DOI Scopus20
    2019 Chen, X., Sobczak, F., Chen, Y., Jiang, Y., Qian, C., Lu, Z., . . . Yu, X. (2019). Mapping optogenetically-driven single-vessel fMRI with concurrent neuronal calcium recordings in the rat hippocampus. Nature Communications, 10(1), 5239.
    DOI Scopus37 Europe PMC45
    2019 Chen, Y., Pais-Roldan, P., Chen, X., Frosz, M. H., & Yu, X. (2019). MRI-guided robotic arm drives optogenetic fMRI with concurrent Ca2+ recording. Nature Communications, 10(1), 2536.
    DOI Scopus26 Europe PMC24
    2019 Pais-Roldán, P., Edlow, B. L., Jiang, Y., Stelzer, J., Zou, M., & Yu, X. (2019). Multimodal assessment of recovery from coma in a rat model of diffuse brainstem tegmentum injury. Neuroimage, 189, 615-630.
    DOI Scopus26 Europe PMC28
    2018 Pais-Roldán, P., Biswal, B., Scheffler, K., & Yu, X. (2018). Identifying respiration-related aliasing artifacts in the rodent resting-state fMRI. Frontiers in Neuroscience, 12(NOV), 788.
    DOI Scopus22 Europe PMC26
    2018 Yu, X., & Porikli, F. (2018). Imagining the Unimaginable Faces by Deconvolutional Networks. IEEE Transactions on Image Processing, 27(6), 2747-2761.
    DOI Scopus31 Europe PMC7
    2018 Wang, M., He, Y., Sejnowski, T. J., & Yu, X. (2018). Brain-state dependent astrocytic Ca2+ signals are coupled to both positive and negative BOLD-fMRI signals. Proceedings of the National Academy of Sciences of the United States of America, 115(7), E1647-E1656.
    DOI Scopus83 Europe PMC84
    2018 He, Y., Wang, M., Chen, X., Pohmann, R., Polimeni, J. R., Scheffler, K., . . . Yu, X. (2018). Ultra-Slow Single-Vessel BOLD and CBV-Based fMRI Spatiotemporal Dynamics and Their Correlation with Neuronal Intracellular Calcium Signals. Neuron, 97(4), 925-939.e5.
    DOI Scopus96 Europe PMC102
    2018 Li, L., Zhang, S., Yu, X., & Zhang, L. (2018). PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching. IEEE Transactions on Circuits and Systems for Video Technology, 28(3), 679-692.
    DOI Scopus81
    2017 Li, L., Yu, X., Zhang, S., Zhao, X., & Zhang, L. (2017). 3D Cost aggregation with multiple minimum spanning trees for stereo matching. Applied Optics, 56(12), 3411-3420.
    DOI Scopus80 Europe PMC8
    2017 Chung, S., Jeong, J. H., Ko, S., Yu, X., Kim, Y. H., Isaac, J. T. R., & Koretsky, A. P. (2017). Peripheral Sensory Deprivation Restores Critical-Period-like Plasticity to Adult Somatosensory Thalamocortical Inputs. Cell Reports, 19(13), 2707-2717.
    DOI Scopus26 Europe PMC24
    2016 Pais-Roldán, P., Singh, A. P., Schulz, H., & Yu, X. (2016). High magnetic field induced otolith fusion in the zebrafish larvae. Scientific Reports, 6.
    DOI Scopus15 Europe PMC6
    2016 Yu, X., He, Y., Wang, M., Merkle, H., Dodd, S. J., Silva, A. C., & Koretsky, A. P. (2016). Sensory and optogenetically driven single-vessel fMRI. Nature Methods, 13(4), 337-340.
    DOI Scopus84 Europe PMC93
    2015 Sui, Y., Zhao, X., Zhang, S., Yu, X., Zhao, S., & Zhang, L. (2015). Self-expressive tracking. Pattern Recognition, 48(9), 2872-2884.
    DOI Scopus12
    2015 Zhang, S., Sui, Y., Zhao, S., Yu, X., & Zhang, L. (2015). Multi-local-task learning with global regularization for object tracking. Pattern Recognition, 48(12), 3881-3894.
    DOI Scopus23
    2015 Zhang, S., Sui, Y., Yu, X., Zhao, S., & Zhang, L. (2015). Hybrid support vector machines for robust object tracking. Pattern Recognition, 48(8), 2474-2488.
    DOI Scopus32
    2015 Zhang, S., Yu, X., Sui, Y., Zhao, S., & Zhang, L. (2015). Object tracking with multi-view support vector machines. IEEE Transactions on Multimedia, 17(3), 265-278.
    DOI Scopus113
    2015 Yu, X., Zhang, S., Zhao, X., & Zhang, L. (2015). Removing blur kernel noise via a hybrid lp norm. Journal of Electronic Imaging, 24(1).
    DOI Scopus4
    2014 Yu, X., Xu, F., Zhang, S., & Zhang, L. (2014). Efficient patch-wise non-uniform deblurring for a single image. IEEE Transactions on Multimedia, 16(6), 1510-1524.
    DOI Scopus50
    2014 Qian, C., Yu, X., Pothayee, N., Dodd, S., Bouraoud, N., Star, R., . . . Koretsky, A. (2014). Live nephron imaging by MRI. American Journal of Physiology Renal Physiology, 307(10), F1162-F1168.
    DOI Scopus17 Europe PMC17
    2014 Yu, X., Zhao, X., Sui, Y., & Zhang, L. (2014). Handling noise in single image defocus map estimation by using directional filters. Optics Letters, 39(21), 6281-6284.
    DOI Scopus4 Europe PMC2
    2014 Yu, X., & Koretsky, A. P. (2014). Interhemispheric plasticity protects the deafferented somatosensory cortex from functional takeover after nerve injury. Brain Connectivity, 4(9), 709-717.
    DOI Scopus16 Europe PMC14
    2014 Yu, X., Qian, C., Chen, D. Y., Dodd, S. J., & Koretsky, A. P. (2014). Deciphering laminar-specific neural inputs with line-scanning fMRI. Nature Methods, 11(1), 55-58.
    DOI Scopus137 Europe PMC139
    2013 Qian, C., Yu, X., Chen, D. Y., Dodd, S., Bouraoud, N., Pothayee, N., . . . Koretsky, A. (2013). Wireless amplified nuclear MR detector (WAND) for high-spatial-resolution MR imaging of internal organs: Preclinical demonstration in a rodent model. Radiology, 268(1), 228-236.
    DOI Scopus37 Europe PMC35
    2012 Yu, X., Chung, S., Chen, D. Y., Wang, S., Dodd, S. J., Walters, J. R., . . . Koretsky, A. P. (2012). Thalamocortical Inputs Show Post-Critical-Period Plasticity. Neuron, 74(4), 731-742.
    DOI Scopus64 Europe PMC66
    2012 Yu, X., Glen, D., Wang, S., Dodd, S., Hirano, Y., Saad, Z., . . . Koretsky, A. P. (2012). Direct imaging of macrovascular and microvascular contributions to BOLD fMRI in layers IV-V of the rat whisker-barrel cortex. Neuroimage, 59(2), 1451-1460.
    DOI Scopus87 Europe PMC90
    2011 Zhao, X., Yu, X., Sun, L., Hu, K., Wang, G., & Zhang, L. (2011). Non-rigid object tracking as salient region segmentation and association. IEICE Transactions on Information and Systems, E94-D(4), 934-937.
    DOI Scopus2
    2010 Yu, X., Wang, S., Chen, D. Y., Dodd, S., Goloshevsky, A., & Koretsky, A. P. (2010). 3D mapping of somatotopic reorganization with small animal functional MRI. Neuroimage, 49(2), 1667-1676.
    DOI Scopus35 Europe PMC29
    2022 Tang, T., Du, H., Yu, X., & Yang, Y. (2022). Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion. Proceedings of the AAAI Conference on Artificial Intelligence, 36(5), 5422-5430.
    DOI
    2017 Yu, X., & Porikli, F. (2017). Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1).
    DOI
  • Book Chapters

    Year Citation
    2025 Zhang, H., Xu, J., Tang, T., Sun, H., Yu, X., Huang, Z., & Yu, K. (2025). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. In Lecture Notes in Computer Science (Vol. 15142 LNCS, pp. 1-19). Springer Nature Switzerland.
    DOI
    2024 Yu, Q., Du, H., & Yu, X. (2024). A New Perspective of Weakly Supervised 3D Instance Segmentation via Bounding Boxes. In Lecture Notes in Computer Science (Vol. 14471 LNAI, pp. 103-114). Springer Nature Singapore.
    DOI Scopus1
    2024 Sheng, H., Yu, X., Li, X., & Golzan, M. (2024). Context-Based Masking for Spontaneous Venous Pulsations Detection. In Lecture Notes in Computer Science (Vol. 14471 LNAI, pp. 520-532). Springer Nature Singapore.
    DOI Scopus1
    2024 Xu, Q., Du, H., Chen, H., Liu, B., & Yu, X. (2024). MMOOC: A Multimodal Misinformation Dataset for Out-of-Context News Analysis. In Lecture Notes in Computer Science (Vol. 14897 LNCS, pp. 444-459). Springer Nature Singapore.
    DOI Scopus4
    2024 Du, H., Huang, Z., Chapman, S., & Yu, X. (2024). Toward a Unified Framework for RGB and RGB-D Visual Navigation. In Lecture Notes in Computer Science (Vol. 14472 LNAI, pp. 363-375). Springer Nature Singapore.
    DOI Scopus2
    2024 Yuan, B., Wang, Z., & Yu, X. (2024). Towards Reliable and Efficient Vegetation Segmentation for Australian Wheat Data Analysis. In Lecture Notes in Computer Science (pp. 119-135). Springer Nature Switzerland.
    DOI
    2023 Cai, J., Nguyen, K. N., Shrestha, N., Good, A., Tu, R., Yu, X., . . . Serra, T. (2023). Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions. In Lecture Notes in Computer Science (Vol. 13884 LNCS, pp. 200-218). Springer Nature Switzerland.
    DOI Scopus3
    2023 Shi, Y., Yu, X., Wang, S., & Li, H. (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. In Lecture Notes in Computer Science (Vol. 13841 LNCS, pp. 123-141). Springer Nature Switzerland.
    DOI Scopus12
    2023 Wang, M., Lin, B., Guo, X., Li, L., Zhu, Z., Sun, J., . . . Yu, X. (2023). GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework. In Lecture Notes in Computer Science (Vol. 13844 LNCS, pp. 711-727). Springer Nature Switzerland.
    DOI Scopus5
    2023 Zhou, X. A., Jiang, Y., Man, W., & Yu, X. (2023). Multimodal methods to help interpret resting-state fMRI. In Advances in Resting-State Functional MRI (pp. 207-235). Elsevier.
    DOI
    2023 Fu, H., Liu, C., Qi, X., Lin, B., Li, L., Zhang, L., & Yu, X. (2023). Sign Spotting via Multi-modal Fusion and Testing Time Transferring. In Lecture Notes in Computer Science (Vol. 13808 LNCS, pp. 271-287). Springer Nature Switzerland.
    DOI Scopus4
    2023 Lee, H. H., Liu, Q., Bao, S., Yang, Q., Yu, X., Cai, L. Y., . . . Landman, B. A. (2023). Scaling up 3D Kernels with Bayesian Frequency Re-parameterization for Medical Image Segmentation. In Lecture Notes in Computer Science (Vol. 14223 LNCS, pp. 632-641). Springer Nature Switzerland.
    DOI Scopus9
    2022 Zhu, F., Yang, Z., Yu, X., Yang, Y., & Wei, Y. (2022). Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation. In Lecture Notes in Computer Science (Vol. 13689 LNCS, pp. 524-540). Springer Nature Switzerland.
    DOI Scopus4
    2022 Zeng, H., Yu, X., Miao, J., & Yang, Y. (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. In Lecture Notes in Computer Science (Vol. 13662 LNCS, pp. 1-17). Springer Nature Switzerland.
    DOI Scopus7
    2021 Liu, J., & Yu, X. (2021). Few-shot Weighted Style Matching for Glaucoma Detection. In Lecture Notes in Computer Science (Vol. 13069 LNAI, pp. 289-300). Springer International Publishing.
    DOI Scopus1
    2020 Liu, J., Zou, Z., Ye, X., Tan, X., Ding, E., Xu, F., & Yu, X. (2020). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. In Lecture Notes in Computer Science (Vol. 12536 LNCS, pp. 707-714). Springer International Publishing.
    DOI Scopus8
    2020 Du, H., Yu, X., & Zheng, L. (2020). Learning Object Relation Graph and Tentative Policy for Visual Navigation. In Lecture Notes in Computer Science (Vol. 12352 LNCS, pp. 19-34). Springer International Publishing.
    DOI Scopus103
    2018 Yu, X., Fernando, B., Ghanem, B., Porikli, F., & Hartley, R. (2018). Face super-resolution guided by facial component heatmaps. In Lecture Notes in Computer Science (Vol. 11213 LNCS, pp. 219-235). Springer International Publishing.
    DOI Scopus48
    2017 Yu, X. (2017). When Photons Meet Protons: Optogenetics, Calcium Signal Detection, and fMRI in Small Animals. In Small Animal Imaging (pp. 773-791). Springer International Publishing.
    DOI
    2016 Yu, X., & Porikli, F. (2016). Ultra-resolving face images by discriminative generative networks. In Lecture Notes in Computer Science (Vol. 9909 LNCS, pp. 318-333).
    DOI Scopus248
  • Conference Papers

    Year Citation
    2025 Hu, Z., Zhang, Y., Liu, C., Li, L., Peng, S., Zhou, X., . . . Yu, X. (2025). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15131 LNCS (pp. 223-239). Springer Nature Switzerland.
    DOI
    2025 He, M., Zhang, J., & Yu, X. (2025). Transferable Attacks for Semantic Segmentation. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15449 LNCS (pp. 372-388). Springer Nature Singapore.
    DOI
    2025 Liu, C., Li, P., Yang, L., Wang, D., Li, L., & Yu, X. (2025). Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 28922-28931). IEEE.
    DOI
    2025 Wang, S., Zhang, H., Shen, X., Wang, D., & Yu, X. (2025). Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 22975-22984). IEEE.
    DOI
    2025 Liu, C., Zhang, W., Qiu, F., Li, L., Wang, D., & Yu, X. (2025). Affective Behaviour Analysis via Progressive Learning. In Lecture Notes in Computer Science Vol. 15637 LNCS (pp. 366-379). Springer Nature Switzerland.
    DOI
    2025 Xu, Q., Cao, R., Shen, X., Du, H., Wang, S., & Yu, X. (2025). M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12289-12300). IEEE.
    DOI
    2025 Wang, S., Chen, W., Zhang, W., Zhao, M., Li, L., Zhang, R., . . . Yu, X. (2025). EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 5581-5591). IEEE.
    DOI
    2025 Liu, C., Qiu, F., Zhang, W., Li, L., Wang, D., & Yu, X. (2025). Compound Expression Recognition via Curriculum Learning. In Lecture Notes in Computer Science Vol. 15637 LNCS (pp. 282-293). Springer Nature Switzerland.
    DOI
    2025 Liu, C., Yang, L., Li, P., Wang, D., Li, L., & Yu, X. (2025). Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3131-3141). IEEE.
    DOI
    2025 Shen, X., Du, H., Xu, M., Liu, M., & Yu, X. (2025). Cross-View Isolated Sign Language Recognition Challenge: Design, Results and Future Research. In WWW Companion 2025 Companion Proceedings of the ACM Web Conference 2025 (pp. 2444-2447). ACM.
    DOI
    2025 Xu, Q., Du, H., Łukasik, S., Zhu, T., Wang, S., & Yu, X. (2025). MDAM3: A Misinformation Detection and Analysis Framework for Multitype Multimodal Media. In WWW 2025 Proceedings of the ACM Web Conference (pp. 5285-5296). ACM.
    DOI
    2025 Guo, T., Du, H., Huo, H., Liu, B., & Yu, X. (2025). Who is Being Impersonated? Deepfake Audio Detection and Impersonated Identification via Extraction of Id-Specific Features. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15255 LNCS (pp. 301-320). Springer Nature Singapore.
    DOI Scopus1
    2025 Ying, J., Shen, X., & Yu, X. (2025). Vision-Based Abnormal Action Dataset for Recognising Body Motion Disorders. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15443 LNAI (pp. 443-455). Springer Nature Singapore.
    DOI Scopus1
    2025 Zhang, B., Cao, Z., Du, H., Yu, X., Li, X., Liu, J., & Wang, S. (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 4957-4967). IEEE.
    DOI
    2025 Cao, Z., Zhang, B., Du, H., Yu, X., Li, X., & Wang, S. (2025). FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 9226-9236). IEEE.
    DOI
    2024 Yu, X., Tang, Y., Yang, Q., Lee, H. H., Bao, S., Huo, Y., & Landman, B. A. (2024). Enhancing Hierarchical Transformers for Whole Brain Segmentation with Intracranial Measurements Integration. In B. S. Gimi, & A. Krol (Eds.), Proceedings of SPIE--the International Society for Optical Engineering Vol. 12930 (pp. 129300K). United States: SPIE.
    DOI Europe PMC1
    2024 Qiu, F., Zhang, W., Liu, C., Li, L., Du, H., Guo, T., & Yu, X. (2024). Language-guided Multi-modal Emotional Mimicry Intensity Estimation. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4742-4751). IEEE.
    DOI Scopus3
    2024 Tang, J., Li, L., Qi, X., Chen, Y., Fan, C., & Yu, X. (2024). AS-NeRF: Learning Auxiliary Sampling for Generalizable Novel View Synthesis from Sparse Views. In Proceedings IEEE International Conference on Multimedia and Expo (pp. 1-6). IEEE.
    DOI
    2024 Qiu, F., Du, H., Zhang, W., Liu, C., Li, L., Guo, T., & Yu, X. (2024). Learning Transferable Compound Expressions from Masked AutoEncoder Pretraining. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4733-4741). IEEE.
    DOI Scopus2
    2024 Zhang, W., Qiu, F., Liu, C., Li, L., Du, H., Guo, T., & Yu, X. (2024). An Effective Ensemble Learning Framework for Affective Behaviour Analysis. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4761-4772). IEEE.
    DOI Scopus6
    2024 Wei, T., Chen, Z., Huang, Z., & Yu, X. (2024). Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline. In MM 2024 Proceedings of the 32nd ACM International Conference on Multimedia (pp. 1593-1601). ACM.
    DOI Scopus3
    2024 Qiu, F., Zhang, W., Liu, C., An, R., Li, L., Ding, Y., . . . Yu, X. (2024). FreeAvatar: Robust 3D Facial Animation Transfer by Learning an Expression Foundation Model. In Proceedings SIGGRAPH Asia 2024 Conference Papers SA 2024 (pp. 1-11). ACM.
    DOI Scopus3
    2024 Wei, T., Chen, Z., & Yu, X. (2024). Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild. In Proceedings of the 6th ACM International Conference on Multimedia in Asia MMAsia 2024 (pp. 1-3). ACM.
    DOI
    2024 Hu, Z., Zhao, M., Zhao, C., Liang, X., Li, L., Zhao, Z., . . . Yu, X. (2024). EfficientDreamer: High-Fidelity and Stable 3D Creation via Orthogonal-view Diffusion Priors. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4949-4958). IEEE.
    DOI Scopus8
    2024 Shiri, F., Guo, X. Y., Far, M. G., Yu, X., Haffari, G., & Li, Y. F. (2024). An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024) (pp. 21440-21455). Association for Computational Linguistics.
    DOI Scopus1
    2024 Yu, Q., Du, H., Liu, C., & Yu, X. (2024). When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision. In Proceedings 2024 IEEE Winter Conference on Applications of Computer Vision WACV 2024 (pp. 3707-3716). IEEE.
    DOI Scopus4
    2024 Dong, G., Wang, H., Sun, J., & Wang, X. (2024). Evaluating and Mitigating Linguistic Discrimination in Large Language Models: Perspectives on Safety Equity and Knowledge Equity. In Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (pp. 348-356). International Joint Conferences on Artificial Intelligence Organization.
    DOI
    2024 Liu, C., Li, P. P., Yu, Q., Sheng, H., Wang, D., Li, L., & Yu, X. (2024). Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 22712-22722). IEEE.
    DOI Scopus1
    2024 Chen, H., Zhu, T., Yu, X., & Zhou, W. (2024). Machine Unlearning via Null Space Calibration. In IJCAI International Joint Conference on Artificial Intelligence (pp. 358-366).
    Scopus5
    2024 Shen, X., Du, H., Sheng, H., Wang, S., Chen, H., Chen, H., . . . Yu, X. (2024). MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset. In Advances in Neural Information Processing Systems Vol. 37.
    Scopus1
    2024 Chen, H., Liu, Y., Ma, Y., Zheng, N., & Yu, X. (2024). TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning. In Advances in Neural Information Processing Systems Vol. 37.
    Scopus1
    2024 Lim, J. S., Chen, Z., Baktashmotlagh, M., Chen, Z., Yu, X., Huang, Z., & Luo, Y. (2024). DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection. In Advances in Neural Information Processing Systems Vol. 37.
    Scopus2
    2024 Wu, Y., Meng, Y., Hu, Z., Li, L., Wu, H., Zhou, K., . . . Yu, X. (2024). Text-Guided 3D Face Synthesis - From Generation to Editing. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1260-1269). IEEE.
    DOI Scopus7
    2023 Du, H., Li, L., Huang, Z., & Yu, X. (2023). Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 2563-2573). IEEE.
    DOI Scopus24
    2023 Shen, C., Lin, B., Zhang, S., Yu, X., Huang, G. Q., & Yu, S. (2023). Gait Recognition with Mask-based Regularization. In 2023 IEEE International Joint Conference on Biometrics IJCB 2023 (pp. 1-10). IEEE.
    DOI Scopus5
    2023 Zhang, Y., Wang, Z., Luo, Y., Yu, X., & Huang, Z. (2023). Learning Efficient Unsupervised Satellite Image-based Building Damage Detection. In Proceedings IEEE International Conference on Data Mining ICDM (pp. 1547-1552). IEEE.
    DOI Scopus2
    2023 Wu, H., Hu, Z., Li, L., Zhang, Y., Fan, C., & Yu, X. (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 4295-4304). IEEE.
    DOI Scopus33
    2023 Tang, J., Li, L., Hou, J., Xin, H., & Yu, X. (2023). A Divide-and-conquer Solution to 3D Human Motion Estimation from Raw MoCap Data. In 2023 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW) (pp. 767-768). IEEE.
    DOI
    2023 Zhao, Y., Liu, B., Ding, M., Liu, B., Zhu, T., & Yu, X. (2023). Proactive Deepfake Defence via Identity Watermarking. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 4591-4600). IEEE.
    DOI
    2023 Du, H., Yu, X., Hussain, F., Armin, M. A., Petersson, L., & Li, W. (2023). Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 4260-4269). IEEE.
    DOI
    2023 Rao, Q., Yu, X., Navasardyan, S., & Shi, H. (2023). Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp. 5395-5404). IEEE.
    DOI
    2023 Qi, X., Liu, C., Sun, M., Li, L., Fan, C., & Yu, X. (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 4616-4626). IEEE.
    DOI Scopus15
    2023 Wang, M., Guo, X., Lin, B., Yang, T., Zhu, Z., Li, L., . . . Yu, X. (2023). DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition. In Proceedings of the IEEE International Conference on Computer Vision (pp. 13378-13387). IEEE.
    DOI Scopus42
    2023 Liu, C., Li, P. P., Qi, X., Zhang, H., Li, L., Wang, D., & Yu, X. (2023). Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics. In MM 2023 Proceedings of the 31st ACM International Conference on Multimedia (pp. 7590-7598). ACM.
    DOI Scopus31
    2023 Khan, M. W., Sheng, H., Zhang, H., Du, H., Wang, S., Coroneo, M. T., . . . Yu, X. (2023). RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation. In Advances in Neural Information Processing Systems Vol. 36.
    Scopus6
    2023 Liu, P., Yu, X., & Zhou, J. T. (2023). Meta Knowledge Condensation for Federated Learning. In 11th International Conference on Learning Representations ICLR 2023.
    Scopus7
    2023 Liu, B., Liu, B., Ding, M., Zhu, T., & Yu, X. (2023). TI²Net: Temporal Identity Inconsistency Network for Deepfake Detection. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). IEEE.
    DOI
    2023 Shen, X., Yuan, S., Sheng, H., Du, H., & Yu, X. (2023). Auslan-Daily: Australian Sign Language Translation for Daily Communication and News. In Advances in Neural Information Processing Systems Vol. 36.
    Scopus19
    2023 Luo, Y., Chen, Z., Wang, Z., Yu, X., Huang, Z., & Baktashmotlagh, M. (2023). Exploring Active 3D Object Detection from a Generalization Perspective. In 11th International Conference on Learning Representations ICLR 2023.
    Scopus19
    2022 Li, S., Phillips, J. M., Yu, X., Kirby, R. M., & Zhe, S. (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. In Advances in Neural Information Processing Systems Vol. 35.
    Scopus11
    2022 Yao, G., Wu, H., Yuan, Y., Li, L., Zhou, K., & Yu, X. (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1566-1572). International Joint Conferences on Artificial Intelligence Organization.
    DOI Scopus4
    2021 Li, P., Yu, X., & Yang, Y. (2021). Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4449-4459). IEEE.
    DOI Scopus2
    2021 Zeng, H., Dai, Y., Yu, X., Wang, X., & Yang, Y. (2021). PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion. In Proceedings of the IEEE International Conference on Computer Vision (pp. 5580-5589). IEEE.
    DOI Scopus10
    2021 Yu, X., Van Baar, J., & Chen, S. (2021). Joint 3D Human Shape Recovery and Pose Estimation from a Single Image with Bilayer Graph. In Proceedings 2021 International Conference on 3D Vision 3DV 2021 (pp. 505-514). IEEE.
    DOI Scopus5
    2021 Wang, S., Li, L., Ding, Y., Fan, C., & Yu, X. (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1098-1105). International Joint Conferences on Artificial Intelligence Organization.
    DOI Scopus59
    2021 Ding, Y., Yu, X., & Yang, Y. (2021). RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 3955-3964). IEEE.
    DOI Scopus135
    2021 Ben-Shabat, Y., Yu, X., Saleh, F., Campbell, D., Rodriguez Opazo, C., Li, H., & Gould, S. (2021). The IKEA ASM Dataset: Understanding people assembling furniture through actions, objects and pose. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV 2021) (pp. 846-858). virtual online: IEEE.
    DOI Scopus93 WoS81
    2021 Fan, H., Yu, X., Ding, Y., Yang, Y., & Kankanhalli, M. (2021). PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences. In ICLR 2021 9th International Conference on Learning Representations.
    Scopus67
    2021 Du, H., Yu, X., & Zheng, L. (2021). VTNet: Visual Transformer Network for Object Goal Navigation. In ICLR 2021 9th International Conference on Learning Representations.
    Scopus40
    2021 Tang, T., Yu, X., Dong, X., & Yang, Y. (2021). Auto-Navigator: Decoupled neural architecture search for visual navigation. In Proceedings 2021 IEEE Winter Conference on Applications of Computer Vision WACV 2021 (pp. 3742-3751). IEEE.
    DOI Scopus9
    2021 Quan, R., Yu, X., Liang, Y., & Yang, Y. (2021). Removing Raindrops and Rain Streaks in One Go. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9143-9152). IEEE.
    DOI Scopus158
    2021 Shi, Y., Li, H., & Yu, X. (2021). Self-Supervised Visibility Learning for Novel View Synthesis. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9670-9679). IEEE.
    DOI Scopus15
    2021 Li, D., Xu, C., Zhang, K., Yu, X., Zhong, Y., Ren, W., . . . Li, H. (2021). ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7717-7727). IEEE.
    DOI Scopus56
    2021 Lin, B., Zhang, S., & Yu, X. (2021). Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 14628-14636). IEEE.
    DOI Scopus266
    2021 Yang, Z., Yu, X., & Yang, Y. (2021). DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3906-3915). IEEE.
    DOI Scopus45
    2021 Zhuang, Z., Yu, X., & Mahony, R. (2021). End-to-end Multi-Instance Robotic Reaching from Monocular Vision. In Proceedings IEEE International Conference on Robotics and Automation Vol. 2021-May (pp. 12974-12980). IEEE.
    DOI Scopus1
    2021 Kennedy, G., Gao, J., Zhuang, Z., Yu, X., & Mahony, R. (2021). A General Approach to State Refinement. In IEEE International Conference on Intelligent Robots and Systems (pp. 8985-8991). IEEE.
    DOI
    2021 Zhang, J., Fan, D. P., Dai, Y., Yu, X., Zhong, Y., Barnes, N., & Shao, L. (2021). RGB-D Saliency Detection via Cascaded Mutual Information Minimization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4318-4327). IEEE.
    DOI Scopus109
    2020 Li, D., Yu, X., Xu, C., Petersson, L., & Li, H. (2020). Transferring Cross-Domain Knowledge for Video Sign Language Recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 6204-6213). IEEE.
    DOI Scopus121
    2020 Zhuang, Z., Yu, X., & Mahony, R. (2020). LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision. In 2020 IEEE International Conference on Robotics and Automation (ICRA) (pp. 8331-8337). IEEE.
    DOI
    2020 Zheng, Z., Jiang, M., Wang, Z., Wang, J., Bai, Z., Zhang, X., . . . Ding, E. (2020). Going beyond real data: A robust visual representation for vehicle re-identification. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Vol. 2020-June (pp. 2550-2558). IEEE.
    DOI Scopus49
    2020 Shi, Y., Yu, X., Campbell, D., & Li, H. (2020). Where am I looking at? Joint location and orientation estimation by cross-view matching. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4063-4071). IEEE.
    DOI Scopus186
    2020 Zhang, J., Yu, X., Li, A., Song, P., Liu, B., & Dai, Y. (2020). Weakly-Supervised Salient Object Detection via Scribble Annotations. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12543-12552). IEEE.
    DOI Scopus293
    2020 Yu, X., Zhuang, Z., Koniusz, P., & Li, H. (2020). 6DoF Object Pose Estimation via Differentiable Proxy Voting Regularizer. In 31st British Machine Vision Conference BMVC 2020.
    Scopus18
    2020 Li, D., Xu, C., Yu, X., Zhang, K., Swift, B., Suominen, H., & Li, H. (2020). TSPNet: Hierarchical feature learning via temporal semantic pyramid for sign language translation. In Advances in Neural Information Processing Systems Vol. 2020-December.
    Scopus105
    2020 Li, P., Dong, X., Yu, X., & Yang, Y. (2020). When Humans Meet Machines: Towards Efficient Segmentation Networks. In 31st British Machine Vision Conference BMVC 2020.
    Scopus27
    2020 Li, D., Rodriguez Opazo, C., Yu, X., & Li, H. (2020). Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison. In Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (pp. 1448-1458). Snowmass Village, CO, USA: IEEE.
    DOI Scopus446 WoS309
    2020 Zhang, Y., Tsang, I. W., Luo, Y., Hu, C. -H., Lu, X., & Yu, X. (2020). Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 7353-7362). IEEE.
    DOI
    2019 Pan, L., Scheerlinck, C., Yu, X., Hartley, R., Liu, M., & Dai, Y. (2019). Bringing a blurry frame alive at high frame-rate with an event camera. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 6813-6822). IEEE.
    DOI Scopus246
    2019 Yu, X., Tian, Y., Porikli, F., Hartley, R., Li, H., Heijnen, H., & Balntas, V. (2019). Unsupervised extraction of local image descriptors via relative distance ranking loss. In Proceedings 2019 International Conference on Computer Vision Workshop ICCVW 2019 (pp. 2893-2902). IEEE.
    DOI Scopus23
    2019 Tian, Y., Yu, X., Fan, B., Wu, F., Heijnen, H., & Balntas, V. (2019). SOSNet: Second order similarity regularization for local descriptor learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 11008-11017). IEEE.
    DOI Scopus358
    2019 Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2019). Recovering faces from portraits with auxiliary facial attributes. In Proceedings 2019 IEEE Winter Conference on Applications of Computer Vision WACV 2019 (pp. 406-415). IEEE.
    DOI Scopus12
    2018 Yu, X., Yu, Z., & Ramalingam, S. (2018). Learning Strict Identity Mappings in Deep Residual Networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4432-4440). IEEE.
    DOI Scopus60
    2018 Yu, X., Fernando, B., Hartley, R., & Porikli, F. (2018). Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 908-917).
    DOI Scopus170
    2018 Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2018). Identity-preserving face recovery from portraits. In Proceedings 2018 IEEE Winter Conference on Applications of Computer Vision WACV 2018 Vol. 2018-January (pp. 102-111). IEEE.
    DOI Scopus16
    2017 Yu, X., & Porikli, F. (2017). Hallucinating very low-resolution unaligned and noisy face images by transformative discriminative autoencoders. In Proceedings 30th IEEE Conference on Computer Vision and Pattern Recognition CVPR 2017 Vol. 2017-January (pp. 5367-5375).
    DOI Scopus122
    2017 Shiri, F., Yu, X., Koniusz, P., & Porikli, F. (2017). Face Destylization. In DICTA 2017 International Conference on Digital Image Computing Techniques and Applications Vol. 2017-December (pp. 1-8).
    DOI Scopus14

Connect With Me
External Profiles