Xin Yu

School of Computer Science and Information Technology

College of Engineering and Information Technology


I am Xin Yu, an Associate Professor at the University of Adelaide (Dec 2025 – present). My research focuses on computer vision and machine learning, with a particular interest in enabling technologies that improve accessibility and understanding through visual intelligence. I received a PhD in Computer Science from the Australian National University and a PhD in Communication and Information Engineering from Tsinghua University. I am currently a Visiting Faculty Researcher at Google (2024–present) and lead a Visual Intelligence Group at the Australian Institute for Machine Learning (AIML).
 
I have received several awards, including the Australian Research Council (ARC) Discovery Early Career Researcher Award (DECRA, 2023–2025), the Google Research Scholar Program Award (2021), and the Google Inclusion Research Award (2023). I was also honoured with the Queensland Young Tall Poppy Science Award from the Australian Institute of Policy and Science (AIPS) and the CORE Outstanding Research Contribution Award 2026 from the Computing Research and Education Association of Australasia (CORE).

Language Competency
English: Can read, write, speak, understand spoken and peer review

Date Institution name Country Title
2015 - 2019 Australian National University Australia PhD
2009 - 2015 Tsinghua University China PhD

Year Citation
2026 Liu, M., Yu, X., Xu, C., & Song, Y. (2026). Preface. Lecture Notes in Computer Science, 16370 LNAI, v-vi.
2025 Du, X., Sun, H., Lu, M., Zhu, T., & Yu, X. (2025). DreamCar: Leveraging Car-Specific Prior for In-the-Wild 3D Car Reconstruction. IEEE Robotics and Automation Letters, 10(2), 1840-1847.
DOI Scopus1
2025 Ma, Y., Wang, S., Ding, Y., Ma, B., Lv, T., Fan, C., . . . Yu, X. (2025). TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles. IEEE Transactions on Multimedia, 27, 6335-6346.
2025 Wu, H., Zhao, M., Hu, Z., Fan, C., Li, L., Chen, W., . . . Yu, X. (2025). ICE: Interactive 3D Game Character Facial Editing via Dialogue. IEEE Transactions on Multimedia, 27, 3210-3223.
2025 Jiang, W., Zhao, D., Wang, C., Yu, X., Arun, P. V., Asano, Y., . . . Zhou, H. (2025). Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network. Knowledge-Based Systems, 330, 114595.
2024 Zhao, Y., Liu, B., Zhu, T., Ding, M., Yu, X., & Zhou, W. (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585, 127593.
DOI Scopus21 WoS13
2024 Song, X., Liu, C., Zheng, Y., Feng, Z., Li, L., Zhou, K., & Yu, X. (2024). HairStyle Editing via Parametric Controllable Strokes. IEEE Transactions on Visualization and Computer Graphics, 30(7), 3857-3870.
DOI Scopus4 WoS4
2024 Xu, Q., Chen, H., Du, H., Zhang, H., Łukasik, S., Zhu, T., & Yu, X. (2024). M3A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249, 104205.
DOI Scopus5 WoS3
2024 Wang, S., Ma, Y., Ding, Y., Hu, Z., Fan, C., Lv, T., . . . Yu, X. (2024). StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(6), 4331-4347.
DOI Scopus8 WoS3 Europe PMC1
2024 Guan, S., Yu, X., Huang, W., Fang, G., & Lu, H. (2024). DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition. IEEE Transactions on Image Processing, 33, 395-407.
DOI Scopus11 WoS8 Europe PMC2
2024 Choi, S., Hike, D., Pohmann, R., Avdievich, N., Gomez-Cid, L., Man, W., . . . Yu, X. (2024). Alpha-180 spin-echo-based line-scanning method for high-resolution laminar-specific fMRI in animals. Imaging Neuroscience, 2, 30.
DOI Europe PMC1
2024 Sheng, H., Shen, X., Du, H., Zhang, H., Huang, Z., & Yu, X. (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4(4), 877-887.
2024 Du, X., Yu, X., Liu, J., Dai, B., & Xu, F. (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600, 128129.
DOI Scopus6 WoS5
2024 Hu, Z., Tang, J., Li, L., Hou, J., Xin, H., Yu, X., & Bu, J. (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35(1), 19 pages.
DOI Scopus3 WoS1
2024 Fu, H., Yu, X., Li, L., & Zhang, L. (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 26, 9304-9315.
DOI Scopus3 WoS3
2024 Rao, Q., Yu, X., Li, G., & Zhu, L. (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238, 103864.
DOI Scopus5 WoS4
2024 Qi, X., Liu, C., Li, L., Hou, J., Xin, H., & Yu, X. (2024). EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation. IEEE Transactions on Multimedia, 26, 10420-10430.
DOI Scopus10 WoS4
2024 Zhang, W., Li, L., Ding, Y., Chen, W., Deng, Z., & Yu, X. (2024). Detecting Facial Action Units From Global-Local Fine-Grained Expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34(2), 983-994.
DOI Scopus10 WoS3
2024 Liu, C., Li, P., Zhang, H., Li, L., Huang, Z., Wang, D., & Yu, X. (2024). BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge. IEEE Transactions on Multimedia, 26, 10015-10028.
DOI Scopus16 WoS6
2024 Zhou, S., Zhu, T., Ye, D., Yu, X., & Zhou, W. (2024). Boosting Model Inversion Attacks With Adversarial Examples. IEEE Transactions on Dependable and Secure Computing, 21(3), 1451-1468.
DOI Scopus15 WoS10
2024 Zhao, M., Qi, X., Hu, Z., Li, L., Zhang, Y., Huang, Z., & Yu, X. (2024). Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations. IEEE Transactions on Multimedia, 26, 5939-5950.
DOI Scopus4 WoS4
2024 Gao, C., Yang, Q., Kim, M. E., Khairi, N. M., Cai, L. Y., Newlin, N. R., . . . Landman, B. A. (2024). Characterizing patterns of diffusion tensor imaging variance in aging brains. Journal of Medical Imaging, 11(4), 44007.
2024 Choi, S., Hike, D., Pohmann, R., Avdievich, N., Gomez-Cid, L., Man, W., . . . Yu, X. (2024). Alpha-180 spin-echo based line-scanning method for high resolution laminar-specific fMRI. bioRxiv.
2023 Bao, S., Cui, C., Li, J., Tang, Y., Lee, H. H., Deng, R., . . . Huo, Y. (2023). Topological-Preserving Membrane Skeleton Segmentation in Multiplex Immunofluorescence Imaging. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12471, 10 pages.
DOI Scopus1
2023 Yu, X., Yang, Q., Zhou, Y., Cai, L. Y., Gao, R., Lee, H. H., . . . Tang, Y. (2023). UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation. Medical Image Analysis, 90, 102939.
DOI Scopus66 WoS53 Europe PMC30
2023 Ma, Y., Wang, S., Hu, Z., Fan, C., Lv, T., Ding, Y., . . . Yu, X. (2023). StyleTalk: One-Shot Talking Head Generation with Controllable Speaking Styles. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37(2), 1896-1904.
DOI Scopus46 WoS26
2023 Shi, Y., Yu, X., Liu, L., Campbell, D., Koniusz, P., & Li, H. (2023). Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(3), 2682-2697.
DOI Scopus39 WoS24 Europe PMC2
2023 Mao, Y., Wan, Z., Dai, Y., & Yu, X. (2023). Deep Idempotent Network for Efficient Single Image Blind Deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33(1), 172-185.
DOI Scopus35 WoS28
2023 Zeng, H., Zhang, W., Fan, C., Lv, T., Wang, S., Zhang, Z., . . . Yu, X. (2023). FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37(3), 3367-3375.
DOI Scopus3 WoS3
2023 Sheng, H., Yu, X., Wang, F., Khan, M. W., Weng, H., Shariflou, S., & Golzan, S. M. (2023). Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations. Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBS, 2023, 4 pages.
DOI Scopus3 WoS3 Europe PMC1
2023 Xu, Y., Zhou, C., Yu, X., & Yang, Y. (2023). Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection. IEEE Transactions on Image Processing, 32, 1992-2002.
DOI Scopus9 WoS8 Europe PMC2
2023 Yu, X., Tang, Y., Yang, Q., Lee, H. H., Gao, R., Bao, S., . . . Landman, B. A. (2023). Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12464, 7 pages.
DOI Scopus3 WoS2 Europe PMC2
2023 Ramadass, K., Yu, X., Cai, L. Y., Tang, Y., Bao, S., Kerley, C., . . . Landman, B. A. (2023). Deep whole brain segmentation of 7T structural MRI. Progress in Biomedical Optics and Imaging Proceedings of SPIE, 12464, 8 pages.
2022 Chen, X., Jiang, Y., Choi, S., Pohmann, R., Scheffler, K., Kleinfeld, D., & Yu, X. (2022). Erratum: Assessment of single-vessel cerebral blood velocity by phase contrast fMRI (PLoS Biol (2021) 19:9 (e3000923) DOI: 10.1371/journal.pbio.3000923). PLOS Biology, 20(12), e3001951.
2022 Zhang, Y., Tsang, I. W., Luo, Y., Hu, C., Lu, X., & Yu, X. (2022). Recursive Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(8), 4321-4338.
DOI Scopus20 WoS15 Europe PMC3
2022 Pan, L., Hartley, R., Scheerlinck, C., Liu, M., Yu, X., & Dai, Y. (2022). High Frame Rate Video Reconstruction Based on an Event Camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(5), 2519-2533.
DOI Scopus68 WoS67 Europe PMC10
2022 Xu, Y., Yu, X., Zhang, J., Zhu, L., & Wang, D. (2022). Weakly Supervised RGB-D Salient Object Detection with Prediction Consistency Training and Active Scribble Boosting. IEEE Transactions on Image Processing, 31, 2148-2161.
DOI Scopus50 WoS45 Europe PMC4
2022 Zheng, Y., Yu, X., Liu, M., & Zhang, S. (2022). Single-Image Deraining via Recurrent Residual Multiscale Networks. IEEE Transactions on Neural Networks and Learning Systems, 33(3), 1310-1323.
DOI Scopus27 WoS23 Europe PMC6
2022 Zhang, Y., Yu, X., Lu, X., & Liu, P. (2022). Pro-UIGAN: Progressive Face Hallucination From Occluded Thumbnails. IEEE Transactions on Image Processing, 31, 3236-3250.
DOI Scopus17 WoS13 Europe PMC2
2022 Fan, H., Yu, X., Yang, Y., & Kankanhalli, M. (2022). Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 9918-9930.
DOI Scopus40 WoS31 Europe PMC2
2022 Shi, Y., Campbell, D., Yu, X., & Li, H. (2022). Geometry-Guided Street-View Panorama Synthesis From Satellite Imagery. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 10009-10022.
DOI Scopus35 WoS22 Europe PMC1
2022 Ma, F., Wu, Y., Yu, X., & Yang, Y. (2022). Learning With Noisy Labels via Self-Reweighting From Class Centroids. IEEE Transactions on Neural Networks and Learning Systems, 33(11), 6275-6285.
DOI Scopus35 WoS34 Europe PMC3
2022 Han, C., Yu, X., Gao, C., Sang, N., & Yang, Y. (2022). Single image based 3D human pose estimation via uncertainty learning. Pattern Recognition, 132, 108934.
DOI Scopus29 WoS26
2022 Wang, S., Li, L., Ding, Y., & Yu, X. (2022). One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning. Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022, 36(3), 2531-2539.
DOI Scopus95 WoS70
2022 Tang, T., Du, H., Yu, X., & Yang, Y. (2022). Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion. Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022, 36(5), 5422-5430.
DOI Scopus9 WoS4
2022 Fan, H., Zhuo, T., Yu, X., Yang, Y., & Kankanhalli, M. (2022). Understanding Atomic Hand-Object Interaction With Human Intention. IEEE Transactions on Circuits and Systems for Video Technology, 32(1), 275-285.
DOI Scopus25 WoS18
2021 Li, L., Wang, S., Zhang, Z., Ding, Y., Zheng, Y., Yu, X., & Fan, C. (2021). Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 3A(3), 1911-1920.
DOI Scopus55 WoS43
2021 Ding, Y., Yu, X., & Yang, Y. (2021). Modeling the Probabilistic Distribution of Unlabeled Data for One-shot Medical Image Segmentation. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, 2A(2), 1246-1254.
DOI Scopus31 WoS23
2021 Xu, Y., Zhou, C., Yu, X., Xiao, B., & Yang, Y. (2021). Pyramidal Multiple Instance Detection Network with Mask Guided Self-Correction for Weakly Supervised Object Detection. IEEE Transactions on Image Processing, 30, 3029-3040.
DOI Scopus45 WoS38 Europe PMC7
2021 Mächler, P., Broggini, T., Mateo, C., Thunemann, M., Fomin-Thunemann, N., Doran, P. R., . . . Devor, A. (2021). A suite of neurophotonic tools to underpin the contribution of internal brain states in fMRI. Current Opinion in Biomedical Engineering, 18, 11 pages.
DOI Scopus5 WoS4 Europe PMC7
2021 Quan, R., Wu, Y., Yu, X., & Yang, Y. (2021). Progressive transfer learning for face anti-spoofing. IEEE Transactions on Image Processing, 30, 3946-3955.
DOI Scopus64 WoS47 Europe PMC7
2021 Zhang, Y., Tsang, I. W., Li, J., Liu, P., Lu, X., & Yu, X. (2021). Face Hallucination with Finishing Touches. IEEE Transactions on Image Processing, 30, 1728-1743.
DOI Scopus30 WoS24 Europe PMC4
2021 Sobczak, F., Pais-Roldan, P., Takahashi, K., & Yu, X. (2021). Decoding the brain state-dependent relationship between pupil dynamics and resting state fMRI signal fluctuation. eLife, 10, 21 pages.
DOI WoS11
2020 Qian, W., Yu, X., & Qian, C. (2020). Wireless Powered Encoding and Broadcasting of Frequency Modulated Detection Signals. IEEE Access, 8, 200450-200460.
DOI WoS1 Europe PMC1
2020 Yu, X., Porikli, F., Fernando, B., & Hartley, R. (2020). Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks. International Journal of Computer Vision, 128(2), 500-526.
DOI Scopus35 WoS31
2020 Wang, Z., Yu, X., Lu, M., Wang, Q., Qian, C., & Xu, F. (2020). Single image portrait relighting via explicit multiple reflectance channel modeling. ACM Transactions on Graphics, 39(6), 1-13.
DOI Scopus83 WoS70
2020 Drew, P. J., Mateo, C., Turner, K. L., Yu, X., & Kleinfeld, D. (2020). Ultra-slow Oscillations in fMRI and Resting-State Connectivity: Neuronal and Vascular Contributions and Technical Confounds. Neuron, 107(5), 782-804.
DOI Scopus110 WoS100 Europe PMC116
2020 Yu, X., Fernando, B., Hartley, R., & Porikli, F. (2020). Semantic face hallucination: Super-resolving very low-resolution face images with supplementary attributes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(11), 2926-2943.
DOI Scopus43 WoS35 Europe PMC4
2020 Yu, X., Shiri, F., Ghanem, B., & Porikli, F. (2020). Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(9), 2148-2164.
DOI Scopus33 WoS26 Europe PMC6
2020 Qian, W., Yu, X., & Qian, C. (2020). Wireless Reconfigurable RF Detector Array for Focal and Multiregional Signal Enhancement. IEEE Access, 8, 136594-136604.
DOI WoS5 Europe PMC6
2019 Chen, X., Sobczak, F., Chen, Y., Jiang, Y., Qian, C., Lu, Z., . . . Yu, X. (2019). Mapping optogenetically-driven single-vessel fMRI with concurrent neuronal calcium recordings in the rat hippocampus. Nature Communications, 10(1), 12 pages.
DOI Scopus37 WoS60 Europe PMC46
2019 Chen, Y., Pais-Roldan, P., Chen, X., Frosz, M. H., & Yu, X. (2019). MRI-guided robotic arm drives optogenetic fMRI with concurrent Ca2+ recording. Nature Communications, 10(1), 11 pages.
DOI Scopus26 WoS22 Europe PMC24
2019 Pais-Roldán, P., Edlow, B. L., Jiang, Y., Stelzer, J., Zou, M., & Yu, X. (2019). Multimodal assessment of recovery from coma in a rat model of diffuse brainstem tegmentum injury. Neuroimage, 189, 615-630.
DOI Scopus26 WoS20 Europe PMC29
2019 Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2019). Identity-Preserving Face Recovery from Stylized Portraits. International Journal of Computer Vision, 127(6-7), 863-883.
DOI Scopus20 WoS17
2019 Yan, H., Yu, X., Zhang, Y., Zhang, S., Zhao, X., & Zhang, L. (2019). Single Image Depth Estimation with Normal Guided Scale Invariant Deep Convolutional Fields. IEEE Transactions on Circuits and Systems for Video Technology, 29(1), 80-92.
DOI Scopus28 WoS26
2018 He, Y., Wang, M., Chen, X., Pohmann, R., Polimeni, J. R., Scheffler, K., . . . Yu, X. (2018). Ultra-Slow Single-Vessel BOLD and CBV-Based fMRI Spatiotemporal Dynamics and Their Correlation with Neuronal Intracellular Calcium Signals. Neuron, 97(4), 925-939.e5.
DOI Scopus96 WoS95 Europe PMC102
2018 Pais-Roldán, P., Biswal, B., Scheffler, K., & Yu, X. (2018). Identifying respiration-related aliasing artifacts in the rodent resting-state fMRI. Frontiers in Neuroscience, 12(NOV), 14 pages.
DOI Scopus22 WoS22 Europe PMC26
2018 Yu, X., & Porikli, F. (2018). Imagining the Unimaginable Faces by Deconvolutional Networks. IEEE Transactions on Image Processing, 27(6), 2747-2761.
DOI Scopus31 WoS29 Europe PMC7
2018 Wang, M., He, Y., Sejnowski, T. J., & Yu, X. (2018). Brain-state dependent astrocytic Ca2+ signals are coupled to both positive and negative BOLD-fMRI signals. Proceedings of the National Academy of Sciences of the United States of America, 115(7), E1647-E1656.
DOI Scopus84 WoS82 Europe PMC87
2018 Li, L., Zhang, S., Yu, X., & Zhang, L. (2018). PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching. IEEE Transactions on Circuits and Systems for Video Technology, 28(3), 679-692.
DOI Scopus81 WoS71
2017 Chung, S., Jeong, J. H., Ko, S., Yu, X., Kim, Y. H., Isaac, J. T. R., & Koretsky, A. P. (2017). Peripheral Sensory Deprivation Restores Critical-Period-like Plasticity to Adult Somatosensory Thalamocortical Inputs. Cell Reports, 19(13), 2707-2717.
DOI Scopus26 WoS26 Europe PMC24
2017 Li, L., Yu, X., Zhang, S., Zhao, X., & Zhang, L. (2017). 3D Cost aggregation with multiple minimum spanning trees for stereo matching. Applied Optics, 56(12), 3411-3420.
DOI Scopus81 WoS63 Europe PMC8
2017 Yu, X., & Porikli, F. (2017). Face hallucination with tiny unaligned images by transformative discriminative neural networks. 31st AAAI Conference on Artificial Intelligence, AAAI 2017, 31(1), 4327-4333.
DOI Scopus77 WoS56
2016 Pais-Roldán, P., Singh, A. P., Schulz, H., & Yu, X. (2016). High magnetic field induced otolith fusion in the zebrafish larvae. Scientific Reports, 6(1), 11 pages.
DOI Scopus15 WoS13 Europe PMC6
2016 Yu, X., He, Y., Wang, M., Merkle, H., Dodd, S. J., Silva, A. C., & Koretsky, A. P. (2016). Sensory and optogenetically driven single-vessel fMRI. Nature Methods, 13(4), 337-340.
DOI Scopus84 WoS86 Europe PMC93
2015 Yu, X., Zhang, S., Zhao, X., & Zhang, L. (2015). Removing blur kernel noise via a hybrid lp norm. Journal of Electronic Imaging, 24(1), 19 pages.
DOI Scopus4 WoS3
2015 Sui, Y., Zhao, X., Zhang, S., Yu, X., Zhao, S., & Zhang, L. (2015). Self-expressive tracking. Pattern Recognition, 48(9), 2872-2884.
DOI Scopus12 WoS11
2015 Zhang, S., Sui, Y., Zhao, S., Yu, X., & Zhang, L. (2015). Multi-local-task learning with global regularization for object tracking. Pattern Recognition, 48(12), 3881-3894.
DOI Scopus23 WoS20
2015 Zhang, S., Sui, Y., Yu, X., Zhao, S., & Zhang, L. (2015). Hybrid support vector machines for robust object tracking. Pattern Recognition, 48(8), 2474-2488.
DOI Scopus32 WoS29
2015 Zhang, S., Yu, X., Sui, Y., Zhao, S., & Zhang, L. (2015). Object tracking with multi-view support vector machines. IEEE Transactions on Multimedia, 17(3), 265-278.
DOI Scopus113 WoS90
2014 Yu, X., Xu, F., Zhang, S., & Zhang, L. (2014). Efficient patch-wise non-uniform deblurring for a single image. IEEE Transactions on Multimedia, 16(6), 1510-1524.
DOI Scopus50 WoS47
2014 Qian, C., Yu, X., Pothayee, N., Dodd, S., Bouraoud, N., Star, R., . . . Koretsky, A. (2014). Live nephron imaging by MRI. American Journal of Physiology Renal Physiology, 307(10), F1162-F1168.
DOI Scopus17 WoS16 Europe PMC17
2014 Yu, X., Zhao, X., Sui, Y., & Zhang, L. (2014). Handling noise in single image defocus map estimation by using directional filters. Optics Letters, 39(21), 6281-6284.
DOI Scopus4 WoS3 Europe PMC2
2014 Yu, X., & Koretsky, A. P. (2014). Interhemispheric plasticity protects the deafferented somatosensory cortex from functional takeover after nerve injury. Brain Connectivity, 4(9), 709-717.
DOI Scopus16 WoS17 Europe PMC14
2014 Yu, X., Qian, C., Chen, D. Y., Dodd, S. J., & Koretsky, A. P. (2014). Deciphering laminar-specific neural inputs with line-scanning fMRI. Nature Methods, 11(1), 55-58.
DOI Scopus137 WoS135 Europe PMC139
2013 Qian, C., Yu, X., Chen, D. Y., Dodd, S., Bouraoud, N., Pothayee, N., . . . Koretsky, A. (2013). Wireless amplified nuclear MR detector (WAND) for high-spatial-resolution MR imaging of internal organs: Preclinical demonstration in a rodent model. Radiology, 268(1), 228-236.
DOI Scopus37 WoS34 Europe PMC35
2013 Zhang, L., Guo, Y., Sun, J., Yu, X., & Zhang, S. (2013). Object tracking by feedback update scheme with sparsity constraint. Qinghua Daxue Xuebao Journal of Tsinghua University, 53(11), 1531-1535.
2012 Yu, X., Chung, S., Chen, D. Y., Wang, S., Dodd, S. J., Walters, J. R., . . . Koretsky, A. P. (2012). Thalamocortical Inputs Show Post-Critical-Period Plasticity. Neuron, 74(4), 731-742.
DOI Scopus64 WoS65 Europe PMC66
2012 Yu, X., Glen, D., Wang, S., Dodd, S., Hirano, Y., Saad, Z., . . . Koretsky, A. P. (2012). Direct imaging of macrovascular and microvascular contributions to BOLD fMRI in layers IV-V of the rat whisker-barrel cortex. Neuroimage, 59(2), 1451-1460.
DOI Scopus87 WoS84 Europe PMC90
2011 Zhao, X., Yu, X., Sun, L., Hu, K., Wang, G., & Zhang, L. (2011). Non-rigid object tracking as salient region segmentation and association. IEICE Transactions on Information and Systems, E94-D(4), 934-937.
DOI Scopus2 WoS2
2010 Yu, X., Wang, S., Chen, D. Y., Dodd, S., Goloshevsky, A., & Koretsky, A. P. (2010). 3D mapping of somatotopic reorganization with small animal functional MRI. Neuroimage, 49(2), 1667-1676.
DOI Scopus35 WoS32 Europe PMC29

Year Citation
2025 Zhang, H., Xu, J., Tang, T., Sun, H., Yu, X., Huang, Z., & Yu, K. (2025). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. In A. Leonardis, E. Ricci, S. Roth, O. Russakovsky, T. Sattler, & G. Varol (Eds.), Lecture Notes in Computer Science (Vol. 15142 LNCS, pp. 1-19). SPRINGER INTERNATIONAL PUBLISHING AG.
2024 Yu, Q., Du, H., & Yu, X. (2024). A New Perspective of Weakly Supervised 3D Instance Segmentation via Bounding Boxes. In T. Liu, L. Yue, G. Webb, & D. Wang (Eds.), Lecture Notes in Computer Science (Vol. 14471 LNAI, pp. 103-114). SPRINGER-VERLAG SINGAPORE PTE LTD.
DOI Scopus1
2024 Sheng, H., Yu, X., Li, X., & Golzan, M. (2024). Context-Based Masking for Spontaneous Venous Pulsations Detection. In T. Liu, L. Yue, G. Webb, & D. Wang (Eds.), Lecture Notes in Computer Science (Vol. 14471 LNAI, pp. 520-532). SPRINGER-VERLAG SINGAPORE PTE LTD.
DOI Scopus1
2024 Xu, Q., Du, H., Chen, H., Liu, B., & Yu, X. (2024). MMOOC: A Multimodal Misinformation Dataset for Out-of-Context News Analysis. In T. Zhu, & Y. Li (Eds.), Lecture Notes in Computer Science (Vol. 14897 LNCS, pp. 444-459). SPRINGER-VERLAG SINGAPORE PTE LTD.
DOI Scopus4 WoS3
2024 Du, H., Huang, Z., Chapman, S., & Yu, X. (2024). Toward a Unified Framework for RGB and RGB-D Visual Navigation. In T. Liu, L. Yue, G. Webb, & D. Wang (Eds.), Lecture Notes in Computer Science (Vol. 14472 LNAI, pp. 363-375). SPRINGER-VERLAG SINGAPORE PTE LTD.
DOI Scopus2
2024 Yuan, B., Wang, Z., & Yu, X. (2024). Towards Reliable and Efficient Vegetation Segmentation for Australian Wheat Data Analysis. In Z. Bao, R. Borovica-Gajic, R. Qiu, F. Choudhury, & Z. Yang (Eds.), Lecture Notes in Computer Science (Vol. 14386, pp. 119-135). SPRINGER INTERNATIONAL PUBLISHING AG.
2023 Shi, Y., Yu, X., Wang, S., & Li, H. (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. In J. Gall, T. J. Chin, I. Sato, R. Chellappa, & L. Wang (Eds.), Lecture Notes in Computer Science (Vol. 13841 LNCS, pp. 123-141). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus13 WoS6
2023 Wang, M., Lin, B., Guo, X., Li, L., Zhu, Z., Sun, J., . . . Yu, X. (2023). GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework. In L. Wang, J. Gall, T. J. Chin, I. Sato, & R. Chellappa (Eds.), Lecture Notes in Computer Science (Vol. 13844 LNCS, pp. 711-727). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus5 WoS7
2023 Zhou, X. A., Jiang, Y., Man, W., & Yu, X. (2023). Multimodal methods to help interpret resting-state fMRI. In Advances in Resting-State Functional MRI (pp. 207-235). Elsevier.
2023 Fu, H., Liu, C., Qi, X., Lin, B., Li, L., Zhang, L., & Yu, X. (2023). Sign Spotting via Multi-modal Fusion and Testing Time Transferring. In Lecture Notes in Computer Science (Vol. 13808 LNCS, pp. 271-287). Springer Nature Switzerland.
DOI Scopus4
2023 Cai, J., Nguyen, K. N., Shrestha, N., Good, A., Tu, R., Yu, X., . . . Serra, T. (2023). Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions. In A. A. Cire (Ed.), Lecture Notes in Computer Science (Vol. 13884 LNCS, pp. 200-218). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus3 WoS3
2023 Lee, H. H., Liu, Q., Bao, S., Yang, Q., Yu, X., Cai, L. Y., . . . Landman, B. A. (2023). Scaling up 3D Kernels with Bayesian Frequency Re-parameterization for Medical Image Segmentation. In H. Greenspan, A. Madabhushi, P. Mousavi, S. Salcudean, J. Duncan, T. Syeda-Mahmood, & R. Taylor (Eds.), Lecture Notes in Computer Science (Vol. 14223 LNCS, pp. 632-641). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus9 WoS9
2022 Zeng, H., Yu, X., Miao, J., & Yang, Y. (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. In S. Avidan, G. Brostow, M. Cisse, G. M. Farinella, & T. Hassner (Eds.), Lecture Notes in Computer Science (Vol. 13662 LNCS, pp. 1-17). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus8 WoS6
2022 Zhu, F., Yang, Z., Yu, X., Yang, Y., & Wei, Y. (2022). Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation. In S. Avidan, G. Brostow, M. Cisse, G. M. Farinella, & T. Hassner (Eds.), Lecture Notes in Computer Science (Vol. 13689 LNCS, pp. 524-540). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus4 WoS5
2021 Liu, J., & Yu, X. (2021). Few-shot Weighted Style Matching for Glaucoma Detection. In Lecture Notes in Computer Science (Vol. 13069 LNAI, pp. 289-300). Springer International Publishing.
DOI Scopus1
2020 Liu, J., Zou, Z., Ye, X., Tan, X., Ding, E., Xu, F., & Yu, X. (2020). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. In A. Bartoli, & A. Fusiello (Eds.), Lecture Notes in Computer Science (Vol. 12536 LNCS, pp. 707-714). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus8 WoS8
2020 Du, H., Yu, X., & Zheng, L. (2020). Learning Object Relation Graph and Tentative Policy for Visual Navigation. In A. Vedaldi, H. Bischof, T. Brox, & J. M. Frahm (Eds.), Lecture Notes in Computer Science (Vol. 12352 LNCS, pp. 19-34). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus104 WoS85
2018 Yu, X., Fernando, B., Ghanem, B., Porikli, F., & Hartley, R. (2018). Face super-resolution guided by facial component heatmaps. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Lecture Notes in Computer Science (Vol. 11213 LNCS, pp. 219-235). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus48 WoS192
2017 Yu, X. (2017). When Photons Meet Protons: Optogenetics, Calcium Signal Detection, and fMRI in Small Animals. In Small Animal Imaging (pp. 773-791). Springer International Publishing.
2016 Yu, X., & Porikli, F. (2016). Ultra-resolving face images by discriminative generative networks. In B. Leibe, J. Matas, N. Sebe, & M. Welling (Eds.), Lecture Notes in Computer Science (Vol. 9909 LNCS, pp. 318-333). SPRINGER INTERNATIONAL PUBLISHING AG.
DOI Scopus248 WoS216

Year Citation
2025 Chen, H., Zhu, T., Yu, X., & Zhou, W. (2025). Zero-Shot Machine Unlearning with Proxy Adversarial Data Generation. In IJCAI International Joint Conference on Artificial Intelligence (pp. 339-347). International Joint Conferences on Artificial Intelligence Organization.
2025 Zhang, B., Cao, Z., Du, H., Yu, X., Li, X., Liu, J., & Wang, S. (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. In Proceedings 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 (pp. 4957-4967). Tucson, AZ: IEEE Computer Society.
DOI Scopus1
2025 Cao, Z., Zhang, B., Du, H., Yu, X., Li, X., & Wang, S. (2025). FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. In Proceedings 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 (pp. 9226-9236). Tucson, AZ: IEEE Computer Society.
2025 Hu, Z., Zhang, Y., Liu, C., Li, L., Peng, S., Zhou, X., . . . Yu, X. (2025). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. In A. Leonardis, E. Ricci, S. Roth, O. Russakovsky, T. Sattler, & G. Varol (Eds.), Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15131 LNCS (pp. 223-239). ITALY, Milan: SPRINGER INTERNATIONAL PUBLISHING AG.
2025 He, M., Zhang, J., & Yu, X. (2025). Transferable Attacks for Semantic Segmentation. In T. Chen, Y. Cao, Q. V. H. Nguyen, & T. T. Nguyen (Eds.), Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15449 LNCS (pp. 372-388). AUSTRALIA, Gold Coast: SPRINGER-VERLAG SINGAPORE PTE LTD.
2025 Liu, C., Li, P., Yang, L., Wang, D., Li, L., & Yu, X. (2025). Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 28922-28931). IEEE.
2025 Wang, S., Zhang, H., Shen, X., Wang, D., & Yu, X. (2025). Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 22975-22984). IEEE.
2025 Liu, C., Zhang, W., Qiu, F., Li, L., Wang, D., & Yu, X. (2025). Affective Behaviour Analysis via Progressive Learning. In Lecture Notes in Computer Science Vol. 15637 LNCS (pp. 366-379). Springer Nature Switzerland.
2025 Xu, Q., Cao, R., Shen, X., Du, H., Wang, S., & Yu, X. (2025). M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12289-12300). IEEE.
2025 Wang, S., Chen, W., Zhang, W., Zhao, M., Li, L., Zhang, R., . . . Yu, X. (2025). EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 5581-5591). IEEE.
2025 Liu, C., Qiu, F., Zhang, W., Li, L., Wang, D., & Yu, X. (2025). Compound Expression Recognition via Curriculum Learning. In Lecture Notes in Computer Science Vol. 15637 LNCS (pp. 282-293). Springer Nature Switzerland.
2025 Liu, C., Yang, L., Li, P., Wang, D., Li, L., & Yu, X. (2025). Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3131-3141). IEEE.
DOI
2025 Shen, X., Du, H., Xu, M., Liu, M., & Yu, X. (2025). Cross-View Isolated Sign Language Recognition Challenge: Design, Results and Future Research. In WWW Companion 2025 Companion Proceedings of the ACM Web Conference 2025 (pp. 2444-2447). ACM.
DOI
2025 Xu, Q., Du, H., Łukasik, S., Zhu, T., Wang, S., & Yu, X. (2025). MDAM3: A Misinformation Detection and Analysis Framework for Multitype Multimodal Media. In WWW 2025 Proceedings of the ACM Web Conference (pp. 5285-5296). ACM.
DOI Scopus1
2025 Guo, T., Du, H., Huo, H., Liu, B., & Yu, X. (2025). Who is Being Impersonated? Deepfake Audio Detection and Impersonated Identification via Extraction of Id-Specific Features. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15255 LNCS (pp. 301-320). Springer Nature Singapore.
DOI Scopus1
2025 Ying, J., Shen, X., & Yu, X. (2025). Vision-Based Abnormal Action Dataset for Recognising Body Motion Disorders. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 15443 LNAI (pp. 443-455). Springer Nature Singapore.
DOI Scopus1
2024 Qiu, F., Zhang, W., Liu, C., An, R., Li, L., Ding, Y., . . . Yu, X. (2024). FreeAvatar: Robust 3D Facial Animation Transfer by Learning an Expression Foundation Model. In S. N. Spencer (Ed.), Proceedings SIGGRAPH Asia 2024 Conference Papers SA 2024 (11 pages). JAPAN: ASSOC COMPUTING MACHINERY.
DOI Scopus3 WoS2
2024 Wei, T., Chen, Z., & Yu, X. (2024). Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild. In Proceedings of the 6th ACM International Conference on Multimedia in Asia MMAsia 2024 (3 pages). NEW ZEALAND, Auckland: ASSOC COMPUTING MACHINERY.
DOI
2024 Yu, X., Tang, Y., Yang, Q., Lee, H. H., Bao, S., Huo, Y., & Landman, B. A. (2024). Enhancing Hierarchical Transformers for Whole Brain Segmentation with Intracranial Measurements Integration. In B. S. Gimi, & A. Krol (Eds.), Proceedings of SPIE--the International Society for Optical Engineering Vol. 12930 (pp. 129300K). United States: SPIE.
DOI Europe PMC1
2024 Tang, J., Li, L., Qi, X., Chen, Y., Fan, C., & Yu, X. (2024). AS-NeRF: Learning Auxiliary Sampling for Generalizable Novel View Synthesis from Sparse Views. In Proceedings IEEE International Conference on Multimedia and Expo (6 pages). CANADA, Niagara Falls: IEEE.
DOI
2024 Qiu, F., Du, H., Zhang, W., Liu, C., Li, L., Guo, T., & Yu, X. (2024). Learning Transferable Compound Expressions from Masked AutoEncoder Pretraining. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4733-4741). WA, Seattle: IEEE COMPUTER SOC.
DOI Scopus2 WoS2
2024 Zhang, W., Qiu, F., Liu, C., Li, L., Du, H., Guo, T., & Yu, X. (2024). An Effective Ensemble Learning Framework for Affective Behaviour Analysis. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4761-4772). WA, Seattle: IEEE COMPUTER SOC.
DOI Scopus7 WoS2
2024 Wei, T., Chen, Z., Huang, Z., & Yu, X. (2024). Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline. In MM 2024 Proceedings of the 32nd ACM International Conference on Multimedia (pp. 1593-1601). ACM.
DOI Scopus3
2024 Qiu, F., Zhang, W., Liu, C., Li, L., Du, H., Guo, T., & Yu, X. (2024). Language-guided Multi-modal Emotional Mimicry Intensity Estimation. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 4742-4751). WA, Seattle: IEEE COMPUTER SOC.
DOI Scopus3 WoS2
2024 Wu, Y., Meng, Y., Hu, Z., Li, L., Wu, H., Zhou, K., . . . Yu, X. (2024). Text-Guided 3D Face Synthesis - From Generation to Editing. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1260-1269). WA, Seattle: IEEE COMPUTER SOC.
DOI Scopus7 WoS6
2024 Shiri, F., Guo, X. Y., Far, M. G., Yu, X., Haffari, G., & Li, Y. F. (2024). An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models. In EMNLP 2024 2024 Conference on Empirical Methods in Natural Language Processing Proceedings of the Conference (pp. 21440-21455). Association for Computational Linguistics.
DOI Scopus2
2024 Liu, C., Li, P. P., Yu, Q., Sheng, H., Wang, D., Li, L., & Yu, X. (2024). Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 22712-22722). WA, Seattle: IEEE COMPUTER SOC.
DOI Scopus1 WoS1
2024 Hu, Z., Zhao, M., Zhao, C., Liang, X., Li, L., Zhao, Z., . . . Yu, X. (2024). EfficientDreamer: High-Fidelity and Stable 3D Creation via Orthogonal-view Diffusion Priors. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4949-4958). WA, Seattle: IEEE COMPUTER SOC.
DOI Scopus8 WoS6
2024 Yu, Q., Du, H., Liu, C., & Yu, X. (2024). When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision. In Proceedings 2024 IEEE Winter Conference on Applications of Computer Vision WACV 2024 (pp. 3707-3716). HI, Waikoloa: IEEE COMPUTER SOC.
DOI Scopus4 WoS1
2024 Dong, G., Wang, H., Sun, J., & Wang, X. (2024). Evaluating and Mitigating Linguistic Discrimination in Large Language Models: Perspectives on Safety Equity and Knowledge Equity. In Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (pp. 348-356). International Joint Conferences on Artificial Intelligence Organization.
DOI
2024 Chen, H., Liu, Y., Ma, Y., Zheng, N., & Yu, X. (2024). TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning. In Advances in Neural Information Processing Systems Vol. 37.
Scopus2
2024 Chen, H., Zhu, T., Yu, X., & Zhou, W. (2024). Machine Unlearning via Null Space Calibration. In K. Larson (Ed.), IJCAI International Joint Conference on Artificial Intelligence (pp. 358-366). SOUTH KOREA, Jeju: IJCAI-INT JOINT CONF ARTIF INTELL.
Scopus5 WoS2
2024 Shen, X., Du, H., Sheng, H., Wang, S., Chen, H., Chen, H., . . . Yu, X. (2024). MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset. In Advances in Neural Information Processing Systems Vol. 37.
Scopus1
2024 Lim, J. S., Chen, Z., Baktashmotlagh, M., Chen, Z., Yu, X., Huang, Z., & Luo, Y. (2024). DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection. In Advances in Neural Information Processing Systems Vol. 37 (pp. 24795-24821). Neural Information Processing Systems Foundation, Inc. (NeurIPS).
DOI Scopus2
2024 Fang, S., Yu, X., Wang, Z., Li, S., Kirby, R. M., & Zhe, S. (2024). Functional Bayesian Tucker Decomposition for Continuous-Indexed Tensor Data. In 12th International Conference on Learning Representations ICLR 2024.
2024 Li, S., Yu, X., Xing, W., Kirby, R. M., Narayan, A., & Zhe, S. (2024). Multi-Resolution Active Learning of Fourier Neural Operators. In Proceedings of Machine Learning Research Vol. 238 (pp. 2440-2448).
Scopus6
2023 Fang, S., Yu, X., Li, S., Wang, Z., Kirby, R. M., & Zhe, S. (2023). Streaming Factor Trajectory Learning for Temporal Tensor Decomposition. In Advances in Neural Information Processing Systems Vol. 36.
Scopus3
2023 Khan, M. W., Sheng, H., Zhang, H., Du, H., Wang, S., Coroneo, M. T., . . . Yu, X. (2023). RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation. In Advances in Neural Information Processing Systems Vol. 36.
Scopus6
2023 Shen, X., Yuan, S., Sheng, H., Du, H., & Yu, X. (2023). Auslan-Daily: Australian Sign Language Translation for Daily Communication and News. In Advances in Neural Information Processing Systems Vol. 36.
Scopus19
2023 Liu, P., Yu, X., & Zhou, J. T. (2023). Meta Knowledge Condensation for Federated Learning. In 11th International Conference on Learning Representations ICLR 2023.
Scopus8
2023 Tang, J., Li, L., Hou, J., Xin, H., & Yu, X. (2023). A Divide-and-conquer Solution to 3D Human Motion Estimation from Raw MoCap Data. In Proceedings 2023 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops VRW 2023 (pp. 767-768). PEOPLES R CHINA, Shanghai: IEEE COMPUTER SOC.
DOI Scopus1 WoS1
2023 Du, H., Yu, X., Hussain, F., Armin, M. A., Petersson, L., & Li, W. (2023). Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. In Proceedings 2023 IEEE Winter Conference on Applications of Computer Vision WACV 2023 (pp. 4260-4269). HI, Waikoloa: IEEE COMPUTER SOC.
DOI Scopus14 WoS10
2023 Zhao, Y., Liu, B., Ding, M., Liu, B., Zhu, T., & Yu, X. (2023). Proactive Deepfake Defence via Identity Watermarking. In Proceedings 2023 IEEE Winter Conference on Applications of Computer Vision WACV 2023 (pp. 4591-4600). HI, Waikoloa: IEEE COMPUTER SOC.
DOI Scopus42 WoS30
2023 Luo, Y., Chen, Z., Wang, Z., Yu, X., Huang, Z., & Baktashmotlagh, M. (2023). Exploring Active 3D Object Detection from a Generalization Perspective. In 11th International Conference on Learning Representations ICLR 2023.
Scopus19
2023 Rao, Q., Yu, X., Navasardyan, S., & Shi, H. (2023). Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline. In Proceedings 2023 IEEE Winter Conference on Applications of Computer Vision WACV 2023 (pp. 5395-5404). HI, Waikoloa: IEEE COMPUTER SOC.
DOI Scopus6 WoS4
2023 Liu, B., Liu, B., Ding, M., Zhu, T., & Yu, X. (2023). TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection. In Proceedings 2023 IEEE Winter Conference on Applications of Computer Vision WACV 2023 (pp. 4680-4689). HI, Waikoloa: IEEE COMPUTER SOC.
DOI Scopus34 WoS26
2023 Wang, M., Guo, X., Lin, B., Yang, T., Zhu, Z., Li, L., . . . Yu, X. (2023). DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition. In Proceedings of the IEEE International Conference on Computer Vision (pp. 13378-13387). FRANCE, Paris: IEEE COMPUTER SOC.
DOI Scopus43 WoS38
2023 Qi, X., Liu, C., Sun, M., Li, L., Fan, C., & Yu, X. (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 4616-4626). CANADA, Vancouver: IEEE COMPUTER SOC.
DOI Scopus15 WoS14
2023 Wu, H., Hu, Z., Li, L., Zhang, Y., Fan, C., & Yu, X. (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 4295-4304). CANADA, Vancouver: IEEE COMPUTER SOC.
DOI Scopus33 WoS21
2023 Zhang, Y., Wang, Z., Luo, Y., Yu, X., & Huang, Z. (2023). Learning Efficient Unsupervised Satellite Image-based Building Damage Detection. In G. Chen, L. Khan, X. Gao, M. Qiu, W. Pedrycz, & X. Wu (Eds.), Proceedings IEEE International Conference on Data Mining ICDM (pp. 1547-1552). PEOPLES R CHINA, Shanghai: IEEE COMPUTER SOC.
DOI Scopus2 WoS3
2023 Du, H., Li, L., Huang, Z., & Yu, X. (2023). Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 2563-2573). CANADA, Vancouver: IEEE COMPUTER SOC.
DOI Scopus26 WoS21
2023 Shen, C., Lin, B., Zhang, S., Yu, X., Huang, G. Q., & Yu, S. (2023). Gait Recognition with Mask-based Regularization. In 2023 IEEE International Joint Conference on Biometrics IJCB 2023 (10 pages). SLOVENIA, Ljubljana: IEEE.
DOI Scopus5 WoS4
2023 Liu, C., Li, P. P., Qi, X., Zhang, H., Li, L., Wang, D., & Yu, X. (2023). Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics. In MM 2023 Proceedings of the 31st ACM International Conference on Multimedia (pp. 7590-7598). CANADA, Ottawa: ASSOC COMPUTING MACHINERY.
DOI Scopus32 WoS27
2022 Yao, G., Wu, H., Yuan, Y., Li, L., Zhou, K., & Yu, X. (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. In L. DeRaedt (Ed.), IJCAI International Joint Conference on Artificial Intelligence (pp. 1566-1572). AUSTRIA, Vienna: IJCAI-INT JOINT CONF ARTIF INTELL.
DOI Scopus4 WoS3
2022 Li, S., Phillips, J. M., Yu, X., Kirby, R. M., & Zhe, S. (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. In Advances in Neural Information Processing Systems Vol. 35.
Scopus11
2022 Yu, X., Serra, T., Ramalingam, S., & Zhe, S. (2022). The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks. In Proceedings of Machine Learning Research Vol. 162 (pp. 25668-25683).
Scopus39 WoS17
2022 Good, A., Lin, J., Yu, X., Sieg, H., Ferguson, M., Zhe, S., . . . Serra, T. (2022). Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm. In Advances in Neural Information Processing Systems Vol. 35.
Scopus7
2021 Ranade, S., Yu, X., Kakkar, S., Miraldo, P., & Ramalingam, S. (2021). Mapping of Sparse 3D Data Using Alternating Projection. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 12622 LNCS (pp. 295-313). Springer International Publishing.
DOI Scopus1
2021 Serra, T., Yu, X., Kumar, A., & Ramalingam, S. (2021). Scaling Up Exact Neural Network Compression by ReLU Stability. In Advances in Neural Information Processing Systems Vol. 32 (pp. 27081-27093).
Scopus14
2021 Fan, H., Yu, X., Ding, Y., Yang, Y., & Kankanhalli, M. (2021). PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences. In ICLR 2021 9th International Conference on Learning Representations.
Scopus67
2021 Du, H., Yu, X., & Zheng, L. (2021). VTNet: Visual Transformer Network for Object Goal Navigation. In ICLR 2021 9th International Conference on Learning Representations.
Scopus41
2021 Yu, X., Van Baar, J., & Chen, S. (2021). Joint 3D Human Shape Recovery and Pose Estimation from a Single Image with Bilayer Graph. In Proceedings 2021 International Conference on 3D Vision 3DV 2021 (pp. 505-514). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus5 WoS3
2021 Wang, S., Li, L., Ding, Y., Fan, C., & Yu, X. (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. In Z. H. Zhou (Ed.), IJCAI International Joint Conference on Artificial Intelligence (pp. 1098-1105). ELECTR NETWORK: IJCAI-INT JOINT CONF ARTIF INTELL.
DOI Scopus60 WoS54
2021 Ding, Y., Yu, X., & Yang, Y. (2021). RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 3955-3964). ELECTR NETWORK: IEEE.
DOI Scopus138 WoS112
2021 Li, P., Yu, X., & Yang, Y. (2021). Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4449-4459). ELECTR NETWORK: IEEE.
DOI Scopus2
2021 Zeng, H., Dai, Y., Yu, X., Wang, X., & Yang, Y. (2021). PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion. In Proceedings of the IEEE International Conference on Computer Vision (pp. 5580-5589). ELECTR NETWORK: IEEE.
DOI Scopus10 WoS6
2021 Lin, B., Zhang, S., & Yu, X. (2021). Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 14628-14636). ELECTR NETWORK: IEEE.
DOI Scopus273 WoS226
2021 Yang, Z., Yu, X., & Yang, Y. (2021). DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3906-3915). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus45 WoS40
2021 Zhuang, Z., Yu, X., & Mahony, R. (2021). End-to-end Multi-Instance Robotic Reaching from Monocular Vision. In Proceedings IEEE International Conference on Robotics and Automation Vol. 2021-May (pp. 12974-12980). PEOPLES R CHINA, Xian: IEEE.
DOI Scopus1
2021 Kennedy, G., Gao, J., Zhuang, Z., Yu, X., & Mahony, R. (2021). A General Approach to State Refinement. In IEEE International Conference on Intelligent Robots and Systems (pp. 8985-8991). ELECTR NETWORK: IEEE.
DOI
2021 Zhang, J., Fan, D. P., Dai, Y., Yu, X., Zhong, Y., Barnes, N., & Shao, L. (2021). RGB-D Saliency Detection via Cascaded Mutual Information Minimization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4318-4327). ELECTR NETWORK: IEEE.
DOI Scopus111 WoS97
2021 Tang, T., Yu, X., Dong, X., & Yang, Y. (2021). Auto-Navigator: Decoupled neural architecture search for visual navigation. In Proceedings 2021 IEEE Winter Conference on Applications of Computer Vision WACV 2021 (pp. 3742-3751). ELECTR NETWORK: IEEE.
DOI Scopus9 WoS5
2021 Quan, R., Yu, X., Liang, Y., & Yang, Y. (2021). Removing Raindrops and Rain Streaks in One Go. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9143-9152). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus162 WoS136
2021 Shi, Y., Li, H., & Yu, X. (2021). Self-Supervised Visibility Learning for Novel View Synthesis. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9670-9679). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus15 WoS12
2021 Li, D., Xu, C., Zhang, K., Yu, X., Zhong, Y., Ren, W., . . . Li, H. (2021). ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7717-7727). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus56 WoS46
2021 Ben-Shabat, Y., Yu, X., Saleh, F., Campbell, D., Rodriguez Opazo, C., Li, H., & Gould, S. (2021). The IKEA ASM Dataset: Understanding people assembling furniture through actions, objects and pose. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV 2021) (pp. 846-858). virtual online: IEEE.
DOI Scopus95 WoS83
2020 Li, D., Rodriguez Opazo, C., Yu, X., & Li, H. (2020). Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison. In Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (pp. 1448-1458). Snowmass Village, CO, USA: IEEE.
DOI Scopus452 WoS317
2020 Zhuang, Z., Yu, X., & Mahony, R. (2020). LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision. In Proceedings IEEE International Conference on Robotics and Automation (pp. 8331-8337). ELECTR NETWORK: IEEE.
DOI Scopus8 WoS7
2020 Zheng, Z., Jiang, M., Wang, Z., Wang, J., Bai, Z., Zhang, X., . . . Ding, E. (2020). Going beyond real data: A robust visual representation for vehicle re-identification. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Vol. 2020-June (pp. 2550-2558). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus49 WoS32
2020 Shi, Y., Yu, X., Campbell, D., & Li, H. (2020). Where am I looking at? Joint location and orientation estimation by cross-view matching. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4063-4071). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus194 WoS153
2020 Zhang, J., Yu, X., Li, A., Song, P., Liu, B., & Dai, Y. (2020). Weakly-Supervised Salient Object Detection via Scribble Annotations. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12543-12552). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus296 WoS270
2020 Li, D., Yu, X., Xu, C., Petersson, L., & Li, H. (2020). Transferring Cross-Domain Knowledge for Video Sign Language Recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 6204-6213). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus121 WoS97
2020 Zhang, Y., Tsang, I. W., Luo, Y., Hu, C. H., Lu, X., & Yu, X. (2020). Copy and Paste GAN: Face Hallucination from Shaded Thumbnails. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7353-7362). ELECTR NETWORK: IEEE COMPUTER SOC.
DOI Scopus30 WoS29
2020 Yu, X., Zhuang, Z., Koniusz, P., & Li, H. (2020). 6DoF Object Pose Estimation via Differentiable Proxy Voting Regularizer. In 31st British Machine Vision Conference BMVC 2020.
Scopus18
2020 Li, D., Xu, C., Yu, X., Zhang, K., Swift, B., Suominen, H., & Li, H. (2020). TSPNet: Hierarchical feature learning via temporal semantic pyramid for sign language translation. In Advances in Neural Information Processing Systems Vol. 2020-December.
Scopus106
2020 Li, P., Dong, X., Yu, X., & Yang, Y. (2020). When Humans Meet Machines: Towards Efficient Segmentation Networks. In 31st British Machine Vision Conference BMVC 2020.
Scopus27
2020 Zheng, Y., Yu, X., Liu, M., & Zhang, S. (2020). Residual multiscale based single image deraining. In 30th British Machine Vision Conference 2019 BMVC 2019.
Scopus23
2020 Shi, Y., Yu, X., Liu, L., Zhang, T., & Li, H. (2020). Optimal feature transport for cross-view image geo-localization. In AAAI 2020 34th AAAI Conference on Artificial Intelligence (pp. 11990-11997).
Scopus162
2019 Shi, Y., Liu, L., Yu, X., & Li, H. (2019). Spatial-aware feature aggregation for cross-view image based geo-localization. In Advances in Neural Information Processing Systems Vol. 32.
Scopus210 WoS130
2019 Pan, L., Scheerlinck, C., Yu, X., Hartley, R., Liu, M., & Dai, Y. (2019). Bringing a blurry frame alive at high frame-rate with an event camera. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 6813-6822). CA, Long Beach: IEEE.
DOI Scopus246 WoS219
2019 Yu, X., Tian, Y., Porikli, F., Hartley, R., Li, H., Heijnen, H., & Balntas, V. (2019). Unsupervised extraction of local image descriptors via relative distance ranking loss. In Proceedings 2019 International Conference on Computer Vision Workshop ICCVW 2019 (pp. 2893-2902). SOUTH KOREA, Seoul: IEEE COMPUTER SOC.
DOI Scopus23 WoS21
2019 Tian, Y., Yu, X., Fan, B., Wu, F., Heijnen, H., & Balntas, V. (2019). SOSNet: Second order similarity regularization for local descriptor learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 11008-11017). CA, Long Beach: IEEE.
DOI Scopus359 WoS302
2019 Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2019). Recovering faces from portraits with auxiliary facial attributes. In Proceedings 2019 IEEE Winter Conference on Applications of Computer Vision WACV 2019 (pp. 406-415). HI, Waikoloa Village: IEEE.
DOI Scopus12 WoS10
2018 Shiri, F., Yu, X., Porikli, F., Hartley, R., & Koniusz, P. (2018). Identity-preserving face recovery from portraits. In Proceedings 2018 IEEE Winter Conference on Applications of Computer Vision WACV 2018 Vol. 2018-January (pp. 102-111). NV: IEEE.
DOI Scopus16 WoS13
2018 Yu, X., Yu, Z., & Ramalingam, S. (2018). Learning Strict Identity Mappings in Deep Residual Networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 4432-4440). UT, Salt Lake City: IEEE.
DOI Scopus60 WoS35
2018 Yu, X., Fernando, B., Hartley, R., & Porikli, F. (2018). Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 908-917). UT, Salt Lake City: IEEE.
DOI Scopus170 WoS144
2018 Yu, X., Chaturvedi, S., Feng, C., Taguchi, Y., Lee, T. Y., Fernandes, C., & Ramalingam, S. (2018). VLASE: Vehicle Localization by Aggregating Semantic Edges. In IEEE International Conference on Intelligent Robots and Systems (pp. 3196-3203). IEEE.
DOI Scopus42
2017 Paul, D., Li, F., Teja, M. K., Yu, X., & Frost, R. (2017). Compass: Spatio temporal sentiment analysis of US election what Twitter says! In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Vol. Part F129685 (pp. 1585-1594). ACM.
DOI Scopus59
2017 Yu, X., & Porikli, F. (2017). Hallucinating very low-resolution unaligned and noisy face images by transformative discriminative autoencoders. In Proceedings 30th IEEE Conference on Computer Vision and Pattern Recognition CVPR 2017 Vol. 2017-January (pp. 5367-5375). HI, Honolulu: IEEE.
DOI Scopus122 WoS98
2017 Shiri, F., Yu, X., Koniusz, P., & Porikli, F. (2017). Face Destylization. In DICTA 2017 2017 International Conference on Digital Image Computing Techniques and Applications Vol. 2017-December (pp. 1-8). IEEE.
DOI Scopus14
