Prof Chunhua Shen
Professor
School of Computer Science and Information Technology
College of Engineering and Information Technology
N/A
Chunhua Shen is no longer supervising new HDR students as of Nov. 2021.
He was an Adjunct Professor of Data Science and AI at Faculty of Information Technology, Monash University.
| Year | Citation |
|---|---|
| 2026 | Zhao, C., Ding, G., Wang, W., Yang, Z., Liu, Z., Chen, H., & Shen, C. (2026). FreerCustom: Training-Free Multi-Concept Customization for Image and Video Generation. International Journal of Computer Vision, 134(1). |
| 2026 | Li, H., Wu, J., Liu, D., Wu, L. Y., Chen, H., & Shen, C. (2026). Accurate Industrial Anomaly Detection and Localization Using Weakly-Supervised Residual Transformers. IEEE Transactions on Image Processing, 35, 1551-1566. |
| 2026 | Jing, C., Zhang, H., Lu, J., Liu, Y., Chen, H., Zhang, X., & Shen, C. (2026). Multi-Modal Primitive Retrieval for Compositional Zero-Shot Learning. International Journal of Computer Vision, 134(3). |
| 2025 | Ge, Y., Wang, W., Chen, Y., Wang, F., Yang, L., Chen, H., & Shen, C. (2025). Diffusion Models are Efficient Data Generators for Human Mesh Recovery. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1-14. |
| 2025 | Zhu, H., Yang, H., Wu, X., Huang, D., Zhang, S., He, X., . . . Ouyang, W. (2025). PonderV2: Improved 3D Representation with A Universal Pre-training Paradigm. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(8), 6550-6565. Scopus1 |
| 2025 | Wu, L. Y., Li, B., Wang, H., Shen, C., Mora, B., Chen, C., & Xie, X. (2025). Editorial: Special Section on Intelligent Network Video Advances Based on Transformers. Big Data Mining and Analytics, 8(3), 519. |
| 2025 | Wu, W., Li, Z., He, Y., Shou, M. Z., Shen, C., Cheng, L., . . . Zhang, D. (2025). Paragraph-to-Image Generation with Information-Enriched Diffusion Model. International Journal of Computer Vision, 133(8), 5413-5434. Scopus4 |
| 2025 | Liu, Y., Huang, M., Yan, H., Deng, L., Wu, W., Lu, H., . . . Bai, X. (2025). VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-Domain Generalization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(4), 2957-2972. Scopus2 Europe PMC1 |
| 2025 | Liu, Y., Zhu, M., Chen, H., Wang, X., Feng, B., Wang, H., . . . Shen, C. (2025). Segment Anything in Context with Vision Foundation Models. International Journal of Computer Vision, 133(10), 7460-7485. Scopus1 WoS1 |
| 2025 | Wang, Q., Liu, L., Jing, C., Wang, P., Zhang, Y., & Shen, C. (2025). Learning Dual-Stream Conditional Concepts in Compositional Zero-Shot Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(11), 1-18. |
| 2024 | Wu, W., Cai, Y., Shen, C., Zhang, D., Fu, Y., Zhou, H., & Luo, P. (2024). End-to-End Video Text Spotting with Transformer. International Journal of Computer Vision, 132(9), 4019-4035. Scopus10 |
| 2024 | Wang, W., Zhao, C., Chen, H., Chen, Z., Zheng, K., & Shen, C. (2024). AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts. International Journal of Computer Vision, 133(6), 3083-3104. |
| 2024 | Li, H., Hu, J., Li, B., Chen, H., Zheng, Y., & Shen, C. (2024). Target before Shooting: Accurate Anomaly Detection and Localization under One Millisecond via Cascade Patch Retrieval. IEEE Transactions on Image Processing, 33, 5606-5621. Scopus13 Europe PMC2 |
| 2024 | Sun, L., Bian, J. -W., Zhan, H., Yin, W., Reid, I., & Shen, C. (2024). SC-DepthV3: Robust Self-Supervised Monocular Depth Estimation for Dynamic Scenes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(1), 497-508. Scopus56 WoS45 Europe PMC11 |
| 2024 | Xian, K., Cao, Z., Shen, C., & Lin, G. (2024). Towards Robust Monocular Depth Estimation: A New Baseline and Benchmark. International Journal of Computer Vision, 132(7), 2401-2419. Scopus13 |
| 2024 | Liu, Y., Wang, X., Zhu, M., Cao, Y., Huang, T., & Shen, C. (2024). Masked Channel Modeling for Bootstrapping Visual Pre-training. International Journal of Computer Vision, 133(2), 760-780. Scopus5 WoS7 |
| 2024 | Hu, M., Yin, W., Zhang, C., Cai, Z., Long, X., Chen, H., . . . Shen, S. (2024). Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(12), 10579-10596. Scopus68 WoS51 Europe PMC10 |
| 2024 | Li, R., Zhang, C., Wang, Z., Shen, C., & Lin, G. (2024). Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(12), 1-16. Scopus7 WoS5 Europe PMC1 |
| 2024 | Yin, W., Liu, Y., Shen, C., Sun, B., & van den Hengel, A. (2024). Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings. International Journal of Computer Vision, 132(9), 4036-4051. Scopus4 WoS4 |
| 2023 | Zhang, S., Sun, X., Chen, H., Li, B., & Shen, C. (2023). RGM: A Robust Generalizable Matching Model. |
| 2023 | Lin, M., Chen, M., Zhang, Y., Shen, C., Ji, R., & Cao, L. (2023). Super Vision Transformer. International Journal of Computer Vision, 131(12), 3136-3151. Scopus28 |
| 2023 | Zhang, B., Liu, L., Phan, M. H., Tian, Z., Shen, C., & Liu, Y. (2023). SegViT v2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers. International Journal of Computer Vision, 132(4), 1126-1147. Scopus30 WoS25 |
| 2023 | Xie, Y., Zhang, J., Xia, Y., & Shen, C. (2023). Learning from partially labeled data for multi-organ and tumor segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(12), 14905-14919. Scopus38 WoS35 Europe PMC17 |
| 2023 | Liu, Y., Zhang, J., Peng, D., Huang, M., Wang, X., Tang, J., . . . Jin, L. (2023). SPTS v2: Single-Point Scene Text Spotting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(12), 1-15. Scopus41 WoS37 Europe PMC4 |
| 2023 | Zhu, M., Li, H., Chen, H., Fan, C., Mao, W., Jing, C., . . . Shen, C. (2023). SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning. |
| 2023 | Ge, Y., Zhou, Q., Wang, X., Shen, C., Wang, Z., & Li, H. (2023). Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37, 667-675. Scopus21 |
| 2023 | Sai, N., Bockman, J. P., Chen, H., Watson-Haigh, N., Xu, B., Feng, X., . . . Gilliham, M. (2023). StomaAI: an efficient and user-friendly tool for measurement of stomatal pores and density using deep computer vision. New Phytologist, 238(2), 904-915. Scopus19 WoS18 Europe PMC14 |
| 2023 | Xi, Y., Chen, H., Wang, N., Wang, P., Zhang, Y., Shen, C., & Liu, Y. (2023). A Dynamic Feature Interaction Framework for Multi-task Visual Perception. International Journal of Computer Vision, 131(11), 2977-2993. Scopus11 WoS10 |
| 2023 | Wang, X., Zhang, R., Shen, C., & Kong, T. (2023). DenseCL: A simple framework for self-supervised dense visual pre-training. Visual Informatics, 7(1), 30-40. Scopus13 WoS13 |
| 2023 | Liu, Y., Shu, C., Wang, J., & Shen, C. (2023). Structured Knowledge Distillation for Dense Prediction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(6), 7035-7049. Scopus111 WoS122 Europe PMC24 |
| 2023 | Liu, J., Zhuang, B., Chen, P., Shen, C., Cai, J., & Tan, M. (2023). Single-path Bit Sharing for Automatic Loss-aware Model Compression.. IEEE transactions on pattern analysis and machine intelligence, PP(10), 1-14. Scopus9 WoS8 Europe PMC3 |
| 2023 | Yan, Y., Shu, Y., Chen, S., Xue, J. H., Shen, C., & Wang, H. (2023). SPL-Net: Spatial-Semantic Patch Learning Network for Facial Attribute Recognition with Limited Labeled Data. International Journal of Computer Vision, 131(8), 2097-2121. Scopus5 |
| 2022 | Sun, L., Yin, W., Xie, E., Li, Z., Sun, C., & Shen, C. (2022). Improving Monocular Visual Odometry Using Learned Depth. IEEE Transactions on Robotics, 38(5), 3173-3186. Scopus35 WoS31 |
| 2022 | Yin, W., Zhang, J., Wang, O., Niklaus, S., Chen, S., Liu, Y., & Shen, C. (2022). Towards Accurate Reconstruction of 3D Scene Shape from A Single Monocular Image. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5), 1-21. Scopus36 WoS33 Europe PMC8 |
| 2022 | Zhang, C., Cai, Y., Lin, G., & Shen, C. (2022). DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5), 1-17. Scopus133 WoS113 Europe PMC18 |
| 2022 | Tian, Z., Chu, X., Wang, X., Wei, X., & Shen, C. (2022). Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images. Advances in Neural Information Processing Systems, 35, 1-14. Scopus131 |
| 2022 | Pang, G., Aggarwal, C., Shen, C., & Sebe, N. (2022). Editorial Deep Learning for Anomaly Detection. IEEE Transactions on Neural Networks and Learning Systems, 33(6), 2282-2286. Scopus14 WoS12 |
| 2022 | Mei, T., Corso, J. J., Kim, G., Luo, J., Shen, C., & Zhang, H. (2022). Guest Editorial Introduction to the Special Section on Video and Language. IEEE Transactions on Circuits and Systems for Video Technology, 32(1), 1-4. Scopus3 |
| 2022 | Cai, Y., Liu, Y., Shen, C., Jin, L., Li, Y., & Ergu, D. (2022). Arbitrarily shaped scene text detection with dynamic convolution. Pattern Recognition, 127, 108608-1-108608-11. Scopus37 WoS31 |
| 2022 | Chen, P., Zhang, M., Shen, Y., Sheng, K., Gao, Y., Sun, X., . . . Shen, C. (2022). Efficient Decoder-Free Object Detection with Transformers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13670, 70-86. Scopus24 |
| 2022 | Cheng, L., Fang, P., Liang, Y., Zhang, L., Shen, C., & Wang, H. (2022). TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency. IEEE Transactions on Image Processing, 31, 2529-2540. Scopus14 Europe PMC3 |
| 2022 | Lu, H., Dai, Y., Shen, C., & Xu, S. (2022). Index Networks. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 44(1), 242-255. Scopus39 WoS39 Europe PMC9 |
| 2022 | Liu, Y., Shen, C., Jin, L., He, T., Chen, P., Liu, C., & Chen, H. (2022). ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), 1-17. Scopus142 WoS98 Europe PMC9 |
| 2022 | Zhao, Y., Yu, X., Gao, Y., & Shen, C. (2022). Learning discriminative region representation for person retrieval. Pattern Recognition, 121, 10 pages. Scopus23 WoS19 |
| 2022 | Zhou, Y., Ji, R., Sun, X., Su, J., Meng, D., Gao, Y., & Shen, C. (2022). Plenty is Plague: Fine-Grained Learning for Visual Question Answering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(2), 697-709. Scopus15 WoS12 |
| 2022 | Cui, Y., Guo, D., Shao, Y., Wang, Z., Shen, C., Zhang, L., & Chen, S. (2022). Joint Classification and Regression for Visual Tracking with Fully Convolutional Siamese Networks. International Journal of Computer Vision, 130(2), 550-566. Scopus52 |
| 2022 | Cao, J., Guo, Y., Wu, Q., Shen, C., Huang, J., & Tan, M. (2022). Improving Generative Adversarial Networks With Local Coordinate Coding. IEEE transactions on pattern analysis and machine intelligence, 44(1), 211-227. Scopus16 WoS12 Europe PMC2 |
| 2022 | Xu, G., Yin, W., Chen, H., Shen, C., Cheng, K., Wu, F., & Zhao, F. (2022). Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth. |
| 2022 | Yin, W., Liu, Y., Shen, C., Sun, B., & Hengel, A. V. D. (2022). Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings. |
| 2022 | Sai, N., Bockman, J. P., Chen, H., Watson-Haigh, N., Xu, B., Feng, X., . . . Gilliham, M. (2022). SAI: Fast and automated quantification of stomatal parameters on microscope images. Europe PMC3 |
| 2022 | Wang, L., Zhang, H., Xiao, Q., Xu, H., Shen, C., & Jin, X. (2022). Effective Eyebrow Matting with Domain Adaptation. Computer Graphics Forum, 41(7), 347-358. Scopus2 WoS2 |
| 2021 | Xie, Y., Zhang, J., Liao, Z., Verjans, J., Shen, C., & Xia, Y. (2021). Intra- and Inter-pair Consistency for Semi-supervised Gland Segmentation.. IEEE transactions on image processing : a publication of the IEEE Signal Processing Society, 31, 894-905. Scopus38 WoS34 Europe PMC14 |
| 2021 | Bian, J. W., Zhan, H., Wang, N., Chin, T. J., Shen, C., & Reid, I. (2021). Auto-Rectify Network for Unsupervised Indoor Depth Estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(12), 12 pages. Scopus63 WoS54 Europe PMC13 |
| 2021 | Liu, Y., He, T., Chen, H., Wang, X., Luo, C., Zhang, S., . . . Jin, L. (2021). Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection. International Journal of Computer Vision, 129(6), 1972-1992. Scopus29 WoS25 |
| 2021 | Yan, Q., Gong, D., Shi, J. Q., van den Hengel, A., Shen, C., Reid, I., & Zhang, Y. (2021). Dual-attention-guided network for ghost-free high dynamic range imaging. International Journal of Computer Vision, 130(1), 19 pages. Scopus45 WoS37 |
| 2021 | Yin, W., Liu, Y., & Shen, C. (2021). Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth Prediction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(10), 13 pages. Scopus44 WoS45 Europe PMC6 |
| 2019 | Teney, D., Wang, P., Cao, J., Liu, L., Shen, C., & Hengel, A. V. D. (2019). V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices.. CoRR, abs/1907.12271, 12071-12078. WoS13 |
| 2017 | Cao, Y., Shen, C., & Shen, H. T. (2017). Exploiting depth from single monocular images for object detection and semantic segmentation. IEEE Transactions on Image Processing, 26(2), 836-846. Scopus63 WoS51 Europe PMC8 |
| Year | Citation |
|---|---|
| 2025 | Ge, Y., Xie, K., Xu, G., Ke, L., Liu, M., Huang, L., . . . Shen, C. (2025). Generative Video Matting. In S. N. Spencer (Ed.), Proceedings SIGGRAPH 2025 Conference Papers (pp. 10 pages). CANADA, Vancouver: ASSOC COMPUTING MACHINERY. DOI |
| 2025 | Zhu, M., Tian, Y., Chen, H., Zhou, C., Guo, Q., Liu, Y., . . . Shen, C. (2025). SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3686-3696). IEEE. DOI |
| 2025 | Jing, C., Liu, M., Chen, H., Xi, Y., Bu, X., Gong, D., & Shen, C. (2025). Seeing the Unseen: Composing Outliers for Compositional Zero-Shot Learning. In Ijcai International Joint Conference on Artificial Intelligence (pp. 1278-1286). International Joint Conferences on Artificial Intelligence Organization. DOI |
| 2025 | Liu, K., Chen, H., & Shen, C. (2025). Physics Aware Neural Networks for Unsupervised Binding Energy Prediction. In Proceedings of Machine Learning Research Vol. 267 (pp. 38169-38187). |
| 2024 | Jiang, P. T., Yang, Y., Cao, Y., Hou, Q., Cheng, M. M., & Shen, C. (2024). Traffic Scene Parsing Through the TSP6K Dataset. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 21874-21885). Seattle, Washington: IEEE. DOI Scopus4 |
| 2024 | Zhu, M., Liu, Y., Luo, Z., Jing, C., Chen, H., Xu, G., . . . Shen, C. (2024). Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation. In Advances in Neural Information Processing Systems Vol. 37. Vancouver: Neural information processing systems foundation. Scopus7 |
| 2024 | Liu, Y., Jing, C., Li, H., Zhu, M., Chen, H., Wang, X., & Shen, C. (2024). A Simple Image Segmentation Framework via In-Context Examples. In Advances in Neural Information Processing Systems Vol. 37. Scopus4 |
| 2024 | Zhu, M., Fan, C., Chen, H., Liu, Y., Mao, W., Xu, X., & Shen, C. (2024). Generative Active Learning for Long-tailed Instance Segmentation. In Proceedings of Machine Learning Research Vol. 235 (pp. 62349-62368). Vienna: ML Research Press. Scopus4 |
| 2024 | Liu, K., Mao, W., Shen, S., Jiao, X., Sun, Z., Chen, H., & Shen, C. (2024). Floating Anchor Diffusion Model for Multi-motif Scaffolding. In Proceedings of Machine Learning Research Vol. 235 (pp. 31691-31708). Vienna: ML Research Press. |
| 2024 | Zhang, M., Chen, H., Shen, C., Yang, Z., Ou, L., Yu, X., & Zhuang, B. (2024). LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 3013-3026). Hybrid, Bangkok: Association for Computational Linguistics (ACL). DOI Scopus24 |
| 2024 | Chen, D., Zhou, Z., Wang, C., Shen, C., & Lyu, S. (2024). On the Trajectory Regularity of ODE-based Diffusion Sampling. In Proceedings of Machine Learning Research Vol. 235 (pp. 7905-7934). Vienna: ML Research Press. Scopus4 |
| 2024 | Ying, K., Zhong, Q., Mao, W., Wang, Z., Chen, H., Wu, L. Y., . . . Shen, C. (2024). CTVIS: Consistent Training for Online Video Instance Segmentation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 899-908). Paris, France: IEEE. DOI Scopus54 WoS39 |
| 2024 | Wang, W., Ge, Y., Mei, H., Cai, Z., Sun, Q., Wang, Y., . . . Komura, T. (2024). Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction. In Proceedings of the IEEE International Conference on Computer Vision (pp. 3902-3912). Online: IEEE. DOI Scopus24 WoS12 |
| 2024 | Wang, X., Zhang, X., Cao, Y., Wang, W., Shen, C., & Huang, T. (2024). SegGPT: Towards Segmenting Everything in Context. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1130-1140). Paris, France: IEEE. DOI Scopus130 |
| 2024 | Jing, C., Li, Y., Chen, H., & Shen, C. (2024). Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 2652-2660). Online: Association for the Advancement of Artificial Intelligence (AAAI). DOI Scopus19 |
| 2024 | Wang, J., Cui, Y., Guo, D., Li, J., Liu, Q., & Shen, C. (2024). PointAttN: You Only Need Attention for Point Cloud Completion. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 5472-5480). Vancouver, Canada: Association for the Advancement of Artificial Intelligence (AAAI). DOI Scopus66 |
| 2024 | Zhao, Y., Ye, Q., Wu, W., Shen, C., & Wan, F. (2024). Generative Prompt Model for Weakly Supervised Object Localization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 6328-6338). Online: IEEE. DOI Scopus26 |
| 2024 | Wu, W., Zhao, Y., Shou, M. Z., Zhou, H., & Shen, C. (2024). DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1206-1217). Online: IEEE. DOI Scopus138 |
| 2024 | Mao, W., Zhu, M., Sun, Z., Shen, S., Wu, L. Y., Chen, H., & Shen, C. (2024). DE NOVO PROTEIN DESIGN USING GEOMETRIC VECTOR FIELD NETWORKS. In 12th International Conference on Learning Representations, ICLR 2024. Hybrid, Vienna: International Conference on Learning Representations, ICLR. Scopus5 |
| 2024 | Liu, Y., Zhu, M., Li, H., Chen, H., Wang, X., & Shen, C. (2024). MATCHER: SEGMENT ANYTHING WITH ONE SHOT USING ALL-PURPOSE FEATURE MATCHING. In 12th International Conference on Learning Representations, ICLR 2024. Online: International Conference on Learning Representations, ICLR. Scopus23 |
| 2024 | Yang, Z., Ding, G., Wang, W., Chen, H., Zhuang, B., & Shen, C. (2024). OBJECT-AWARE INVERSION AND REASSEMBLY FOR IMAGE EDITING. In 12th International Conference on Learning Representations, ICLR 2024. Online: International Conference on Learning Representations, ICLR. Scopus4 |
| 2023 | Chu, X., Tian, Z., Zhang, B., Wang, X., & Shen, C. (2023). CONDITIONAL POSITIONAL ENCODINGS FOR VISION TRANSFORMERS. In 11th International Conference on Learning Representations, ICLR 2023. Kigali: International Conference on Learning Representations, ICLR. Scopus139 |
| 2023 | Xu, G., Yin, W., Chen, H., Shen, C., Cheng, K., & Zhao, F. (2023). FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models. In Proceedings of the IEEE International Conference on Computer Vision (pp. 9276-9286). Paris, France: IEEE. DOI Scopus15 |
| 2023 | Zhu, M., Li, H., Chen, H., Fan, C., Mao, W., Jing, C., . . . Shen, C. (2023). SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning. In Proceedings of the IEEE International Conference on Computer Vision (pp. 999-1008). FRANCE, Paris: IEEE COMPUTER SOC. DOI Scopus20 WoS16 |
| 2023 | Yin, W., Zhang, C., Chen, H., Cai, Z., Yu, G., Wang, K., . . . Shen, C. (2023). Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image. In Proceedings of the IEEE International Conference on Computer Vision (pp. 9009-9019). Paris, France: IEEE. DOI Scopus157 |
| 2023 | Zhang, C., Yin, W., Yu, G., Wang, Z., Chen, T., Fu, B., . . . Shen, C. (2023). Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering. In Proceedings of the IEEE International Conference on Computer Vision (pp. 8917-8927). Paris, France: IEEE. DOI Scopus6 |
| 2023 | Pang, G., Shen, C., Jin, H., & van den Hengel, A. (2023). Deep Weakly-supervised Anomaly Detection. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 13 pages). Online: ACM. DOI Scopus93 WoS67 |
| 2023 | Qin, Y., Chen, X., Chen, C., Shen, Y., Ren, B., Gu, Y., . . . Shen, C. (2023). FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning. In B. Williams, Y. Chen, & J. Neville (Eds.), Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 Vol. 37 (pp. 2101-2109). Washington DC, USA: PKP Publishing Services Network. DOI Scopus3 |
| 2023 | Zhuang, B., Liu, J., Pan, Z., He, H., Weng, Y., & Shen, C. (2023). A Survey on Efficient Training of Transformers. In IJCAI : proceedings of the conference / sponsored by the International Joint Conferences on Artificial Intelligence Vol. 2023-August (pp. 6823-6831). Macao, S.A.R: International Joint Conferences on Artificial Intelligence. DOI Scopus19 |
| 2023 | Wang, Q., Liu, L., Jing, C., Chen, H., Liang, G., Wang, P., & Shen, C. (2023). Learning Conditional Attributes for Compositional Zero-Shot Learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 11197-11206). Online: IEEE. DOI Scopus63 WoS60 |
| 2023 | Wang, X., Wang, W., Cao, Y., Shen, C., & Huang, T. (2023). Images Speak in Images: A Generalist Painter for In-Context Visual Learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 6830-6839). Vancouver, BC, Canada: IEEE. DOI Scopus221 |
| 2023 | Ge, Y., Zhou, Q., Wang, X., Shen, C., Wang, Z., & Li, H. (2023). Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations. In B. Williams, Y. Chen, & J. Neville (Eds.), THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1 (pp. 667-675). DC, Washington: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. WoS19 |
| 2023 | Zhang, J., Chen, C., Li, B., Lyu, L., Wu, S., Ding, S., . . . Wu, C. (2023). DENSE: Data-Free One-Shot Federated Learning. In Advances in Neural Information Processing Systems Vol. 35. USA: Neural information processing systems foundation. Scopus118 |
| 2023 | Zhang, C., Yin, W., Wang, Z., Yu, G., Fu, B., & Shen, C. (2023). Hierarchical Normalization for Robust Monocular Depth Estimation. In Advances in Neural Information Processing Systems Vol. 35. USA: Neural information processing systems foundation. Scopus31 |
| 2023 | Gao, Y., Liu, J., Xu, Z., Zhang, J., Li, K., & Shen, C. (2023). PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining. In Advances in Neural Information Processing Systems Vol. 35. USA: Neural information processing systems foundation. Scopus76 |
| 2022 | Zhang, B., Tian, Z., Tang, Q., Chu, X., Wei, X., Shen, C., & Liu, Y. (2022). SegViT: Semantic Segmentation with Plain Vision Transformers. In Advances in Neural Information Processing Systems Vol. 35 (pp. 12 pages). Online: Neural information processing systems foundation. Scopus125 |
| 2022 | Long, A., Yin, W., Ajanthan, T., Nguyen, V., Purkait, P., Garg, R., . . . Van Den Hengel, A. (2022). Retrieval Augmented Classification for Long-Tail Visual Recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 6949-6959). Online: IEEE. DOI Scopus112 WoS78 |
| 2022 | Gao, Y., Zhuang, J. X., Lin, S., Cheng, H., Sun, X., Li, K., & Shen, C. (2022). DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 13686 LNCS (pp. 237-253). Online: Springer Nature Switzerland. DOI Scopus25 |
| 2022 | Zhang, W., Huang, Z., Luo, G., Chen, T., Wang, X., Liu, W., . . . Shen, C. (2022). TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 12073-12083). Online: IEEE. DOI Scopus307 |
| 2022 | Li, R., Zhang, C., Lin, G., Wang, Z., & Shen, C. (2022). RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 16938-16947). Online: IEEE. DOI Scopus52 |
| 2022 | He, T., Yin, W., Shen, C., & van den Hengel, A. (2022). PointInst3D: Segmenting 3D Instances by Points. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 13663 LNCS (pp. 286-302). Online: Springer. DOI Scopus18 WoS18 |
| 2022 | Ding, C., Pang, G., & Shen, C. (2022). Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 7378-7388). New Orleans, LA, USA: IEEE. DOI Scopus157 WoS150 |
| 2022 | Wang, X., Yu, Z., De Mello, S., Kautz, J., Anandkumar, A., Shen, C., & Alvarez, J. M. (2022). FreeSOLO: Learning to Segment Objects without Annotations. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 14156-14166). New Orleans, LA, USA: IEEE. DOI Scopus114 WoS83 |
| 2022 | Mao, W., Ge, Y., Shen, C., Tian, Z., Wang, X., Wang, Z., & den Hengel, A. V. (2022). Poseur: Direct Human Pose Regression with Transformers. In S. Avidan, G. Brostow, M. Cisse, G. M. Farinella, & T. Hassner (Eds.), Computer Vision - ECCV 2022. Vol. 13666 LNCS (pp. 72-88). Tel Aviv, Israel: Springer, Cham. DOI Scopus88 WoS74 |
| 2022 | Dai, Y., Price, B., Zhang, H., & Shen, C. (2022). Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 11697-11706). New Orleans, LA, USA: IEEE. DOI Scopus29 WoS17 |
| 2022 | Jia, S., Yin, B., Yao, T., Ding, S., Shen, C., Yang, X., & Ma, C. (2022). Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition. In Advances in Neural Information Processing Systems Vol. 35 (pp. 1-12). New Orleans, Louisiana, USA: MIT Press. Scopus54 |
| 2022 | Peng, D., Wang, X., Liu, Y., Zhang, J., Huang, M., Lai, S., . . . Jin, L. (2022). SPTS: Single-Point Text Spotting. In Proceedings of the 30th ACM International Conference on Multimedia (pp. 10 pages). Online: ACM. DOI Scopus56 |
| 2022 | Lin, C., Wu, A., Liang, J., Zhang, J., Ge, W., Zheng, W. S., & Shen, C. (2022). Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval. In Advances in Neural Information Processing Systems Vol. 35 (pp. 12 pages). Online: Neural information processing systems foundation. Scopus29 |
| 2022 | Liang, J., Zhang, E., Zhang, J., & Shen, C. (2022). Multi-dataset Training of Transformers for Robust Action Recognition. In Advances in Neural Information Processing Systems Vol. 35. Online: Neural information processing systems foundation. Scopus8 |
| 2021 | Li, R., Lin, G., He, T., Liu, F., & Shen, C. (2021). HCRF-flow: Scene Flow from Point Clouds with Continuous High-order CRFs and Position-aware Flow Embedding. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 364-373). online: IEEE. DOI Scopus56 WoS47 |
| 2021 | Shu, Y., Yan, Y., Chen, S., Xue, J. H., Shen, C., & Wang, H. (2021). Learning spatial-semantic relationship for facial attribute recognition with limited labeled data. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 11911-11920). online: IEEE COMPUTER SOC. DOI Scopus38 WoS35 |
| 2021 | Ruan, D., Yan, Y., Lai, S., Chai, Z., Shen, C., & Wang, H. (2021). Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7656-7665). online: IEEE. DOI Scopus223 WoS175 |
| 2021 | Zhuge, Y., & Shen, C. (2021). Deep Reasoning Network for Few-shot Semantic Segmentation. In MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia (pp. 5344-5352). New York, USA: ACM. DOI Scopus15 WoS10 |
| 2021 | Mao, W., Tian, Z., Wang, X., & Shen, C. (2021). FCPose: Fully Convolutional Multi-Person Pose Estimation with Dynamic Instance-Aware Convolutions. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9030-9039). Nashville, TN, USA: IEEE. DOI Scopus84 WoS75 |
| 2021 | He, T., Shen, C., & van den Hengel, A. (2021). DyCO3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 354-363). online: IEEE. DOI Scopus108 WoS84 |
| 2021 | Wang, H., Chen, P., Zhuang, B., & Shen, C. (2021). Fully Quantized Image Super-Resolution Networks. In MM '21: Proceedings of the 29th ACM International Conference on Multimedia (pp. 639-647). New York, USA: ACM. DOI Scopus23 WoS15 |
| 2021 | Dai, Y., Lu, H., & Shen, C. (2021). Learning Affinity-Aware Upsampling for Deep Image Matting. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 6837-6846). online: IEEE COMPUTER SOC. DOI Scopus58 WoS51 |
| 2021 | Zhang, J., Xie, Y., Xia, Y., & Shen, C. (2021). DoDNet: Learning to Segment Multi-Organ and Tumors from Multiple Partially Labeled Datasets. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1195-1204). online: IEEE. DOI Scopus183 WoS165 |
| 2021 | Shu, C., Liu, Y., Gao, J., Yan, Z., & Shen, C. (2021). Channel-wise Knowledge Distillation for Dense Prediction. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (pp. 5291-5300). online: IEEE. DOI Scopus416 WoS363 |
| 2021 | Liu, Y., Chen, H., Chen, Y., Yin, W., & Shen, C. (2021). Generic Perceptual Loss for Modeling Structured Output Dependencies. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 5420-5428). online: IEEE. DOI Scopus42 WoS33 |
| 2021 | Yuan, J., Liu, Y., Shen, C., Wang, Z., & Li, H. (2021). A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (pp. 8209-8218). online: IEEE. DOI Scopus137 WoS121 |
| 2021 | Zhang, X., Wang, X., Bian, J. -W., Shen, C., & You, M. (2021). Diverse Knowledge Distillation for End-to-End Person Search. In THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE Vol. 35 (pp. 3412-3420). Virtual, Online: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. DOI Scopus42 WoS32 |
| 2021 | Pang, G., Van Den Hengel, A., Shen, C., & Cao, L. (2021). Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly Data. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1298-1308). online: ACM. DOI Scopus119 WoS88 |
| 2021 | Xie, Y., Zhang, J., Shen, C., & Xia, Y. (2021). CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation. In Medical Image Computing and Computer Assisted Intervention – MICCAI 2021 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part III Vol. 12903 LNCS (pp. 171-180). Switzerland: Springer International Publishing. DOI Scopus632 WoS572 |
| 2021 | Chen, P., Liu, J., Zhuang, B., Tan, M., & Shen, C. (2021). AQD: Towards Accurate Quantized Object Detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 104-113). online: IEEE. DOI Scopus31 WoS30 |
| 2021 | Kong, L., Shen, C., & Yang, J. (2021). FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation. In Proceedings - IEEE International Conference on Robotics and Automation (ICRA) Vol. 2021-May (pp. 10310-10316). Piscataway, New Jersey, United States: IEEE. DOI Scopus65 WoS47 |
| 2021 | Guo, D., Shao, Y., Cui, Y., Wang, Z., Zhang, L., & Shen, C. (2021). Graph Attention Tracking. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9538-9547). Piscataway, New Jersey, USA: IEEE. DOI Scopus484 |
| 2021 | Yan, C., Pang, G., Wang, L., Jiao, J., Feng, X., Shen, C., & Li, J. (2021). BV-Person: A Large-scale Dataset for Bird-view Person Re-identification. In Proceedings 2021 IEEE/CVF International Conference on Computer Vision ICCV 2021 (pp. 10923-10932). Los Alamitos, CA, USA: IEEE. DOI Scopus25 WoS13 |
| 2021 | Yan, C., Pang, G., Jiao, J., Bai, X., Feng, X., & Shen, C. (2021). Occluded Person Re-Identification with Single-scale Global Representations. In Proceedings of the IEEE International Conference on Computer Vision (pp. 11855-11864). online: IEEE. DOI Scopus62 WoS57 |
| 2021 | Chen, P., Zhuang, B., & Shen, C. (2021). FATNN: Fast and Accurate Ternary Neural Networks. In Proceedings 2021 IEEE/CVF International Conference on Computer Vision ICCV 2021 (pp. 5199-5208). Los Alamitos, CA,USA: IEEE. DOI Scopus20 |
| 2021 | Zhang, C., Ding, H., Lin, G., Li, R., Wang, C., & Shen, C. (2021). Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning. In Proceedings 2021 IEEE/CVF International Conference on Computer Vision ICCV 2021 (pp. 9415-9424). Los Alamitos, CA, USA: IEEE. DOI Scopus47 |
| 2021 | Chu, X., Tian, Z., Wang, Y., Zhang, B., Ren, H., Wei, X., . . . Shen, C. (2021). Twins: Revisiting the Design of Spatial Attention in Vision Transformers. In M. Ranzato (Ed.), Advances in Neural Information Processing Systems 34 Vol. 12 (pp. 9355-9366). Online: Curran Associates, Inc.. Scopus992 WoS852 |
| 2020 | Teney, D., Wang, P., Cao, J., Liu, L., Shen, C., & Van Den Hengel, A. (2020). V-PROM: A benchmark for visual reasoning using visual progressive matrices. In Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) Vol. 34 (pp. 12071-12078). Palo Alto, CA: Association for the Advancement of Artificial Intelligence. DOI Scopus23 |
| 2020 | Wang, X., Zhang, R., Kong, T., Li, L., & Shen, C. (2020). SOLOv2: Dynamic and Fast Instance Segmentation. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, & H. Lin (Eds.), ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 Vol. 33 (pp. 12 pages). ELECTR NETWORK: NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). |
| 2018 | Cao, J., Guo, Y., Wu, Q., Shen, C., Huang, J., & Tan, M. (2018). Adversarial Learning with Local Coordinate Coding. In Proceedings of Machine Learning Research Vol. 80 (pp. 707-715). Scopus9 |
| 2011 | Park, K., Shen, C., Hao, Z., & Kim, J. (2011). Efficiently learning a distance metric for large margin nearest neighbor classification. In Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence Vol. 1 (pp. 453-458). online: AAAI Press. Scopus16 |
| Year | Citation |
|---|---|
| 2025 | Zhu, M., Zhong, H., Zhao, C., Du, Z., Huang, Z., Liu, M., . . . Shen, C. (2025). Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO. |
| 2025 | Zhong, H., Zhu, M., Du, Z., Huang, Z., Zhao, C., Liu, M., . . . Shen, C. (2025). Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration. |
| 2024 | Zhu, M., Liu, Y., Luo, Z., Jing, C., Chen, H., Xu, G., . . . Shen, C. (2024). Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation. |
| 2024 | Liu, Y., Jing, C., Li, H., Zhu, M., Chen, H., Wang, X., & Shen, C. (2024). A Simple Image Segmentation Framework via In-Context Examples. |
| 2024 | Chen, Z., Wang, W., Yang, Z., Yuan, Z., Chen, H., & Shen, C. (2024). FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior. |
| 2024 | Xie, K., Yang, B., Chen, H., Wang, M., Zou, C., Xue, H., . . . Shen, C. (2024). Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model. |
| 2024 | Xu, G., Ge, Y., Liu, M., Fan, C., Xie, K., Zhao, Z., . . . Shen, C. (2024). What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?. |
| 2023 | Wang, W., Zhao, C., Chen, H., Chen, Z., Zheng, K., & Shen, C. (2023). AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort. |
| Date | Role | Research Topic | Program | Degree Type | Student Load | Student Name |
|---|---|---|---|---|---|---|
| 2022 - 2025 | Principal Supervisor | Skeleton Representation: From Human Pose to Protein Backbone | Doctor of Philosophy | Doctorate | Full Time | Mr Weian Mao |
| 2021 - 2025 | Principal Supervisor | Visual Perception and Reconstruction at Scale | Doctor of Philosophy | Doctorate | Full Time | Mr Yongtao Ge |
| 2021 - 2024 | Co-Supervisor | Towards Effective and Efficient Semantic Segmentation | Doctor of Philosophy | Doctorate | Full Time | Mr Bowen Zhang |
| 2021 - 2025 | Principal Supervisor | Towards Accurate Semi-Supervised Semantic Segmentation with Fewer Annotations | Doctor of Philosophy | Doctorate | Full Time | Miss Jinchao Ge |
| 2020 - 2023 | Principal Supervisor | Deep Learning for Scene Text Detection, Recognition, and Understanding | Doctor of Philosophy | Doctorate | Full Time | Mr Xinyu Wang |
| 2020 - 2023 | Principal Supervisor | Label-Efficient Segmentation for Diverse Scenarios | Doctor of Philosophy | Doctorate | Full Time | Mr Yunzhi Zhuge |
| 2020 - 2022 | Principal Supervisor | Deep Object Segmentation and Beyond | Doctor of Philosophy | Doctorate | Full Time | Mr Xinlong Wang |
| 2020 - 2025 | Principal Supervisor | Deep Anomaly Detection in Open Worlds | Doctor of Philosophy | Doctorate | Full Time | Mr Choubo Ding |
| 2019 - 2022 | Co-Supervisor | Self-supervised Learning of Monocular Depth from Video | Doctor of Philosophy | Doctorate | Full Time | Mr Jiawang Bian |
| 2019 - 2022 | Principal Supervisor | Deep Learning for Robotic Scene Understanding | Doctor of Philosophy | Doctorate | Full Time | Mr Libo Sun |
| 2018 - 2021 | Principal Supervisor | Fully Convolutional Instance-level Visual Recognition | Doctor of Philosophy | Doctorate | Full Time | Mr Zhi Tian |
| 2018 - 2022 | Principal Supervisor | 3D Scene Reconstruction from A Monocular Image | Doctor of Philosophy | Doctorate | Full Time | Mr Wei Yin |
| 2018 - 2019 | Principal Supervisor | High-performance Object Detection and Tracking using Deep Learning | Master of Philosophy | Master | Full Time | Mr Xinyu Wang |
| 2018 - 2021 | Principal Supervisor | Multi-modality Data Analysis Using Deep Reinforcement Learning | Doctor of Philosophy | Doctorate | Full Time | Mr Hu Wang |
| 2018 - 2022 | Principal Supervisor | Efficient Deep Networks for Image Matting | Doctor of Philosophy | Doctorate | Full Time | Ms Yutong Dai |
| 2018 - 2021 | Principal Supervisor | Efficient Fully Convolutional Networks for Dense Prediction Tasks | Doctor of Philosophy | Doctorate | Full Time | Ms Yifan Liu |
| 2017 - 2020 | Co-Supervisor | Semantic Image Segmentation and Other Dense Per-Pixel Tasks: Practical Approaches | Doctor of Philosophy | Doctorate | Full Time | Mr Vladimir Nekrasov |
| 2017 - 2020 | Principal Supervisor | Efficient Scene Parsing with Imagery and Point Cloud Data | Doctor of Philosophy | Doctorate | Full Time | Mr Tong He |
| 2017 - 2021 | Principal Supervisor | Efficient Fully-Convolutional Networks for Image Perception | Doctor of Philosophy | Doctorate | Full Time | Mr Hao Chen |
| 2016 - 2017 | Principal Supervisor | Deep Learning for Fine-Gained Visual Recognition | Master of Philosophy | Master | Full Time | Mr Teng Li |
| 2016 - 2020 | Co-Supervisor | How Geometry Meets Learning in Pose Estimation | Doctor of Philosophy | Doctorate | Full Time | Mr Ming Cai |
| 2015 - 2017 | Principal Supervisor | Sketch Image Recognition Using Deep Features | Master of Philosophy | Master | Full Time | Miss Yuchao Jiang |
| 2015 - 2019 | Principal Supervisor | Context Learning and Weakly Supervised Learning for Semantic Segmentation | Doctor of Philosophy | Doctorate | Full Time | Mr Tong Shen |
| 2014 - 2018 | Principal Supervisor | Text Detection and Recognition in Natural Scene Images | Doctor of Philosophy | Doctorate | Full Time | Mrs Hui Li |
| 2014 - 2018 | Principal Supervisor | Towards Efficient Deep Neural Networks with Applications to Visual Recognition | Doctor of Philosophy | Doctorate | Full Time | Dr Bohan Zhuang |
| 2014 - 2017 | Principal Supervisor | Deep Visual Representation for Weakly-supervised and Structured Output Tasks | Doctor of Philosophy | Doctorate | Full Time | Mr Yao Li |
| 2014 - 2016 | Principal Supervisor | Deep Learning for Multi-label Scene Classification | Master of Philosophy | Master | Full Time | Mr Junjie Zhang |
| 2013 - 2018 | Principal Supervisor | Mid-level Representations for Action Recognition and Zero-shot Learning | Doctor of Philosophy | Doctorate | Full Time | Mr Ruizhi Qiao |
| 2013 - 2017 | Principal Supervisor | Dynamic Scene Understanding with Applications to Traffic Monitoring | Doctor of Philosophy | Doctorate | Full Time | Mr Qichang Hu |
| 2013 - 2018 | Principal Supervisor | Deep Learning Based RGB-D Vision Tasks | Doctor of Philosophy | Doctorate | Full Time | Mr Yuanzhouhan Cao |
| 2012 - 2014 | Principal Supervisor | Hypergraph Modeling for Saliency Detection and Beyond | Master of Engineering Science | Master | Full Time | Mr Yao Li |
| 2012 - 2014 | Principal Supervisor | Structured Output Prediction and Binary Code Learning in Computer Vision | Doctor of Philosophy | Doctorate | Full Time | Dr Guosheng Lin |
| 2011 - 2015 | Principal Supervisor | Learning Structured Prediction Models in Computer Vision | Doctor of Philosophy | Doctorate | Full Time | Miss Fayao Liu |