Prof Minh Hoai Nguyen

Professor of Computer Vision

School of Computer Science and Information Technology

College of Engineering and Information Technology

Eligible to supervise Masters and PhD - email supervisor to discuss availability.

Minh Hoai Nguyen is a Professor of Computer Vision at the Australian Institute for Machine Learning (AIML) and the School of Computer and Mathematical Sciences (CMS) at the University of Adelaide. Before joining the University of Adelaide, he was a tenured Associate Professor at Stony Brook University from 2014 until 2024. During this time, he also took a leave to work at VinAI in Vietnam. He received a Bachelor of Software Engineering from the University of New South Wales in 2006 and a Ph.D. in Robotics from Carnegie Mellon University in 2012. His research interests lie in computer vision and machine learning. In 2012, Nguyen and his coauthor received the Best Student Paper Award at the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

Google Scholar page: https://scholar.google.com/citations?user=hRV0tY4AAAAJ&hl=en

Personal website: https://minhhoai.net

Date	Position	Institution name
2024 - ongoing	Professor	University of Adelaide
2020 - 2023	Associate Professor	Stony Brook University
2019 - ongoing	Consulting Research Scientist	VinAI Research
2014 - 2020	Assistant Professor	Stony Brook University
2013 - 2014	Junior Research Fellow	University of Oxford

Language	Competency
English	Can read, write, speak, understand spoken and peer review
Vietnamese	Can read, write, speak, understand spoken and peer review

Date	Institution name	Country	Title
2006 - 2012	Carnegie Mellon University	United States	PhD in Robotics
2002 - 2005	The University of New South Wales	Australia	Bachelor of Software Engineering

Date	Title	Institution	Country
2012 - 2014	Postdoctoral Research	University of Oxford	United Kingdom

Year	Citation
2025	Hartley, R., Jawahar, C. V., Nguyen, M. H., & Samaras, D. (2025). Foreword. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 15472 LNCS, v-vi.
2025	Hartley, R., Jawahar, C. V., Nguyen, M. H., & Samaras, D. (2025). Foreword. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 15474 LNCS, v-vi.
2025	Hartley, R., Jawahar, C. V., Nguyen, M. H., & Samaras, D. (2025). Foreword. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 15481 LNCS, v-vi.
2025	Nguyen, D. D., Nguyen, L. T., Huang, Y., Pham, C., & Hoai, M. (2025). Class-Agnostic Repetitive Action Counting Using Wearable Devices. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(6), 1-13. DOI
2023	Bui, H., Nguyen, M. H., Nguyen, D. Q., Pham, L., & Phung, D. (2023). Building and Nurturing AI Development in Vietnam. Communications of the ACM, 66(7), 75-76. DOI
2022	Sun, S., Annadi, R. R., Chaudhri, I., Munir, K., Hajagos, J., Saltz, J., . . . Koraishy, F. M. (2022). Short- and Long-Term Recovery after Moderate/Severe AKI in Patients with and without COVID-19. Kidney360, 3(2), 242-257. DOI Scopus18 WoS17 Europe PMC14
2022	Ali, F. Z., Wengler, K., He, X., Nguyen, M. H., Parsey, R. V., & DeLorenzo, C. (2022). Gradient boosting decision-tree-based algorithm with neuroimaging for personalized treatment in depression. Neuroscience Informatics, 2(4), 100110. DOI Scopus16 Europe PMC6
2021	Zelinsky, G. J., Chen, Y., Ahn, S., Adeli, H., Yang, Z., Huang, L., . . . Hoai, M. (2021). Predicting Goal-directed Attention Control Using Inverse-Reinforcement Learning. Neurons, behavior, data analysis, and theory, 2021(2). DOI Europe PMC6
2021	Chen, Y., Yang, Z., Ahn, S., Samaras, D., Hoai, M., & Zelinsky, G. (2021). COCO-Search18 fixation dataset for predicting goal-directed attention control. Scientific Reports, 11(1), 11 pages. DOI Scopus41 WoS22 Europe PMC12
2021	Hou, L., Vicente, T. F. Y., Hoai, M., & Samaras, D. (2021). Large Scale Shadow Annotation and Detection Using Lazy Annotation and Stacked CNNs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(4), 1337-1351. DOI Scopus26 WoS19 Europe PMC2
2021	Wei, Z., Wang, B., Hoai, M., Zhang, J., Shen, X., Lin, Z., . . . Samaras, D. (2021). Sequence-to-Segments Networks for Detecting Segments in Videos. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(3), 1009-1021. DOI Scopus10 WoS16 Europe PMC6
2021	Do, N., Truong, D., Nguyen, D., Hoai, M., & Pham, C. (2021). Self-controlling photonic-on-chip networks with deep reinforcement learning. Scientific Reports, 11(1), 18 pages. DOI Scopus11 WoS9 Europe PMC3
2021	Huang, X., Jamonnak, S., Zhao, Y., Wang, B., Hoai, M., Yager, K., & Xu, W. (2021). Interactive Visual Study of Multiple Attributes Learning Model of X-Ray Scattering Images. IEEE Transactions on Visualization and Computer Graphics, 27(2), 1312-1321. DOI Scopus8 WoS6 Europe PMC4
2020	Chaudhri, I., Moffitt, R., Taub, E., Annadi, R. R., Hoai, M., Bolotova, O., . . . Koraishy, F. M. (2020). Association of Proteinuria and Hematuria with Acute Kidney Injury and Mortality in Hospitalized Patients with COVID-19. Kidney and Blood Pressure Research, 45(6), 1018-1032. DOI Scopus50 WoS40 Europe PMC37
2018	Liu, Y., Hoai, M., Shao, M., & Kim, T. K. (2018). Latent Bi-Constraint SVM for Video-Based Object Recognition. IEEE Transactions on Circuits and Systems for Video Technology, 28(10), 3044-3052. DOI Scopus7 WoS7
2018	Wang, B., & Hoai, M. (2018). Back to the beginning: Starting point detection for early recognition of ongoing human actions. Computer Vision and Image Understanding, 175, 24-31. DOI Scopus11 WoS8
2018	Vicente, T. F. Y., Hoai, M., & Samaras, D. (2018). Leave-One-Out Kernel Optimization for Shadow Detection and Removal. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(3), 682-695. DOI Scopus152 WoS128 Europe PMC16
2017	Wei, Z., Adeli, H., Hoai, M., Zelinsky, G., & Samaras, D. (2017). Predicting Scanpath Agreement during Scene Viewing using Deep Neural Networks. Journal of Vision, 17(10), 749. DOI
2014	Hoai, M., & De La Torre, F. (2014). Max-margin early event detectors. International Journal of Computer Vision, 107(2), 191-202. DOI Scopus181 WoS151
2014	Hoai, M., Torresani, L., De La Torre, F., & Rother, C. (2014). Learning discriminative localization from weakly labeled data. Pattern Recognition, 47(3), 1523-1534. DOI Scopus32 WoS31
2010	Nguyen, M. H., & De La Torre, F. (2010). Metric learning for image alignment. International Journal of Computer Vision, 88(1), 69-84. DOI Scopus15 WoS11
2010	Nguyen, M. H., & de la Torre, F. (2010). Optimal feature selection for support vector machines. Pattern Recognition, 43(3), 584-591. DOI Scopus208 WoS165
1988	DERYCKE, A., VIEVILLE, C., POISSON, D., STACH, C., & NGUYEN, M. H. (1988). NANORESEAU, EDUCATIONAL UTILIZATION OF A LOCAL-NETWORK. TSI-TECHNIQUE ET SCIENCE INFORMATIQUES, 7(1), 7-20.
-	Zelinsky, G. J., Ahn, S., Yang, Z., Chen, Y., Mondal, S., Hoai, M., & Samaras, D. (2023). Reward Maps Predict Target-present and Target-absent Visual Search. Journal of Vision, 23(9), 5161. DOI

Year	Citation
2014	Hoai, M., & de la Torre, F. (2014). Structured Prediction for Event Detection. In S. Nowozin, P. V. Gehler, J. Jancsary, & C. H. Lampert (Eds.), ADVANCED STRUCTURED PREDICTION (pp. 333-361). MIT PRESS. WoS1

Year	Citation
2025	Nguyen, G. K., Huang, Y., & Hoai, M. (2025). Can Current AI Models Count What We Mean, Not What They See? A Benchmark and Systematic Evaluation. In Proceedings 2025 International Conference on Digital Image Computing Techniques and Applications Dicta 2025 (pp. 397-404). AUSTRALIA, Adelaide: IEEE COMPUTER SOC. DOI
2025	Mondal, S., Ahn, S., Yang, Z., Balasubramanian, N., Samaras, D., Zelinsky, G., & Hoai, M. (2025). Look Hear: Gaze Prediction for Speech-Directed Human Attention. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 15100 LNCS (pp. 236-255). Milan, Italy: Springer Science and Business Media Deutschland GmbH. DOI Scopus1 WoS1
2025	Miao, Q., Graikos, A., Zhang, J., Mondal, S., Minh, H., & Samaras, D. (2025). Diffusion-Refined VQA Annotations for Semi-supervised Gaze Following. In A. Leonardis, E. Ricci, S. Roth, O. Russakovsky, T. Sattler, & G. Varol (Eds.), COMPUTER VISION - ECCV 2024, PT XXXIX Vol. 15097 (pp. 439-457). ITALY, Milan: SPRINGER INTERNATIONAL PUBLISHING AG. DOI
2025	Xue, R., Xu, J., Mondal, S., Le, H., Zelinsky, G., Hoai, M., & Samaras, D. (2025). Few-shot Personalized Scanpath Prediction. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 13497-13507). TN, Nashville: IEEE COMPUTER SOC. DOI Scopus1
2025	Huang, Y., Chen, Z., Xu, Y., Hoai, M., & Li, Z. (2025). DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion. In Mm 2025 Proceedings of the 33rd ACM International Conference on Multimedia Co Located with mm 2025 (pp. 9930-9939). IRELAND, Dublin: ASSOC COMPUTING MACHINERY. DOI
2024	Chandran, P., Huang, Y., Munsell, J., Howatt, B., Wallace, B., Wilson, L., . . . Loschky, L. C. (2024). Characterizing Learners' Complex Attentional States During Online Multimedia Learning Using Eye-tracking, Egocentric Camera, Webcam, and Retrospective recalls. In Proceedings of the 2024 Symposium on Eye Tracking Research and Applications Vol. 2024 (pp. 7 pages). Online: ACM. DOI Scopus5 WoS3
2024	Rebello, N. S., Munsell, J., Chandran, P., Loschky, L. C., Huang, Y., Hoai, M., & D'Mello, S. (2024). Mapping students' self-reported cognitive load, situational engagement, and attentional-cognitive states in an online multimedia learning module. In 2024 Physics Education Research Conference Proceedings (pp. 354-360). Boston, Massachusetts: American Association of Physics Teachers. DOI
2024	Nguyen, P., Do, A., & Hoai, M. (2024). Detecting Omissions in Geographic Maps through Computer Vision. In 2024 International Conference on Multimedia Analysis and Pattern Recognition, MAPR 2024 - Proceedings Vol. 6 (pp. 1-6). Da Nang: IEEE. DOI
2024	Yang, Z., Mondal, S., Ahn, S., Xue, R., Zelinsky, G., Hoai, M., & Samaras, D. (2024). Unifying Top-Down and Bottom-Up Scanpath Prediction Using Transformers. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 1683-1693). Seattle, WA, USA: IEEE. DOI Scopus17 WoS10
2024	Pham, B. -D., Tran, P., Tran, A., Pham, C., Nguyen, R., & Hoai, M. (2024). Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Vol. 30 (pp. 2804-2813). Seattle, WA, USA: IEEE. DOI Scopus17 WoS14
2024	Lee, S., Lu, Z., Zhang, Z., Hoai, M., & Elhamifar, E. (2024). Error Detection in Egocentric Procedural Task Videos. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Vol. abs/2105.10859 (pp. 18655-18666). Seattle, WA, USA: IEEE. DOI Scopus17 WoS10
2024	Narasimhaswamy, S., Bhattacharya, U., Chen, X., Dasgupta, I., Mitra, S., & Hoai, M. (2024). HanDiffuser: Text-to-Image Generation with Realistic Hand Appearances. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Vol. 23 (pp. 2468-2479). Seattle, WA, USA: IEEE. DOI Scopus20 WoS16
2024	Narasimhaswamy, S., Nguyen, H. A., Huang, L., & Hoai, M. (2024). HOIST-Former: Hand-Held Objects Identification, Segmentation, and Tracking in the Wild. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 2351-2361). Seattle, WA, USA: IEEE. DOI Scopus4 WoS2
2024	Huang, Y., Nguyen, D. D., Nguyen, L., Pham, C., & Hoai, M. (2024). Count What You Want: Exemplar Identification and Few-Shot Counting of Human Actions in the Wild. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 10057-10065). Online: AAAI. DOI Scopus2 WoS2
2024	Zhang, Z., Truong, V. Q., & Hoai, M. (2024). Efficiency-preserving Scene-adaptive Object Detection. In 35th British Machine Vision Conference Bmvc 2024.
2023	Huang, Y., Ranjan, V., & Hoai, M. (2023). Interactive Class-Agnostic Object Counting. In Proceedings of the IEEE International Conference on Computer Vision (pp. 22255-22265). Paris, France: IEEE. DOI Scopus11 WoS5
2023	Ghosh, S., Aggarwal, T., Hoai, M., & Balasubramanian, N. (2023). Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation. In I. Augenstein, & A. Vlachos (Eds.), Proceedings OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 (pp. 1882-1897). Online: ASSOC COMPUTATIONAL LINGUISTICS-ACL.
2023	Ghosh, S., Aggarwal, T., Hoai, M., & Balasubramanian, N. (2023). Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation. In EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023 (pp. 1837-1852). Dubrovnik, Croatia: ACL Anthology. DOI Scopus3
2023	Zhang, Z., & Hoai, M. (2023). Object Detection with Self-Supervised Scene Adaptation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 21589-21599). Vancouver, CANADA: IEEE COMPUTER SOC. DOI Scopus18 WoS15
2023	Pham, B. D., Tran, P., Tran, A., Pham, C., Nguyen, R., & Hoai, M. (2023). HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 9843-9852). Online: IEEE COMPUTER SOC. DOI Scopus2 WoS1
2023	Mondal, S., Yang, Z., Ahn, S., Samaras, D., Zelinsky, G., & Hoai, M. (2023). Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 1441-1450). Vancouver, BC, CANADA: IEEE COMPUTER SOC. DOI Scopus41 WoS27
2023	Miao, Q., Hoai, M., & Samaras, D. (2023). Patch-level Gaze Distribution Prediction for Gaze Following. In Proceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023 (pp. 880-889). Waikoloa, HI, USA: IEEE COMPUTER SOC. DOI Scopus23 WoS21
2023	Tran, V., Balasubramanian, N., & Hoai, M. (2023). From Within to Between: Knowledge Distillation for Cross Modality Retrieval. In L. Wang, J. Gall, T. J. Chin, I. Sato, & R. Chellappa (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 13844 LNCS (pp. 605-622). Macao, PEOPLES R CHINA: SPRINGER INTERNATIONAL PUBLISHING AG. DOI
2023	Tran, B., Hua, B. S., Tran, A. T., & Hoai, M. (2023). Self-supervised Learning with Multi-view Rendering for 3D Point Cloud Analysis. In J. Gall, T. J. Chin, I. Sato, R. Chellappa, & L. Wang (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 13841 LNCS (pp. 413-431). Macao, China: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus3 WoS3
2023	Ranjan, V., & Nguyen, M. H. (2023). Exemplar Free Class Agnostic Counting. In L. Wang, J. Gall, T. J. Chin, I. Sato, & R. Chellappa (Eds.), Computer Vision – ACCV 2022 16th Asian Conference on Computer Vision, Proceedings Vol. 13844 (pp. 71-87). Macao, China: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus7 WoS12
2022	Ho, L. N., Tran, A. T., Phung, Q., & Hoai, M. (2022). Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images. In Proceedings of the IEEE International Conference on Computer Vision (pp. 12580-12590). Montreal, QC, Canada: IEEE. DOI Scopus8 WoS1
2022	Nguyen, T., Pham, C., Nguyen, K., & Hoai, M. (2022). Few-Shot Object Counting and Detection. In S. Avidan, G. Brostow, M. Cisse, G. M. Farinella, & T. Hassner (Eds.), European Conference on Computer Vision. Computer Vision - ECCV Vol. 13680 (pp. 348-365). Tel Aviv, ISRAEL: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus36 WoS28
2022	Yang, Z., Mondal, S., Ahn, S., Zelinsky, G., Hoai, M., & Samaras, D. (2022). Target-Absent Human Attention. In S. Avidan, G. Brostow, M. Cisse, G. M. Farinella, & T. Hassner (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 13664 (pp. 52-68). ISRAEL, Tel Aviv: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus17 WoS15 Europe PMC1
2022	Ranjan, V., & Hoai, M. (2022). Vicinal Counting Networks. In Conference on Computer Vision and Pattern Recognition Workshops IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Workshops Vol. 2022-June (pp. 4220-4229). New Orleans, LA, USA: IEEE. DOI Scopus15 WoS12
2022	Chen, Y., Yang, Z., Chakraborty, S., Mondal, S., Ahn, S., Samaras, D., . . . Zelinsky, G. (2022). Characterizing Target-absent Human Attention. In Conference on Computer Vision and Pattern Recognition Workshops IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Workshops Vol. 2022 (pp. 5027-5036). New Orleans, LA, USA: IEEE. DOI Scopus15 WoS7
2022	Huang, M., Narasimhaswamy, S., Vazir, S., Ling, H., & Hoai, M. (2022). Forward Propagation, Backward Regression, and Pose Association for Hand Tracking in the Wild. In Proceedings / CVPR, IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 6396-6406). New Orleans, LA, USA: IEEE. DOI Scopus10 WoS4
2022	Narasimhaswamy, S., Nguyen, T., Huang, M., & Hoai, M. (2022). Whose Hands are These? Hand Detection and Hand-Body Association in the Wild. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 4879-4889). New Orleans, LA, USA: IEEE COMPUTER SOC. DOI Scopus22 WoS13
2021	Nguyen, T., Tran, A. T., & Hoai, M. (2021). Lipstick ain't enough: Beyond color matching for in-the-wild makeup transfer. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 13300-13309). Nashville, TN, USA: IEEE COMPUTER SOC. DOI Scopus52 WoS51
2021	Tran, V., Balasubramanian, N., & Hoai, M. (2021). PROGRESSIVE KNOWLEDGE DISTILLATION FOR EARLY ACTION RECOGNITION. In Proceedings - International Conference on Image Processing, ICIP Vol. 2021-September (pp. 2583-2587). Anchorage, AK, USA: IEEE. DOI Scopus11 WoS8
2021	Ranjan, V., Sharma, U., Nguyen, T., & Hoai, M. (2021). Learning To Count Everything. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3393-3402). Nashville, TN, USA: IEEE COMPUTER SOC. DOI Scopus171 WoS145
2021	Nguyen, N., Nguyen, T., Tran, V., Tran, M. T., Ngo, T. D., Nguyen, T. H., & Hoai, M. (2021). Dictionary-guided Scene Text Recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7379-7388). Nashville, TN, USA: IEEE COMPUTER SOC. DOI Scopus64 WoS45
2021	Park, S., Hoai, M., Bhattacharya, A., & Das, S. R. (2021). Adaptive streaming of 360-degree videos with reinforcement learning. In Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 (pp. 1838-1847). Waikoloa, HI, USA: IEEE COMPUTER SOC. DOI Scopus31 WoS21
2021	Wang, Y., Bertasius, G., Oh, T. H., Gupta, A., Hoai, M., & Torresani, L. (2021). Supervoxel attention graphs for long-range video modeling. In Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 (pp. 155-166). Waikoloa, HI, USA: IEEE COMPUTER SOC. DOI Scopus12 WoS7
2021	Abousamra, S., Hoai, M., Samaras, D., & Chen, C. (2021). Localization in the Crowd with Topological Constraints. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 Vol. 2A (pp. 872-881). Virtual, Online: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE. DOI Scopus131 WoS100
2021	Tran, V., Wang, Y., Zhang, Z., & Hoai, M. (2021). KNOWLEDGE DISTILLATION FOR HUMAN ACTION ANTICIPATION. In Proceedings - International Conference on Image Processing, ICIP Vol. 2021-September (pp. 2518-2522). Anchorage, AK, USA: IEEE. DOI Scopus8 WoS7
2021	Tran, P., Tran, A. T., Phung, Q., & Hoai, M. (2021). Explore image deblurring via encoded blur kernel space. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 11951-11960). Nashville, TN, USA: IEEE COMPUTER SOC. DOI Scopus80 WoS63
2021	Huynh, C., Tran, A. T., Luu, K., & Hoai, M. (2021). Progressive semantic segmentation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 16750-16759). Nashville, TN, USA: IEEE COMPUTER SOC. DOI Scopus116 WoS99
2021	Zhang, Z., Koraishy, F. M., & Hoai, M. (2021). Exemplar-Based Early Event Prediction in Video. In 32nd British Machine Vision Conference, BMVC 2021. Virtual, Online: British Machine Vision Association, BMVA.
2020	Wang, Y., Tran, V., Bertasius, G., Torresani, L., & Hoai, M. (2020). Attentive Action and Context Factorization. In 31st British Machine Vision Conference, BMVC 2020. Virtual, Online: British Machine Vision Association, BMVA. Scopus4
2020	Nguyen, M. T., Phung, D., Hoai, M., & Nguyen, T. H. (2020). Structural and functional decomposition for personality image captioning in a communication game. In Findings of the Association for Computational Linguistics Findings of ACL: EMNLP 2020 (pp. 4587-4593). Virtual, Online: Association for Computational Linguistics (ACL). Scopus2 WoS3
2020	Wang, B., Liu, H., Samaras, D., & Hoai, M. (2020). Distribution matching for crowd counting. In Advances in Neural Information Processing Systems Vol. 2020-December. Virtual, Online: Neural information processing systems foundation. Scopus315
2020	Narasimhaswamy, S., Nguyen, T., & Hoai, M. (2020). Detecting hands and recognizing physical contact in the wild. In Advances in Neural Information Processing Systems Vol. 2020-December. Virtual, Online: Neural information processing systems foundation. Scopus41
2020	Ranjan, V., Wang, B., Shah, M., & Hoai, M. (2020). Uncertainty Estimation and Sample Selection for Crowd Counting. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 12626 LNCS (pp. 375-391). Virtual, Online: Springer International Publishing. DOI Scopus6 WoS2
2020	Wei, Z., Zhang, J., Lin, Z., Lee, J. Y., Balasubramanian, N., Hoai, M., & Samaras, D. (2020). Learning Visual Emotion Representations from Web Data. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 13103-13112). Seattle, WA, USA: IEEE. DOI Scopus44 WoS13
2020	Yang, Z., Huang, L., Chen, Y., Wei, Z., Ahn, S., Zelinsky, G., . . . Hoai, M. (2020). Predicting Goal-Directed Human Attention Using Inverse Reinforcement Learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2020 (pp. 190-199). Seattle, WA, USA: IEEE. DOI Scopus90 WoS82 Europe PMC21
2020	Shilkrot, R., Narasimhaswamy, S., Vazir, S., & Hoai, M. (2020). WorkingHands: A hand-tool assembly dataset for image segmentation and activity mining. In 30th British Machine Vision Conference 2019, BMVC 2019. Cardiff: BMVA Press. Scopus9
2020	Wang, B., Huang, L., & Hoai, M. (2020). Active vision for early recognition of human actions. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1078-1088). Seattle, WA, USA: IEEE. DOI Scopus16 WoS3
2019	Zelinsky, G., Yang, Z., Huang, L., Chen, Y., Ahn, S., Wei, Z., . . . Hoai, M. (2019). Benchmarking gaze prediction for categorical visual search. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Vol. 2019-June (pp. 828-836). CA, Long Beach: IEEE. DOI Scopus39 WoS28
2019	Narasimhaswamy, S., Wei, Z., Wang, Y., Zhang, J., & Nguyen, M. H. (2019). Contextual attention for hand detection in the wild. In Proceedings of the IEEE International Conference on Computer Vision (pp. 9566-9575). SOUTH KOREA, Seoul: IEEE. DOI Scopus59 WoS39
2019	Wang, Y., Huang, H., Wang, C., He, T., Wang, J., & Hoai, M. (2019). GIF2VIDEO: Color dequantization and temporal interpolation of GIF images. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 1419-1428). CA, Long Beach: IEEE COMPUTER SOC. DOI Scopus14 WoS10
2019	Rebello, N. S., Minh, H. N., Wang, Y., Zu, T., Hutson, J., & Loschky, L. C. (2019). Machine learning predicts responses to conceptual tasks using eye movements. In A. Traxler, Y. Cao, & S. Wolf (Eds.), 2018 PHYSICS EDUCATION RESEARCH CONFERENCE (PERC) (pp. 4 pages). DC, Washington: AMER ASSOC PHYSICS TEACHERS. DOI WoS2
2018	Wei, Z., Wang, B., Hoai, M., Zhang, J., Lin, Z., Shen, X., . . . Samaras, D. (2018). Sequence-to-segments networks for segment detection. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. CesaBianchi, & R. Garnett (Eds.), Advances in Neural Information Processing Systems Vol. 2018-December (pp. 3507-3516). CANADA, Montreal: NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). Scopus11 WoS1
2018	Wei, Z., Zhang, J., Shen, X., Lin, Z., Mech, R., Hoai, M., & Samaras, D. (2018). Good View Hunting: Learning Photo Composition from Dense View Pairs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 5437-5446). UT, Salt Lake City: IEEE. DOI Scopus87 WoS67
2018	Wang, Y., & Hoai, M. (2018). Pulling Actions out of Context: Explicit Separation for Effective Combination. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7044-7053). UT, Salt Lake City: IEEE. DOI Scopus23
2018	Wang, Y., Tran, V. Q., & Nguyen, M. H. (2018). Eigen-evolution dense trajectory descriptors. In Proceedings 13th IEEE International Conference on Automatic Face and Gesture Recognition Fg 2018 (pp. 473-479). PEOPLES R CHINA, Xi'an: IEEE. DOI Scopus4 WoS4
2018	Ranjan, V., Le, H., & Hoai, M. (2018). Iterative crowd counting. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 11211 LNCS (pp. 278-293). GERMANY, Munich: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus79 WoS211
2018	Wang, Y., Wang, L., You, Y., Zou, X., Chen, V., Li, S., . . . Weinberger, K. Q. (2018). Resource Aware Person Re-identification across Multiple Resolutions. In 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (pp. 8042-8051). UT, Salt Lake City: IEEE. DOI WoS255
2018	Le, H., Vicente, T. F. Y., Nguyen, V., Hoai, M., & Samaras, D. (2018). A+D Net: Training a Shadow Detector with Adversarial Shadow Attenuation. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 11206 LNCS (pp. 680-696). GERMANY, Munich: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus32 WoS105
2018	Wang, B., & Hoai, M. (2018). Predicting body movement and recognizing actions: An integrated framework for mutual benefits. In Proceedings 13th IEEE International Conference on Automatic Face and Gesture Recognition Fg 2018 (pp. 341-348). PEOPLES R CHINA, Xi'an: IEEE. DOI Scopus11 WoS4
2017	Nguyen, V., Vicente, T. F. Y., Zhao, M., Hoai, M., & Samaras, D. (2017). Shadow Detection with Conditional Generative Adversarial Networks. In Proceedings of the IEEE International Conference on Computer Vision Vol. 2017-October (pp. 4520-4528). ITALY, Venice: IEEE. DOI Scopus204 WoS173
2017	Wang, B., Yager, K., Yu, D., & Hoai, M. (2017). X-Ray scattering image classification using deep learning. In Proceedings 2017 IEEE Winter Conference on Applications of Computer Vision Wacv 2017 (pp. 697-704). CA, Santa Rosa: IEEE. DOI Scopus55 WoS47
2017	Ma, K., Hoai, M., & Samaras, D. (2017). Large-scale continual road inspection: Visual infrastructure assessment in the wild. In British Machine Vision Conference 2017 Bmvc 2017. British Machine Vision Association. DOI Scopus27
2016	Wang, Y., & Hoai, M. (2016). Improving Human Action Recognition by Non-action Classification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2016-December (pp. 2698-2707). WA, Seattle: IEEE. DOI Scopus16 WoS7
2016	Vicente, T. F. Y., Hoai, M., & Samaras, D. (2016). Noisy Label Recovery for Shadow Detection in Unfamiliar Domains. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2016-December (pp. 3783-3792). WA, Seattle: IEEE. DOI Scopus34 WoS26
2016	Vicente, T. F. Y., Hou, L., Yu, C. P., Hoai, M., & Samaras, D. (2016). Large-scale training of shadow detectors with noisily-annotated shadow examples. In B. Leibe, J. Matas, N. Sebe, & M. Welling (Eds.), Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 9910 LNCS (pp. 816-832). NETHERLANDS, Amsterdam: SPRINGER INTERNATIONAL PUBLISHING AG. DOI Scopus215 WoS198
2016	Wei, Z., & Hoai, M. (2016). Region Ranking SVM for Image Classification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2016-December (pp. 2987-2996). WA, Seattle: IEEE. DOI Scopus25 WoS15
2016	Wei, Z., Adeli, H., Zelinsky, G., Hoai, M., & Samaras, D. (2016). Learned region sparsity and diversity also predict visual attention. In D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, & R. Garnett (Eds.), Advances in Neural Information Processing Systems Vol. 29 (pp. 1902-1910). SPAIN, Barcelona: NEURAL INFORMATION PROCESSING SYSTEMS (NIPS). Scopus10 WoS2
2016	Wang, B., Guan, Z., Yao, S., Qin, H., Nguyen, M. H., Yager, K., & Yu, D. (2016). Deep learning for analysing synchrotron data streams. In 2016 New York Scientific Data Summit Nysds 2016 Proceedings (pp. 5 pages). NY, New York: IEEE. DOI Scopus10 WoS2
2015	Hoai, M., & Zisserman, A. (2015). Improving human action recognition using score distribution and ranking. In D. Cremers, I. Reid, H. Saito, & M. H. Yang (Eds.), Lecture Notes in Computer Science Vol. 9007 (pp. 3-20). SINGAPORE, Singapore: SPRINGER-VERLAG BERLIN. DOI Scopus46 WoS22
2015	Kwon, H., Yun, K., Hoai, M., & Samaras, D. (2015). Recognizing cultural events in images: A study of image categorization models. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Vol. 2015-October (pp. 51-57). MA, Boston: IEEE. DOI Scopus6
2015	Hoai, M., & Zisserman, A. (2015). Thread-safe: Towards recognizing human actions across shot boundaries. In D. Cremers, I. Reid, H. Saito, & M. H. Yang (Eds.), Lecture Notes in Computer Science Vol. 9006 (pp. 222-237). SINGAPORE, Singapore: SPRINGER-VERLAG BERLIN. DOI Scopus3 WoS2
2015	Vicente, T. F. Y., Hoai, M., & Samaras, D. (2015). Leave-one-out kernel optimization for shadow detection. In Proceedings of the IEEE International Conference on Computer Vision Vol. 2015 International Conference on Computer Vision, ICCV 2015 (pp. 3388-3396). CHILE, Santiago: IEEE. DOI Scopus55 WoS39
2014	Hoai, M., & Zisserman, A. (2014). Talking heads: Detecting humans and recognizing their interactions. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 875-882). OH, Columbus: IEEE. DOI Scopus51 WoS26
2014	Hoai, M., Ladický, L., & Zisserman, A. (2014). Action recognition from weak alignment of body parts. In Bmvc 2014 Proceedings of the British Machine Vision Conference 2014 (pp. 86.1-86.12). British Machine Vision Association. DOI Scopus12
2014	Hoai, M. (2014). Regularized Max Pooling for image categorization. In Bmvc 2014 Proceedings of the British Machine Vision Conference 2014 (pp. 32.1-32.12). British Machine Vision Association. DOI Scopus54
2013	Hoai, M., & Zisserman, A. (2013). Discriminative sub-categorization. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1666-1673). OR, Portland: IEEE. DOI Scopus44 WoS25
2012	Hoai, M., & De La Torre, F. (2012). Max-margin early event detectors. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 2863-2870). RI, Providence: IEEE. DOI Scopus128 WoS61
2012	Hoai, M., & De La Torre, F. (2012). Maximum margin temporal clustering. In Journal of Machine Learning Research Vol. 22 (pp. 520-528). Scopus22
2011	Hoai, M., Lan, Z. Z., & De La Torre, F. (2011). Joint segmentation and classification of human actions in video. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 3265-3272). CO, Colorado Springs: IEEE. DOI Scopus208 WoS18
2010	Simon, T., Nguyen, M. H., De La Torre, F., & Cohn, J. F. (2010). Action unit detection with segment-based SVMs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 2737-2744). CA, San Francisco: IEEE COMPUTER SOC. DOI Scopus69 WoS33
2009	Cohn, J. F., Kruez, T. S., Matthews, I., Yang, Y., Nguyen, M. H., Padilla, M. T., . . . De La Torre, F. (2009). Detecting depression from facial actions and vocal prosody. In Proceedings 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops ACII 2009 (pp. 1-7). IEEE. DOI Scopus418
2009	Nguyen, M. H., & De La Torre, F. (2009). Robust kernel principal component analysis. In Advances in Neural Information Processing Systems 21 Proceedings of the 2008 Conference (pp. 1185-1192). Scopus46
2009	Nguyen, M. H., Torresani, L., De La Torre, F., & Rother, C. (2009). Weakly supervised discriminative localization and classification: A joint learning process. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1925-1932). JAPAN, Kyoto: IEEE. DOI Scopus151 WoS81
2008	Minh, H. N., & De La Torre, F. (2008). Local minima free parameterized appearance models. In 26th IEEE Conference on Computer Vision and Pattern Recognition CVPR (pp. 1412-1419). AK, Anchorage: IEEE. DOI Scopus24
2008	Nguyen, M. H., & De La Torre, F. (2008). Learning image alignment without local minima for face detection and tracking. In 2008 8th IEEE International Conference on Automatic Face and Gesture Recognition FG 2008 (pp. 466-472). NETHERLANDS, Amsterdam: IEEE. DOI Scopus8
2008	Nguyen, M. H., Lalonde, J. F., Efros, A. A., & De La Torre, F. (2008). Image-based shaving. In Computer Graphics Forum Vol. 27 (pp. 627-635). GREECE, Crete: WILEY. DOI Scopus39 WoS31
2008	De La Torre, F., & Minh, H. N. (2008). Parameterized kernel principal component analysis: Theory and applications to supervised and unsupervised image alignment. In 26th IEEE Conference on Computer Vision and Pattern Recognition CVPR (pp. 1404-1411). AK, Anchorage: IEEE. DOI Scopus35
2008	Nguyen, M. H., Perez, J., & Torre, F. D. L. (2008). Facial feature detection with optimal pixel reduction SVM. In 2008 8th IEEE International Conference on Automatic Face and Gesture Recognition FG 2008 (pp. 460-465). NETHERLANDS, Amsterdam: IEEE. DOI Scopus37
2006	Nguyen, M. H., & Wobcke, W. (2006). A flexible framework for sharedplans. In A. Sattar, & B. H. Kang (Eds.), Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Vol. 4304 LNAI (pp. 393-402). AUSTRALIA, Hobart: SPRINGER-VERLAG BERLIN. DOI Scopus3 WoS2
2003	Chan, K., Chen, A., Liang, Z. C. L., Michail, A., Nguyen, M. H., & Seow, N. (2003). DRT: A tool for design recovery of interactive graphical applications. In Proceedings International Conference on Software Engineering (pp. 814-815). OR, PORTLAND: IEEE COMPUTER SOC. DOI Scopus1
1996	Minh, H. N. (1996). Summations of polylogarithms via evaluation transform. In Mathematics and Computers in Simulation Vol. 42 (pp. 707-728). FRANCE, LILLE: ELSEVIER SCIENCE BV. WoS13
1991	Minh, H. N., Jacob, G., & Oussous, N. E. (1991). Input output behavior of nonlinear analytic systems - rational-approximations, nilpotent structural approximations. In B. Bonnard, B. Bride, J. P. Gauthier, & I. Kupka (Eds.), Analysis of Controlled Dynamical Systems Vol. 8 (pp. 253-262). FRANCE, UNIV LYON, LYON: BIRKHAUSER BOSTON. WoS1

Teaching at Stony Brook University

Spring 2021: CSE512 – Machine Learning – Graduate

Fall 2020: CSE353 – Machine Learning – Undergraduate

Spring 2020: CSE615 – Advanced Computer Vision – Graduate

Fall 2019: CSE512 – Machine Learning – Graduate

Spring 2019: CSE378 – Introduction to Robotics – Undergraduate

Fall 2018: CSE512 – Machine Learning – Graduate

Spring 2018: CSE512 – Machine Learning – Graduate

Spring 2018: CSE378 – Introduction to Robotics – Undergraduate

Fall 2016: CSE527 – Introduction to Computer Vision – Graduate

Spring 2016: CSE512 – Machine Learning – Graduate

Spring 2015: CSE525 – Introduction to Robotics – Graduate

Fall 2014: CSE594 – Video Analysis – Graduate

Current Higher Degree by Research Supervision (Adelaide University)

Date	Role	Research Topic	Program	Degree Type	Student Load	Student Name
2026	Principal Supervisor	Region-level Data Attribution to enhance Reliability and Applications of Text-to-Image Generative models	-	Doctorate	Full Time	Mr Trong Bang Nguyen
2026	Principal Supervisor	Weakly Supervised Methods for Class-Agnostic Fine-Grained Visual Counting in Open-World Scenarios	-	Master	Full Time	Mr Gia Khanh Nguyen
2025	Principal Supervisor	Multimodal Action Quality Assessment Using Cameras and Bodyworn Sensors	Doctor of Philosophy	Doctorate	Full Time	Mr Duc Duy Nguyen
2025	Co-Supervisor	Artificial Intelligence for Space	Doctor of Philosophy	Doctorate	Full Time	Mr Anh Vu Nguyen
2025	Co-Supervisor	Collaborative object identification for Un-crewed Aerial System (UAS) tasks in complex environments	Doctor of Philosophy	Doctorate	Full Time	Mr Andrew Martin Chesson
2025	Principal Supervisor	Fine-Grained Action Counting and Quality Evaluation in Video	Doctor of Philosophy	Doctorate	Full Time	Mr Chang Dong
2025	Co-Supervisor	Cutting-Edge Artificial Intelligence for Prognostic Prediction of Major Amputation in Patients with Chronic Limb Threatening Ischaemia and Diabetes-related Foot Disease	Doctor of Philosophy	Doctorate	Full Time	Ms Lipin Guo
2024	Principal Supervisor	Hand-held Object Identification, Segmentation, and Tracking in the Wild	Doctor of Philosophy	Doctorate	Full Time	Mr Huy Anh Nguyen

Position: Professor of Computer Vision
Email: minhhoai.nguyen@adelaide.edu.au
