Qi Wu

Associate Professor Qi Wu

Associate Prof/Reader

School of Computer and Mathematical Sciences

Faculty of Sciences, Engineering and Technology

Eligible to supervise Masters and PhD - email supervisor to discuss availability.


Dr Qi Wu is currently an Associate Professor in the University of Adelaide and he was an ARC Senior Research Associate in the Australian Centre for Robotic Vision (ACRV) in the University of Adelaide, Australia. Before that, he works as a Postdoc Researcher in the Australian Centre for Visual Technologies (ACVT). He received an MSc in Global Computing and Media Technology, a PhD in Computer Science from the University of Bath (United Kingdom), in 2011 and 2015. His research interests include cross-depictive style object modelling, object detection and Vision-to-Language problems. He is especially interested in the problem of Image Captioning and Visual Question Answering. His image captioning model produced the best result in the Microsoft COCO Image Captioning Challenges in the last year and his VQA model is the current state-of-the-art in the area. His work has been published in prestigious journals and conferences such as TPAMI, CVPR, ICCV and ECCV.

My research interests are mainly in computer vision and machine learning. My previous research projects include modeling visual objects regardless of depictive styles and image understanding using contextual cues. I am currently leading a small team at the Adelaide to research on the topic of Vision-and-Language.

I have been in the computer vision filed for nearly 10 years and I have a strong track record in this field. Currently, I am working on the vision to language problem and I am especially an expert in the image captioning and visual question answering (VQA). In 2015, my image captioning model and VQA model achieved the leading performance in the Microsoft COCO Image Captioning Challenges and VQA Challenges. I have published several papers in the top journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), IEEE Signal Processing Magazine (SPM), Computer Vision and Image Understanding (CVIU). I also have published several papers on the top conference, such as International Joint Conference on Artificial Intelligence (IJCAI), AAAI, The Conference on Computer Vision and Pattern Recognition (CVPR) and the European Conference on Computer Vision (ECCV), and so on.

  • Appointments

    Date Position Institution name
    2023 - ongoing Associate Professor University of Adelaide
    2018 - 2022 Senior Lecturer University of Adelaide, Adelaide
    2017 - 2018 ARC Senior Research Associate Australia Centre for Robotic Vision, University of Adelaide
    2015 - 2017 Senior Research Associate University of Adelaide
    2014 - ongoing Research Intern Lenovo
    2011 - 2015 PhD University of Bath
  • Language Competencies

    Language Competency
    Chinese (Mandarin) Can read, write, speak, understand spoken and peer review
    English Can read, write, speak, understand spoken and peer review
  • Education

    Date Institution name Country Title
    2011 - 2015 University of Bath United Kingdom PhD
    2010 - 2011 University of Bath United Kingdom MSc
    2006 - 2010 China Jiliang University China BSc
  • Research Interests

  • Journals

    Year Citation
    2025 Tan, M., Chen, Q., Huang, Z., Wu, Q., Li, Y., & Zhou, J. (2025). Auto-3D-house Design from Structured User Requirements. Machine Intelligence Research, 18 pages.
    DOI
    2025 Zhang, J., Chen, X., Yang, B., Guan, Q., Chen, Q., Chen, J., . . . Xia, Y. (2025). Advances in attention mechanisms for medical image segmentation. Computer Science Review, 56, 100721.
    DOI
    2024 Cardoso, P., Fukushima, C. S., Maxhelaku, A., Poczai, P., Porto, M., Puksas, A., . . . Veríssimo, D. (2024). Reform wildlife trade in the European Union. Science, 383(6687), 1066.
    DOI
    2024 Stephens, A. N., Hobbs, S. J., Kang, S. W., Oehler, M. K., Jobling, T. W., & Allman, R. (2024). ReClassification of Patients with Ambiguous CA125 for Optimised Pre-Surgical Triage. Diagnostics, 14(7), 11114-11124.
    DOI Scopus1 Europe PMC1
    2024 Gillani, S. E., Al-Abdeli, Y. M., & Tian, Z. F. (2024). Influence of side dilution jets on swirling and non-swirling pulverised biomass turbulent annular flows. Powder Technology, 438, 119683.
    DOI
    2024 Phan, V. M. H., Xie, Y., Qi, Y., Liu, L., Liu, L., Zhang, B., . . . Verjans, J. W. (2024). Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-Training Framework. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, abs/2403.07636, 11492-11501.
    DOI
    2024 Sturman, D., Bell, E. A., Auton, J. C., Breakey, G. R., & Wiggins, M. W. (2024). The roles of phishing knowledge, cue utilization, and decision styles in phishing email detection. Applied Ergonomics, 119, 104309-1-104309-11.
    DOI Scopus3
    2024 Cabada, A., & Murray, P. (2024). The role of academics as refugee policy advocates: lessons from Australia. Policy Studies, 1-25.
    DOI
    2024 Hermoza, R., Nascimento, J. C., & Carneiro, G. (2024). Weakly-supervised preclinical tumor localization associated with survival prediction from lung cancer screening Chest X-ray images. Computerized Medical Imaging and Graphics, 115, 102395-1-102395-10.
    DOI
    2024 Sun, M., Suo, W., Wang, P., Niu, K., Liu, L., Lin, G., . . . Wu, Q. (2024). An Adaptive Correlation Filtering Method for Text-Based Person Search. International Journal of Computer Vision, 132(10), 4440-4455.
    DOI Scopus2
    2024 Xie, Y., Zhang, J., Xia, Y., & Wu, Q. (2024). UniMiSS+: Universal Medical Self-Supervised Learning From Cross-Dimensional Unpaired Data. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(12), 1-15.
    DOI Scopus1
    2024 Xie, Y., Gu, L., Harada, T., Zhang, J., Xia, Y., & Wu, Q. (2024). Rethinking masked image modeling for medical image representation. Medical Image Analysis, 98, 103304.
    DOI Scopus1
    2024 Jevric, M., Klepp, J., Puschnig, J., Lamb, O., Sumby, C. J., & Greatrex, B. W. (2024). Skeletal rearrangement of 6,8-dioxabicyclo[3.2.1]octan-4-ols promoted by thionyl chloride or Appel conditions.. Beilstein J Org Chem, 20(20), 823-829.
    DOI Scopus2
    2024 Adel Hamza, M., Abd El-Rahman, S. A., Ramadan, S. K., Ezz-Elregal, E. E. M., Rizk, S. A., & Abou-Gamra, Z. M. (2024). The enhanced visible-light-driven photocatalytic performance of nanocrystalline TiO<inf>2</inf> decorated by quinazolinone-photosensitizer toward photocatalytic treatment of simulated wastewater. Journal of Photochemistry and Photobiology A: Chemistry, 452, 10 pages.
    DOI Scopus2
    2024 Panganiban, H. P., Nguyen, C. D., Abdelhamid, Y. A., Ankravs, M., Karahalios, E., MacIsaac, C., . . . Deane, A. M. (2024). Feasibility of Embedding a Randomised Clinical Trial (RCT) into an Electronic Medical Record (EMR) for Patients Admitted to an Intensive Care Unit (ICU). Studies in Health Technology and Informatics, 310, 1420-1421.
    DOI
    2024 Ding, N., Deng, C., Tan, M., Du, Q., Ge, Z., & Wu, Q. (2024). Image Captioning With Controllable and Adaptive Length Levels. IEEE Transactions on Pattern Analysis and Machine Intelligence, 764(779), 1-16.
    DOI Scopus5
    2024 Gao, C., Liu, S., Chen, J., Wang, L., Wu, Q., Li, B., & Tian, Q. (2024). Room-Object Entity Prompting and Reasoning for Embodied Referring Expression. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(2), 994-1010.
    DOI Scopus4
    2024 Wen, Z., Niu, S., Li, G., Wu, Q., Tan, M., & Wu, Q. (2024). Test-Time Model Adaptation for Visual Question Answering with Debiased Self-Supervisions. IEEE Transactions on Multimedia, 26, 2137-2147.
    DOI Scopus3
    2023 Lin, Z., Zhang, D., Tao, Q., Shi, D., Haffari, G., Wu, Q., . . . Ge, Z. (2023). Medical visual question answering: A survey. Artificial Intelligence in Medicine, 143, 16 pages.
    DOI Scopus48 WoS1 Europe PMC6
    2023 Zhou, G., Hong, Y., & Wu, Q. (2023). NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large
    Language Models.
    2023 Wang, Z., Byrnes, O., Wang, H., Sun, R., Ma, C., Chen, H., . . . Xue, M. (2023). Data Hiding With Deep Learning: A Survey Unifying Digital Watermarking and Steganography. IEEE Transactions on Computational Social Systems, 10(6), 1-15.
    DOI Scopus27
    2023 Li, H., Huang, J., Jin, P., Song, G., Wu, Q., & Chen, J. (2023). Weakly-Supervised 3D Spatial Reasoning for Text-based Visual Question Answering. IEEE Transactions on Image Processing, 32, 3367-3382.
    DOI Scopus13
    2023 Tan, M., Wen, Z., Fang, L., & Wu, Q. (2023). Transformer-Based Relational Inference Network for Complex Visual Relational Reasoning. ACM Transactions on Multimedia Computing, Communications and Applications, 20(1), 23 pages.
    DOI Scopus3
    2023 Shi, X., Qiao, Y., Wu, Q., Liu, L., & Dayoub, F. (2023). Improving Online Source-free Domain Adaptation for Object Detection by
    Unsupervised Data Acquisition.
    2023 He, M., Du, W., Wen, Z., Du, Q., Xie, Y., & Wu, Q. (2023). Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning. IEEE Transactions on Circuits and Systems for Video Technology, 33(6), 2990-3002.
    DOI Scopus6
    2023 Qiao, Y., Qi, Y., Hong, Y., Yu, Z., Wang, P., & Wu, Q. (2023). HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(7), 8524-8537.
    DOI Scopus26 WoS1
    2023 Liu, D., Chen, Z., Huang, Z., Wu, Q., Song, Y., Yao, J., . . . Fang, G. (2023). In Situ Surface Modification Enables High Stability and Optoelectrical Performance for a Self-powered Photodetector. ADVANCED OPTICAL MATERIALS, 11(22), 10 pages.
    DOI
    2022 Xun, L., Zhang, H., Yan, Q., Wu, Q., & Zhang, J. (2022). VISOR-NET: Visibility Estimation Based on Deep Ordinal Relative Learning under Discrete-Level Labels. SENSORS, 22(16), 20 pages.
    DOI WoS4
    2022 Li, Y., Wu, Q., Lai, M., Zhao, J., Liu, Y., Fan, Y., . . . Liu, B. (2022). Influence of chemical disorder on mechanical and thermal properties of multi-component rare earth zirconate pyrochlores (<i>n</i>RE<sub>1/<i>n</i></sub>)<sub>2</sub>Zr<sub>2</sub>O<sub>7</sub>. JOURNAL OF APPLIED PHYSICS, 132(7), 11 pages.
    DOI WoS4
    2022 Ji, G., Chen, C., Zhou, M., Wen, W., Wang, C., Tang, J., . . . Feng, Z. (2022). Post-COVID-19 fatigue among COVID-19 in patients discharged from hospital: A meta-analysis. JOURNAL OF INFECTION, 84(5), 731-733.
    DOI WoS4
    2022 Wu, Y., Feng, T., Shen, Y., Fu, F., Meng, N., Li, X., . . . Wang, M. (2022). Total-body parametric imaging using the Patlak model: Feasibility of reduced scan time. MEDICAL PHYSICS, 49(7), 4529-4539.
    DOI WoS9
    2022 Kiwan, R., Jukes, A., Johnston, B., & Boulton, M. (2022). Concurrent Carotid Endarterectomy and Flow Diverting for Supraclinoid Artery Aneurysm. Canadian Journal of Neurological Sciences, 49(1), 146-148.
    DOI
    2022 Ling, L., Wu, Q., Huang, K., Wang, Y., & Wang, C. (2022). A Lightweight Bearing Fault Diagnosis Method Based on Multi-Channel Depthwise Separable Convolutional Neural Network. Electronics (Switzerland), 11(24), 21 pages.
    DOI Scopus7 WoS3
    2022 Manchin, A., Sherrah, J., Wu, Q., & van den Hengel, A. (2022). Program Generation from Diverse Video Demonstrations. BMVC 2022 - 33rd British Machine Vision Conference Proceedings.
    2022 Suo, W., Sun, M., Wang, P., Zhang, Y., & Wu, Q. (2022). Rethinking and Improving Feature Pyramids for One-stage Referring Expression Comprehension. IEEE Transactions on Image Processing, 32, 854-864.
    DOI Scopus10 WoS1
    2022 Sun, M., Suo, W., Wang, P., Zhang, Y., & Wu, Q. (2022). A proposal-free one-stage framework for referring expression comprehension and generation via dense cross-attention. IEEE Transactions on Multimedia, 25, 2446-2458.
    DOI Scopus23 WoS9
    2022 Deng, C., Wu, Q., Wu, Q., Hu, F., Lyu, F., & Tan, M. (2022). Visual Grounding Via Accumulated Attention. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3), 1670-1684.
    DOI Scopus9 WoS3
    2022 Parvaneh, A., Abbasnejad, E., Wu, Q., Shi, Q., & Van Den Hengel, A. (2022). Show, price and negotiate: a negotiator with online value look-ahead. IEEE Transactions on Multimedia, 24, 1426-1434.
    DOI Scopus2
    2022 Sun, Z., Liu, H., Wang, Q., Zhou, T., Wu, Q., & Tang, Z. (2022). Co-LDL: A Co-training-based Label Distribution Learning Method for Tackling Label Noise. IEEE Transactions on Multimedia, 24, 1093-1104.
    DOI Scopus24 WoS7
    2021 Yu, J., Jiang, X., Qin, Z., Zhang, W., Hu, Y., & Wu, Q. (2021). Learning Dual Encoding Model for Adaptive Visual Understanding in Visual Dialogue. IEEE TRANSACTIONS ON IMAGE PROCESSING, 30, 220-233.
    DOI Scopus24 WoS9
    2021 Wang, Y., Qi, Y., Yao, H., Gong, D., & Wu, Q. (2021). Image editing with varying intensities of processing. Computer Vision and Image Understanding, 211, 1-13.
    DOI Scopus4 WoS4
    2021 Zhang, W., Ma, C., Wu, Q., & Yang, X. (2021). Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning. IEEE Transactions on Circuits and Systems for Video Technology, 31(9), 3469-3481.
    DOI Scopus37 WoS18
    2021 Wang, H., Chen, H., Wu, Q., Ma, C., & Li, Y. (2021). Multi-Intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline. IEEE Open Journal of Intelligent Transportation Systems, 3, 126-136.
    DOI Scopus10 WoS2
    2021 Zhang, C., Wang, Q., Xie, G., Wu, Q., Shen, F., & Tang, Z. (2021). Robust Learning from Noisy Web Images via Data Purification for Fine-Grained Recognition. IEEE Transactions on Multimedia, 24, 1.
    DOI Scopus11 WoS2
    2020 Gao, C., Zhu, Q., Wang, P., Li, H., Liu, Y., Van den Hengel, A., & Wu, Q. (2020). Structured Multimodal Attentions for TextVQA. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(8), 1.
    DOI Scopus30
    2020 Chen, Q., Wu, Q., Chen, J., Wu, Q., Van Den Hengel, A., & Tan, M. (2020). Scripted Video Generation with a Bottom-Up Generative Adversarial Network. IEEE Transactions on Image Processing, 29, 7454-7467.
    DOI Scopus23 WoS9
    2020 Qiao, Y., Deng, C., & Wu, Q. (2020). Referring expression comprehension: a survey of methods and datasets. IEEE Transactions on Multimedia, 23, 4426-4440.
    DOI Scopus49 WoS11
    2020 Yu, J., Zhang, W., Lu, Y., Qin, Z., Hu, Y., Tan, J., & Wu, Q. (2020). Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval. IEEE Transactions on Multimedia, 22(12), 3196-3209.
    DOI Scopus77 WoS47
    2020 Huang, Y., Wu, Q., Wang, W., & Wang, L. (2020). Image and Sentence Matching via Semantic Concepts and Order Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(3), 636-650.
    DOI Scopus30 WoS15
    2020 Triplett, J. D., Khor, T. S., & Kermode, A. G. (2020). Recurrent upper limb neuropathies secondary to an epithelioid haemangioendothelioma – A rare mimic of nerve tumours. Journal of Clinical Neuroscience, 73, 326-328.
    DOI Scopus2 Europe PMC2
    2020 Liu, X., Dai, P., Gu, T., Wu, Q., Wei, H., Liu, S., . . . Zhao, Q. (2020). Cyclometalated iridium(III) complexes containing an anthracene unit for sensing and imaging singlet oxygen in cellular mitochondria. JOURNAL OF INORGANIC BIOCHEMISTRY, 209, 10 pages.
    DOI WoS10
    2019 Li, T., Shi, P., Zhang, R., Liu, H., Cheng, X. -B., & Zhang, Q. (2019). Dendrite-free sandwiched ultrathin lithium metal anode with even lithium plating and stripping behavior. NANO RESEARCH, 12(9), 2224-2229.
    DOI WoS36
    2019 Ding, L. -H., Sun, B., & Shi, P. (2019). Empirical study of knowledge network based on complex network theory. ACTA PHYSICA SINICA, 68(12), 15 pages.
    DOI WoS2
    2019 Liu, W., Li, Y., & Wu, Q. (2019). An Attribute-Based High-Level Image Representation for Scene Classification. IEEE Access, 7, 4629-4640.
    DOI Scopus4 WoS2
    2019 Lyu, F., Wu, Q., Hu, F., Wu, Q., & Tan, M. (2019). Attend and Imagine: Multi-Label Image Classification with Visual Attention and Recurrent Neural Networks. IEEE Transactions on Multimedia, 21(8), 1971-1981.
    DOI Scopus57 WoS44
    2019 Zhang, J., Wu, Q., Zhang, J., Shen, C., Lu, J., & Wu, Q. (2019). Heritage image annotation via collective knowledge. Pattern Recognition, 93, 204-214.
    DOI Scopus5 WoS5
    2019 Zhang, J., Xie, Y., Wu, Q., & Xia, Y. (2019). Medical image classification using synergic deep learning. Medical Image Analysis, 54, 10-19.
    DOI Scopus302 WoS175 Europe PMC50
    2018 Wu, Q., Shen, C., Wang, P., Dick, A., & van den Hengel, A. (2018). Image captioning and visual question answering based on attributes and external knowledge. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(6), 1367-1381.
    DOI Scopus326 WoS196 Europe PMC8
    2018 Zhang, J., Wu, Q., Shen, C., Zhang, J., & Lu, J. (2018). Multilabel image classification with regional latent semantic dependencies. IEEE Transactions on Multimedia, 20(10), 2801-2813.
    DOI Scopus166 WoS46
    2018 Hu, L., Zhu, Q., Wu, Q., Li, D., An, Z., & Xu, B. (2018). Natural Biomass-Derived Hierarchical Porous Carbon Synthesized by an <i>in Situ</i> Hard Template Coupled with NaOH Activation for Ultrahigh Rate Supercapacitors. ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 6(11), 13949-13959.
    DOI WoS115
    2018 Sun, P., Wu, Q., Sun, X., Miao, H., Deng, W., Zhang, W., . . . Huang, W. (2018). J-Aggregate squaraine nanoparticles with bright NIR-II fluorescence for imaging guided photothermal therapy. CHEMICAL COMMUNICATIONS, 54(95), 13395-13398.
    DOI WoS123
    2018 Zhang, K. Y., Zhang, T., Wei, H., Wu, Q., Liu, S., Zhao, Q., & Huang, W. (2018). Phosphorescent iridium(III) complexes capable of imaging and distinguishing between exogenous and endogenous analytes in living cells. CHEMICAL SCIENCE, 9(36), 7236-7240.
    DOI WoS41
    2018 Wu, Q., Ma, H., Ling, K., Gan, N., Cheng, Z., Gu, L., . . . Huang, W. (2018). Reversible Ultralong Organic Phosphorescence for Visual and Selective Chloroform Detection. ACS APPLIED MATERIALS & INTERFACES, 10(39), 33730-33736.
    DOI WoS68
    2018 Lu, X., Yuan, P., Zhang, W., Wu, Q., Wang, X., Zhao, M., . . . Fan, Q. (2018). A highly water-soluble triblock conjugated polymer for <i>in vivo</i> NIR-II imaging and photothermal therapy of cancer. POLYMER CHEMISTRY, 9(22), 3118-3126.
    DOI WoS59
    2018 Cai, S., Shi, H., Zhang, Z., Wang, X., Ma, H., Gan, N., . . . Huang, W. (2018). Hydrogen-Bonded Organic Aromatic Frameworks for Ultralong Phosphorescence by Intralayer π-π Interactions. ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 57(15), 4005-4009.
    DOI WoS187
    2018 Li, S., Cheng, L., Wu, Q., Zhang, Q., Yang, J., & Liu, J. (2018). Mechanism of Aerobic Alcohol Oxidation Mediated by Water-Soluble Cu<SUP>II</SUP>-TEMPO Catalyst in Water: A Density Functional Theory Study. CHEMISTRYSELECT, 3(4), 1268-1274.
    DOI WoS2
    2018 Sun, C., Ran, X., Wang, X., Cheng, Z., Wu, Q., Cai, S., . . . Huang, W. (2018). Twisted Molecular Structure on Tuning Ultralong Organic Phosphorescence. JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 9(2), 335-339.
    DOI WoS67
    2018 Cui, S., Wang, X., Zhang, X., Xia, W., Tang, X., Lin, B., . . . Shen, X. (2018). Preparation of magnetic MnFe<sub>2</sub>O<sub>4</sub>-Cellulose aerogel composite and its kinetics and thermodynamics of Cu(II) adsorption. CELLULOSE, 25(1), 735-751.
    DOI WoS49
    2018 Gu, L., Shi, H., Miao, C., Wu, Q., Cheng, Z., Cai, S., . . . Huang, W. (2018). Prolonging the lifetime of ultralong organic phosphorescence through dihydrogen bonding. JOURNAL OF MATERIALS CHEMISTRY C, 6(2), 226-233.
    DOI WoS85
    2018 Wu, Q., Li, Y., Wang, C., Zhang, J., Huang, M., Kim, J. K., & Wu, Y. (2018). 1,4-Refunctionalization of β-diketones to γ-keto nitriles <i>via</i> C-C single bond cleavage. ORGANIC CHEMISTRY FRONTIERS, 5(16), 2496-2500.
    DOI WoS15
    2018 Bian, L., Shi, H., Wang, X., Ling, K., Ma, H., Li, M., . . . Huang, W. (2018). Simultaneously Enhancing Efficiency and Lifetime of Ultralong Organic Phosphorescence Materials by Molecular Self-Assembly. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 140(34), 10734-10739.
    DOI WoS359
    2018 Chen, H., Xu, J., Xiao, G., Wu, Q., & Zhang, S. (2018). Fast auto-clean CNN model for online prediction of food materials. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 117, 218-227.
    DOI WoS19
    2018 Deng, W., Wu, Q., Sun, P., Yuan, P., Lu, X., Fan, Q., & Huang, W. (2018). Zwitterionic diketopyrrolopyrrole for fluorescence/photoacoustic imaging guided photodynamic/photothermal therapy. POLYMER CHEMISTRY, 9(20), 2805-2812.
    DOI WoS29
    2018 Cai, S., Shi, H., Tian, D., Ma, H., Cheng, Z., Wu, Q., . . . Huang, W. (2018). Enhancing Ultralong Organic Phosphorescence by Effective π-Type Halogen Bonding. ADVANCED FUNCTIONAL MATERIALS, 28(9), 7 pages.
    DOI WoS243
    2018 Cheng, Z., Shi, H., Ma, H., Bian, L., Wu, Q., Gu, L., . . . Huang, W. (2018). Ultralong Phosphorescence from Organic Ionic Crystals under Ambient Conditions. ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 57(3), 678-682.
    DOI WoS160
    2017 Hu, L., Ma, L., Zhu, Q., Yu, L., Wu, Q., Hu, C., . . . Xu, B. (2017). Organic salt-derived nitrogen-rich, hierarchical porous carbon for ultrafast supercapacitors. NEW JOURNAL OF CHEMISTRY, 41(22), 13611-13618.
    DOI WoS11
    2017 Li, S., Cheng, L., Wu, Q., Zhang, Q., Yang, J., & Liu, J. (2017). Mechanistic Insight into the 2° Alcohol Oxidation Mediated by an Efficient Cu<SUP>I</SUP>/L-Proline-TEMPO Catalyst-A Density Functional Theory Study. CATALYSTS, 7(9), 15 pages.
    DOI WoS3
    2017 Teney, D., Wu, Q., & Van Den Hengel, A. (2017). Visual Question Answering: a tutorial. IEEE Signal Processing Magazine, 34(6), 63-75.
    DOI Scopus32 WoS17
    2017 Wang, P., Wu, Q., Shen, C., Dick, A., & Van Den Hengel, A. (2017). FVQA: fact-based Visual Question Answering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(10), 2413-2427.
    DOI Scopus329 WoS173 Europe PMC5
    2017 Wu, Q., Teney, D., Wang, P., Shen, C., Dick, A., & van den Hengel, A. (2017). Visual question answering: a survey of methods and datasets. Computer Vision and Image Understanding, 163, 21-40.
    DOI Scopus269 WoS156
    2016 Shen, L., Min, Y. -T., Bai, X., Wang, J., Wu, Q., Yang, J., . . . Li, Q. -Y. (2016). Four Gadolinium Coordination Compounds Derived from Various Tetrazole-Containing Carboxylic Acids. ZEITSCHRIFT FUR ANORGANISCHE UND ALLGEMEINE CHEMIE, 642(19), 1112-1119.
    DOI WoS3
    2016 Wu, J., Bai, Y., Lu, Y. M., Wang, J., Wu, Q., Yang, G. W., & Li, Q. Y. (2016). Substituted group-directed magnesium(II) coordination compounds based on the derivatives of tetrazole-2-acetic acid. JOURNAL OF THE IRANIAN CHEMICAL SOCIETY, 13(12), 2155-2162.
    DOI WoS3
    2016 Shen, L., Bai, Y., Min, Y. -T., Jia, T. -T., Wu, Q., Wang, J., . . . Yang, G. -W. (2016). Coordination Architectures of energetic Cd (II) coordination polymers constructed by the bifunctional substituted-tetrazole-carboxylate ligands. JOURNAL OF SOLID STATE CHEMISTRY, 244, 129-139.
    DOI WoS13
    2016 Zhang, J., Tang, Z., Giddings, R., Wu, Q., Wang, W., Cao, B., . . . Tang, J. M. (2016). Stage-Dependent DSP Operation Range Clipping-Induced Bit Resolution Reductions of Full Parallel 64-Point FFTs Incorporated in FPGA-Based Optical OFDM Receivers. JOURNAL OF LIGHTWAVE TECHNOLOGY, 34(16), 3752-3760.
    DOI WoS12
    2016 Miao, L. -L., Guo, M. -Y., Wu, J., Lu, Y. -M., Wu, Q., Bai, Y., . . . Yang, G. -W. (2016). Counter anion and pH directed assembly of europium(III) compounds based on tetrazole containing carboxylic acids. INORGANICA CHIMICA ACTA, 450, 176-181.
    DOI WoS12
    2016 Yang, G. W., Zhang, Y. T., Wu, Q., Cao, M. J., Wu, J., Yue, Q. Y., & Li, Q. Y. (2016). Nitrogen-rich 5-(4-pyridyl)tetrazole-2-acetic acid and its alkaline earth metal coordination polymers for potential energetic materials. INORGANICA CHIMICA ACTA, 450, 364-371.
    DOI WoS16
    2016 Wang, C., Li, Y., Gong, M., Wu, Q., Zhang, J., Kim, J. K., . . . Wu, Y. (2016). Method for Direct Synthesis of α-Cyanomethyl-β-dicarbonyl Compounds with Acetonitrile and 1,3-Dicarbonyls. ORGANIC LETTERS, 18(17), 4151-4153.
    DOI WoS40
    2016 Du, J., Wang, M., Chen, N., Xie, S., Yu, H., & Wu, Q. (2016). Instability Origin and Improvement Scheme of Facial Alq<sub>3</sub> for Blue OLED Application. CHEMICAL RESEARCH IN CHINESE UNIVERSITIES, 32(3), 423-427.
    DOI WoS2
    2016 Tang, X. -L., Lin, B. -L., Cui, S., Zhang, X., Zhong, Y., Wu, Q., . . . Wang, T. -W. (2016). Paclitaxel modified Fe<sub>3</sub>O<sub>4</sub> loaded albumin nanoparticles as drug delivery vehicles by self-assembly. RSC ADVANCES, 6(49), 43284-43292.
    DOI WoS11
    2016 Wang, J., Zhang, F. F., Wei, B., Wu, Q., Cao, M. J., Bai, Y., & Yang, G. W. (2016). Counterion-Directed Assembly of Praseodymium(III) Compounds based on the Flexible Ligand 5-Aminotetrazole-1-propionic Acid (Hatzp). ZEITSCHRIFT FUR ANORGANISCHE UND ALLGEMEINE CHEMIE, 642(2), 169-173.
    DOI WoS6
    2016 Shen, L., Cao, M. J., Zhang, F. F., Wu, Q., Zhao, L. Y., Lu, Y. M., . . . Zou, J. H. (2016). Three new manganese(II) coordination complexes based on tetrazole carboxylate ligands. TRANSITION METAL CHEMISTRY, 41(2), 125-131.
    DOI WoS26
    2016 Sun, Y., Wang, X., Du, J., Chen, N., Yu, H., Wu, Q., & Meng, X. (2016). Amorphous Structure and Bonding Chemistry of Aluminium Antimonide(AlSb) Alloy for Phase-change Memory Device. CHEMICAL RESEARCH IN CHINESE UNIVERSITIES, 32(1), 76-81.
    DOI WoS4
    2015 Wu, Q., Cao, M. J., Wei, B., Bai, Y., Tian, H., Wang, J., . . . Yang, G. W. (2015). pH dependent synthesis of structurally diverse praseodymium(III) coordination polymers based on isomeric ligands. INORGANIC CHEMISTRY COMMUNICATIONS, 62, 111-114.
    DOI WoS26
    2015 Yang, G. W., Zhang, F. F., Wu, Q., Cao, M. J., Bai, Y., Li, Q. Y., . . . Zou, J. H. (2015). Substituted group directed assembly of energetic lead(II) compounds based on structure-relevant ligands. RSC ADVANCES, 5(103), 84439-84445.
    DOI WoS31
    2015 Nie, Y., Speakman, J. R., Wu, Q., Zhang, C., Hu, Y., Xia, M., . . . Wei, F. (2015). Exceptionally low daily energy expenditure in the bamboo-eating giant panda. SCIENCE, 349(6244), 171-174.
    DOI WoS113
    2015 Hall, P., Cai, H., Wu, Q., & Corradi, T. (2015). Cross-depiction problem: recognition and synthesis of photographs and artwork. Computational Visual Media, 1(2), 91-103.
    DOI Scopus34
    2014 Wu, Q., & Xiao, H. (2014). Dynamic CGE Model and Simulation Analysis on the Impact of Citizenization of Rural Migrant Workers on the Labor and Capital Markets in China. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2014, 8 pages.
    DOI WoS3
    2011 Fu, Z., Wu, Q., Gong, W., Shi, L., Li, W., & Dai, Z. (2011). Photoluminescence properties and analysis of spectral structure of R<sub>2</sub>(MoO<sub>4</sub>)<sub>3</sub>: Eu<SUP>3+</SUP> (R = La, Gd) phosphors. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA B-OPTICAL PHYSICS, 28(4), 709-713.
    DOI WoS8
    2011 Fu, Z., Gong, W., Li, H., Wu, Q., Li, W., Yang, H. K., & Jeong, J. H. (2011). Synthesis and spectral properties of nanocrystalline Eu<SUP>3+</SUP>-doped pyrochlore oxide M<sub>2</sub>Sn<sub>2</sub>O<sub>7</sub> (M = Gd and Y). CURRENT APPLIED PHYSICS, 11(3), 933-938.
    DOI WoS13
    2011 Wu, Q., Li, H., Xia, W., Fu, X., Fu, Z., Zhou, S., . . . Jeong, J. H. (2011). Investigation of the Structure and Photoluminescence Properties of Ln<SUP>3+</SUP>(Eu<SUP>3+</SUP>, Dy<SUP>3+</SUP>, Sm<SUP>3+</SUP>) Ion-Doped NaY(MoO<sub>4</sub>)<sub>2</sub>. JOURNAL OF THE ELECTROCHEMICAL SOCIETY, 158(12), J387-J393.
    DOI WoS15
    2006 Zhang, F., Wu, Q., Chen, Z. -C., Li, X., Jiang, X. -M., & Lin, X. -F. (2006). Bioactive galactose-branched polyelectrolyte multilayers and microcapsules: Self-assembly, characterization, and biospecific lectin adsorption. LANGMUIR, 22(20), 8458-8464.
    DOI WoS33
    2006 Bi, J., Wu, Q., & Li, Z. (2006). On estimating clock skew for one-way measurements. COMPUTER COMMUNICATIONS, 29(8), 1213-1225.
    DOI WoS9
    - Zhuang, B., Wu, Q., Shen, C., Reid, I., & Hengel, A. V. D. (n.d.). Care about you: towards large-scale human-centric visual relationship
    detection.
  • Books

    Year Citation
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Visual Question Answering. Springer Nature Singapore.
    DOI
    2020 Garg, S., Sünderhauf, N., Dayoub, F., Morrison, D., Cosgun, A., Carneiro, G., . . . Milford, M. (2020). Semantics for Robotic Mapping, Perception and Interaction: A Survey (Vol. 8). United States: Now Publishers.
    DOI
  • Book Chapters

    Year Citation
    2025 Zhou, G., Hong, Y., Wang, Z., Wang, X. E., & Wu, Q. (2025). NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models. In A. Leonardis, E. Ricci, S. Roth, O. Russakovsky, T. Sattler, & G. Varol (Eds.), Lecture Notes in Computer Science (Vol. 15065 LNCS, pp. 260-278). SPRINGER INTERNATIONAL PUBLISHING AG.
    DOI
    2025 Chen, Q., Xie, Y., Wu, B., Chen, X., Ang, J., To, M. S., . . . Wu, Q. (2025). Act Like a Radiologist: Radiology Report Generation Across Anatomical Regions. In Lecture Notes in Computer Science (Vol. 15477 LNCS, pp. 36-52). Springer Nature Singapore.
    DOI
    2024 Huang, Z., Ali, Z., Nguyen, G. D., Karakus, M., Bui, H., & Tran, T. (2024). Effects of Printing Patterns on Tensile Strength of 3D-Printed Cement Mortar. In Lecture Notes in Civil Engineering (pp. 344-351). Springer Nature Singapore.
    DOI
    2023 Agutter, K. (2023). STAYING OR DEPARTING: DISPLACED YOUTH IN AUSTRALIA. In When Migrants Fail to Stay: New Histories on Departures and Migration (pp. 187-199). Bloomsbury Publishing Plc.
    DOI
    2023 Neshat, M., Sergiienko, N. Y., da Silva, L. S. P., Amini, E., Nasiri, M., & Mirjalili, S. (2023). Adaptive bi-level whale optimization algorithm for maximizing the power output of hybrid wave-wind energy site. In M. Seyedali (Ed.), Handbook of Whale Optimization Algorithm: Variants, Hybrids, Improvements, and Applications (pp. 291-308). Elsevier.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Video Representation Learning. In Advances in Computer Vision and Pattern Recognition (pp. 111-117). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Vision-and-Language Pretraining for VQA. In Advances in Computer Vision and Pattern Recognition (pp. 91-107). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Text-Based VQA. In Advances in Computer Vision and Pattern Recognition (pp. 177-187). Springer Nature Singapore.
    DOI Scopus1
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Deep Learning Basics. In Advances in Computer Vision and Pattern Recognition (pp. 15-26). Springer Nature Singapore.
    DOI Scopus1
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Summary and Outlook. In Advances in Computer Vision and Pattern Recognition (pp. 233-236). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Knowledge-Based VQA. In Advances in Computer Vision and Pattern Recognition (pp. 73-90). Springer Nature Singapore.
    DOI Scopus2
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Medical VQA. In Advances in Computer Vision and Pattern Recognition (pp. 165-176). Springer Nature Singapore.
    DOI Scopus6
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Question Answering (QA) Basics. In Advances in Computer Vision and Pattern Recognition (pp. 27-31). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Visual Dialogue. In Advances in Computer Vision and Pattern Recognition (pp. 199-218). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Referring Expression Comprehension. In Advances in Computer Vision and Pattern Recognition (pp. 219-230). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Classical Visual Question Answering. In Advances in Computer Vision and Pattern Recognition (pp. 35-72). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Advanced Models for Video Question Answering. In Advances in Computer Vision and Pattern Recognition (pp. 135-143). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Video Question Answering. In Advances in Computer Vision and Pattern Recognition (pp. 119-133). Springer Nature Singapore.
    DOI Scopus1
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Visual Question Generation. In Advances in Computer Vision and Pattern Recognition (pp. 189-197). Springer Nature Singapore.
    DOI
    2022 Wu, Q., Wang, P., Wang, X., He, X., & Zhu, W. (2022). Embodied VQA. In Advances in Computer Vision and Pattern Recognition (pp. 147-164). Springer Nature Singapore.
    DOI Scopus1
    2015 Brown-Grant, R. (2015). Introduction. In R. BrownGrant, A. D. Hedeman, & B. Ribemont (Eds.), Advances in Computer Vision and Pattern Recognition (pp. 1-13). ROUTLEDGE.
    DOI WoS1
  • Conference Papers

    Year Citation
    2025 Qiao, Y., Liu, Q., Liu, J., Liu, J., & Wu, Q. (2025). LLM as Copilot for Coarse-Grained Vision-and-Language Navigation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 15063 LNCS (pp. 459-476). Milan, Italy: Springer Science and Business Media Deutschland GmbH.
    DOI
    2024 Wu, Y., Xie, Y., Luo, X., Wu, Q., & Cai, J. (2024). Dataset, Challenge, and Evaluation for Tumor Segmentation Variability. In MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia (pp. 11302-11303). ACM.
    DOI
    2024 Qu, X., Yu, J., Gai, K., Zhuang, J., Tang, Y., Xiong, G., . . . Wu, Q. (2024). Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning. In MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia (pp. 4581-4590). ACM.
    DOI
    2024 Hong, H., Wang, S., Huang, Z., Wu, Q., & Liu, J. (2024). Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments. In MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia (pp. 7639-7648). ACM.
    DOI
    2024 Li, Y., Yu, J., Gai, K., Liu, B., Xiong, G., & Wu, Q. (2024). T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval. In Proceedings of the 32nd ACM International Conference on Multimedia (pp. 3955-3963). ACM.
    DOI
    2024 Wang, X., Zhuang, B., & Wu, Q. (2024). ModaVerse: Efficiently Transforming Modalities with LLMs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 26596-26606). IEEE.
    DOI Scopus1
    2024 Phan, V. M. H., Xie, Y., Qi, Y., Liu, L., Liu, L., Zhang, B., . . . Verjans, J. W. (2024). Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-Training Framework.. In CVPR (pp. 11492-11501). Seattle, WA, USA: IEEE.
    2024 Lu, Z., Xie, Y., Zeng, Q., Lu, M., Wu, Q., & Xia, Y. (2024). Spot the Difference: Difference Visual Question Answering with Residual Alignment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 15005 LNCS (pp. 649-658). Marrakesh: Springer Science and Business Media Deutschland GmbH.
    DOI
    2024 Ye, Y., Xie, Y., Zhang, J., Chen, Z., Wu, Q., & Xia, Y. (2024). Continual Self-Supervised Learning: Towards Universal Multi-Modal Medical Data Representation Learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 11114-11124). Online: IEEE Computer Society.
    DOI
    2024 Hong, H., Wang, S., Huang, Z., Wu, Q., & Liu, J. (2024). Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts. In IJCAI International Joint Conference on Artificial Intelligence (pp. 839-847). Jeju: International Joint Conferences on Artificial Intelligence.
    2024 Xie, Y., Chen, Q., Wang, S., To, M. S., Lee, I., Khoo, E. W., . . . Wu, Q. (2024). PairAug: What Can Augmented Image-Text Pairs Do for Radiology?. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 11652-11661). Seattle, Washington, USA: IEEE.
    DOI Scopus1
    2024 Zhou, G., Hong, Y., & Wu, Q. (2024). NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 7641-7649). Online: Association for the Advancement of Artificial Intelligence (AAAI).
    DOI Scopus12
    2024 Mohammadi, B., Hong, Y., Qi, Y., Wu, Q., Pan, S., & Shi, J. Q. (2024). Augmented Commonsense Knowledge for Remote Object Grounding. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 4269-4277). Online: Association for the Advancement of Artificial Intelligence (AAAI).
    DOI Scopus5
    2024 Chen, Q., Pitawela, D., Zhao, C., Zhou, G., Chen, H. T., & Wu, Q. (2024). WebVLN: Vision-and-Language Navigation on Websites. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 1165-1173). Online: Association for the Advancement of Artificial Intelligence (AAAI).
    DOI Scopus2
    2024 Tang, Y., Yu, J., Gai, K., Zhuang, J., Xiong, G., Hu, Y., & Wu, Q. (2024). Context-I2W: Mapping Images to Context-Dependent Words for Accurate Zero-Shot Composed Image Retrieval. In Proceedings of the AAAI Conference on Artificial Intelligence Vol. 38 (pp. 5180-5188). Online: Association for the Advancement of Artificial Intelligence (AAAI).
    DOI Scopus8
    2024 Wang, Z., Li, J., Hong, Y., Wang, Y., Wu, Q., Bansal, M., . . . Qiao, Y. (2024). Scaling Data Generation in Vision-and-Language Navigation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 11975-11986). Paris, France: IEEE.
    DOI Scopus11
    2024 Wang, X., Wu, Q., & Zhuang, B. (2024). ModaVerse: Efficiently Transforming Modalities with LLMs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 26606-26616). Online: IEEE.
    2024 Wu, B., Xie, Y., Zhang, Z., Ge, J., Yaxley, K., Bahadir, S., . . . To, M. S. (2024). BHSD: A 3D Multi-class Brain Hemorrhage Segmentation Dataset. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14348 LNCS (pp. 147-156). Online: Springer Nature Switzerland.
    DOI Scopus1
    2024 Yu, Z., Qiao, Y., Xie, Y., & Wu, Q. (2024). Multi-modal Adapter for Medical Vision-and-Language Learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14348 LNCS (pp. 393-402). Online: Springer Nature Switzerland.
    DOI
    2024 Deng, C., Chen, D., & Wu, Q. (2024). Identity-Consistent Aggregation for Video Object Detection. In Proceedings of the IEEE International Conference on Computer Vision (pp. 13388-13398). Online: IEEE.
    DOI
    2024 Qiao, Y., Yu, Z., & Wu, Q. (2024). VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 15397-15406). Online: IEEE.
    DOI Scopus7
    2024 Liu, S., Zhang, H., Qi, Y., Wang, P., Zhang, Y., & Wu, Q. (2024). AerialVLN: Vision-and-Language Navigation for UAVs. In Proceedings of the IEEE International Conference on Computer Vision (pp. 15338-15348). Online: IEEE.
    DOI Scopus10
    2024 Tian, X., Yang, Y. L., & Wu, Q. (2024). ShapeScaffolder: Structure-Aware 3D Shape Generation from Text. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2715-2724). Paris, France: IEEE.
    DOI Scopus2
    2023 Yu, Z., Xie, Y., Xia, Y., & Wu, Q. (2023). PLMVQA: Applying Pseudo Labels for Medical Visual Question Answering with Limited Data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14394 LNCS (pp. 357-367). Online: Springer Nature Switzerland.
    DOI
    2023 Qiao, Y., Qi, Y., Yu, Z., Liu, J., & Wu, Q. (2023). March in Chat: Interactive Prompting for Remote Embodied Referring Expression. In Proceedings of the IEEE International Conference on Computer Vision (pp. 15712-15721). Paris, France: IEEE.
    DOI Scopus10
    2023 Deng, C., Chen, Q., Qin, P., Chen, D., & Wu, Q. (2023). Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval. In Proceedings of the IEEE International Conference on Computer Vision (pp. 15602-15612). Online: IEEE.
    DOI Scopus11
    2023 Suo, W., Sun, M., Liu, W., Gao, Y., Wang, P., Zhang, Y., & Wu, Q. (2023). S<SUP>3</SUP>C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning. In 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR (pp. 2646-2656). Online: IEEE COMPUTER SOC.
    DOI
    2023 Wen, Z., Wang, Y., Tan, M., Wu, Q., & Wu, Q. (2023). Digging out Discrimination Information from Generated Samples for Robust Visual Question Answering. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 6910-6928). Dubrovnik, Croatia and Online: Association for Computational Linguistics.
    DOI Scopus8
    2023 Wu, Q., Chao, W., Zhou, X., & Luo, Z. (2023). TP-Detector: Detecting Turning Points in the Engineering Process of Large-scale Projects. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings of the System Demonstrations (pp. 177-185). Singapore: Association for Computational Linguistics (ACL).
    2023 Rodriguez-Opazo, C., Marrese-Taylor, E., Fernando, B., Takamura, H., & Wu, Q. (2023). Memory-efficient Temporal Moment Localization in Long Videos. In A. Vlachos, & I. Augenstein (Eds.), 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 (pp. 1909-1924). CROATIA, Dubrovnik: ASSOC COMPUTATIONAL LINGUISTICS-ACL.
    2023 Gao, J., Blair, A., Wu, Q., & Pagnucco, M. (2023). LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering. In Advances in Neural Information Processing Systems Vol. 36 (pp. 13 pages). Online: Neural information processing systems foundation.
    Scopus1
    2023 Rodriguez-Opazo, C., Marrese-Taylor, E., Fernando, B., Takamura, H., & Wu, Q. (2023). Memory-efficient Temporal Moment Localization in Long Videos. In EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 1901-1916). Online: Association for Computational Linguistics (ACL).
    Scopus2
    2023 Chen, Q., Deng, C., & Wu, Q. (2023). Learning Distinct and Representative Modes for Image Captioning. In Advances in Neural Information Processing Systems Vol. 35 (pp. 14 pages). USA: Neural information processing systems foundation.
    Scopus13
    2023 Huang, Y., Leung, C. H., Ma, S., Yuan, Z., Wu, Q., Wang, S., . . . Huang, Z. (2023). Towards Balanced Representation Learning for Credit Policy Evaluation. In Proceedings of the International Conference on Artificial Intelligence and Statistics Vol. 206 (pp. 3677-3692). Valencia, Spain (virtual event).
    2023 Zhao, C., Qi, Y., & Wu, Q. (2023). Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes. In Proceedings of the 31st ACM International Conference on Multimedia (pp. 4349-4358). Ottawa ON Canada: ACM.
    DOI Scopus6
    2023 Cong, G., Li, L., Qi, Y., Zha, Z. J., Wu, Q., Wang, W., . . . Huang, Q. (2023). Learning to Dub Movies via Hierarchical Prosody Models. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2023-June (pp. 14687-14697). Online: IEEE.
    DOI Scopus10
    2023 Guan, Q., Xie, Y., Yang, B., Zhang, J., Liao, Z., Wu, Q., & Xia, Y. (2023). Unpaired Cross-Modal Interaction Learning for COVID-19 Segmentation on Limited CT Images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14222 (pp. 603-613). Vancouver, BC, Canada: Springer Nature Switzerland.
    DOI Scopus3
    2023 Xie, Y., Gu, L., Harada, T., Zhang, J., Xia, Y., & Wu, Q. (2023). MedIM: Boost Medical Image Representation via Radiology Report-Guided Masking. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14220 (pp. 13-23). Vancouver, BC, Canada: Springer Nature Switzerland.
    DOI Scopus4
    2022 Tian, X., Yang, Y. L., & Wu, Q. (2022). Enhancing Person Synthesis in Complex Scenes via Intrinsic and Contextual Structure Modeling. In BMVC 2022 - 33rd British Machine Vision Conference Proceedings.
    2022 Jing, C., Jia, Y., Wu, Y., Li, C., & Wu, Q. (2022). Learning the Dynamics of Visual Relational Reasoning via Reinforced Path Routing. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 Vol. 36 (pp. 1122-1130). Palo Alto, California USA: AAAI Press.
    DOI Scopus6
    2022 Kazemi Moghaddam, M., Abbasnejad, E., Wu, Q., Qinfeng Shi, J., & Van Den Hengel, A. (2022). ForeSI: Success-Aware Visual Navigation Agent. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2022) (pp. 3401-3410). Online: IEEE.
    DOI Scopus8 WoS1
    2022 Qi, Y., Pan, Z., Hong, Y., Yang, M. H., Van Den Hengel, A., & Wu, Q. (2022). The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2021) (pp. 1635-1644). online: IEEE.
    DOI Scopus51 WoS2
    2022 Suo, W., Sun, M., Niu, K., Gao, Y., Wang, P., Zhang, Y., & Wu, Q. (2022). A Simple and Robust Correlation Filtering Method for Text-Based Person Search. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 13695 LNCS (pp. 726-742). Online: Springer Nature Switzerland.
    DOI Scopus40 WoS1
    2022 Gu, J., Stefani, E., Wu, Q., Thomason, J., & Wang, X. E. (2022). Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions. In PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS) Vol. 1 (pp. 7606-7623). Online: ASSOC COMPUTATIONAL LINGUISTICS-ACL.
    Scopus46 WoS11
    2022 Zhu, W., Qi, Y., Narayana, P., Sone, K., Basu, S., Wang, E. X., . . . Wang, W. Y. (2022). Diagnosing Vision-and-Language Navigation: What Really Matters. In NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 5981-5993). Online: ssociation for Computational Linguistics (ACL).
    Scopus17 WoS1
    2022 Chen, Q., Tan, M., Qi, Y., Zhou, J., Li, Y., & Wu, Q. (2022). V2C: Visual Voice Cloning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR, 2022) Vol. 2022-June (pp. 21210-21219). Online: IEEE.
    DOI Scopus16 WoS2
    2022 Jing, C., Jia, Y., Wu, Y., Liu, X., & Wu, Q. (2022). Maintaining Reasoning Consistency in Compositional Visual Question Answering. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 5089-5098). Online: IEEE.
    DOI Scopus20 WoS1
    2022 Hong, Y., Wang, Z., Wu, Q., & Gould, S. (2022). Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 15418-15428). Online: IEEE.
    DOI Scopus25 WoS4
    2022 Xie, Y., Zhang, J., Xia, Y., & Wu, Q. (2022). UniMiSS: Universal Medical Self-supervised Learning via Breaking Dimensionality Barrier. In Proceedings, Part XXI of the 17th European Conference on Computer Vision (ECCV 2022), as published in Lecture Notes in Computer Science Vol. 13681 LNCS (pp. 558-575). Online: Springer.
    DOI Scopus34 WoS1
    2022 Ding, Y., Yu, J., Liu, B., Hu, Y., Cui, M., & Wu, Q. (2022). MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 5079-5088). Online: IEEE.
    DOI Scopus77
    2022 Qiao, Y., Qi, Y., Hong, Y., Yu, Z., Wang, P., & Wu, Q. (2022). HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2022-June (pp. 15397-15406). New Orleans, LA, USA: IEEE.
    DOI Scopus44
    2022 Chen, C., Hu, Z., Jin, S., Xiao, L., Hu, M., Wu, Q., . . . Zou, M. (2022). Classification of COVID-19 in CT Scans Using Image Smoothing and Improved Deep Residual Network. In Artificial Intelligence First CAAI International Conference, CICAI 2021, Hangzhou, China, June 5–6, 2021, Proceedings, Part I Vol. 13069 LNAI (pp. 89-100). Switzerland: Springer.
    DOI
    2022 He, L., Cai, Q., Liang, X., Xin, J., Jiang, J., Shi, D., . . . Li, J. (2022). The role of macrophage ETS Proto-oncogene 2 in acute-onchronic liver failure. In JOURNAL OF HEPATOLOGY Vol. 77 (pp. S402-S403). ENGLAND, London: ELSEVIER.
    2022 Li, P., Liang, X., Luo, J., Li, J., Xin, J., Jiang, J., . . . Li, J. (2022). Survival benefit-based priority of liver transplantation in HBV-related acute-on-chronic liver failure. In JOURNAL OF HEPATOLOGY Vol. 77 (pp. S803). ELSEVIER.
    2022 Cao, Y., Wu, Q., Zhang, B., Liu, Z., & Li, J. (2022). FSE-MV: Compressed Domain Video Information Assisted Hybrid Real-Time Vehicle Speed Estimation. In C. T. Calafate, X. Chen, & Y. Wu (Eds.), MOBILE NETWORKS AND MANAGEMENT, MONAMI 2021 Vol. 418 (pp. 100-114). ELECTR NETWORK: SPRINGER INTERNATIONAL PUBLISHING AG.
    DOI
    2021 Yao, Y., Chen, T., Xie, G. S., Zhang, C., Shen, F., Wu, Q., . . . Zhang, J. (2021). Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 2623-2632). online: IEEE.
    DOI Scopus174 WoS73
    2021 Yao, Y., Sun, Z., Zhang, C., Shen, F., Wu, Q., Zhang, J., & Tang, Z. (2021). Jo-SRC: A Contrastive Approach for Combating Noisy Labels. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 5188-5197). online: IEEE.
    DOI Scopus123 WoS31
    2021 Deng, C., Chen, S., Chen, D., He, Y., & Wu, Q. (2021). Sketch, ground, and refine: top-down dense video captioning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2021) (pp. 234-243). online: IEEE.
    DOI Scopus51 WoS17
    2021 Hong, Y., Wu, Q., Qi, Y., Rodriguez Opazo, C., & Gould, S. (2021). VLN↻BERT: A Recurrent Vision-and-Language BERT for Navigation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 1643-1653). online: IEEE.
    DOI Scopus166 WoS46
    2021 Xu, G., Niu, S., Tan, M., Luo, Y., Du, Q., & Wu, Q. (2021). Towards Accurate Text-based Image Captioning with Content Diversity Exploration. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12632-12641). online: IEEE.
    DOI Scopus51 WoS25
    2021 Gao, C., Chen, J., Liu, S., Wang, L., Zhang, Q., & Wu, Q. (2021). Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 3063-3072). online: IEEE COMPUTER SOC.
    DOI Scopus63 WoS20
    2021 Wu, Q., Wu, C. J., Zhu, Y., & Joo, J. (2021). Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-Scene. In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 4095-4102). online: IEEE.
    DOI Scopus11 WoS5
    2021 Yu, J., Chai, Y., Wang, Y., Hu, Y., & Wu, Q. (2021). CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1274-1280). online: International Joint Conferences on Artificial Intelligence.
    DOI Scopus50
    2021 Suo, W., Sun, M., Wang, P., & Wu, Q. (2021). Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1032-1038). online: International Joint Conferences on Artificial Intelligence.
    DOI Scopus10
    2021 Gao, C., Zhu, Q., Wang, P., & Wu, Q. (2021). Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21) (pp. 664-670). United States: International Joint Conferences on Artificial Intelligence.
    DOI Scopus2
    2021 An, D., Qi, Y., Huang, Y., Wu, Q., Wang, L., & Tan, T. (2021). Neighbor-view Enhanced Model for Vision and Language Navigation. In MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia (pp. 5101-5109). virtual online: ACM.
    DOI Scopus42 WoS16
    2021 Qiao, Y., Chen, Q., Deng, C., DIng, N., Qi, Y., Tan, M., . . . Wu, Q. (2021). R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks. In MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia (pp. 2085-2093). New York, NY, United States: Association for Computing Machinery.
    DOI Scopus15 WoS7
    2021 Wen, Z., Xu, G., Tan, M., Wu, Q., & Wu, Q. (2021). Debiased Visual Question Answering from Feature and Sample Perspectives. In M. Ranzato, A. Beygelzimer, Y. Dauphin, P. S. Liang, & J. Wortman Vaughan (Eds.), Advances in Neural Information Processing Systems 34 Vol. 5 (pp. 3784-3796). Online: Neural Information Processing Systems Foundation, Inc (NeurIPS).
    Scopus53 WoS3
    2021 He, K., Huang, Y., Wu, Q., Yang, J., An, D., Sima, S., & Wang, L. (2021). Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision. In Advances in Neural Information Processing Systems Vol. 2 (pp. 652-663). ELECTR NETWORK: NEURAL INFORMATION PROCESSING SYSTEMS (NIPS).
    Scopus19 WoS3
    2021 Kazemi Moghaddam, M., Wu, Q., Abbasnejad, E., & Shi, J. (2021). Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV 2021) (pp. 3732-3741). online: IEEE.
    DOI Scopus14 WoS8
    2021 Zheng, Y., Wen, Z., Tan, M., Zeng, R., Chen, Q., Wang, Y., & Wu, Q. (2021). Modular graph attention network for complex visual relational reasoning. In Proceedings of the 15th Asian Conference on Computer Vision (ACCV 2020), as published in Lecture Notes in Computer Science Vol. 12627 (pp. 137-153). Cham, Switzerland: Springer.
    DOI Scopus2
    2021 Zhu, Q., Gao, C., Wang, P., & Wu, Q. (2021). Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps. In THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE Vol. 35 (pp. 3608-3615). ELECTR NETWORK: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE.
    DOI Scopus40 WoS12
    2021 Wang, Z., Bao, R., Wu, Q., & Liu, S. (2021). Confidence-aware Non-repetitive Multimodal Transformers for TextCaps. In THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE Vol. 35 (pp. 2835-2843). ELECTR NETWORK: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE.
    DOI Scopus21 WoS5
    2021 Liu, L., He, M., Xu, G., Tan, M., & Wu, Q. (2021). How to Train Your Agent to Read and Write. In THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE Vol. 35 (pp. 13397-13405). Online: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE.
    DOI Scopus3 WoS1
    2021 Wu, Q., Qin, M., Song, J., & Liu, L. (2021). An improved method of low light image enhancement based on retinex. In 2021 6th International Conference on Image, Vision and Computing, ICIVC 2021 (pp. 233-241). online: IEEE.
    DOI Scopus7
    2020 Hong, Y., Rodriguez Opazo, C., Wu, Q., & Gould, S. (2020). Sub-Instruction Aware Vision-and-Language Navigation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 3360-3376). virtual online: Association for Computational Linguistics.
    DOI Scopus34 WoS12
    2020 Jiang, X., Yu, J., Qin, Z., Zhuang, Y., Zhang, X., Hu, Y., & Wu, Q. (2020). DualVD: An adaptive dual encoding model for deep visual understanding in visual dialogue. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence Vol. 34 (pp. 11125-11132). online: AAAI.
    Scopus52 WoS21
    2020 Jing, C., Wu, Y., Zhang, X., Jia, Y., & Wu, Q. (2020). Overcoming language priors in VQA via decomposed linguistic representations. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI-20) Vol. 34 (pp. 11181-11188). online: AAAI.
    DOI Scopus79 WoS24
    2020 Zhang, C., Yao, Y., Shu, X., Li, Z., Tang, Z., & Wu, Q. (2020). Data-driven Meta-set Based Fine-Grained Visual Recognition. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 2372-2381). online: ACM.
    DOI Scopus21 WoS6
    2020 Wang, P., Liu, D., Li, H., & Wu, Q. (2020). Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 28-36). online: ACM.
    DOI Scopus17
    2020 Jing, C., Wu, Y., Pei, M., Hu, Y., Jia, Y., & Wu, Q. (2020). Visual-Semantic Graph Matching for Visual Grounding. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 4041-4050). online: ACM.
    DOI Scopus26 WoS6
    2020 Liu, F., Xu, G., Wu, Q., Du, Q., Jia, W., & Tan, M. (2020). Cascade Reasoning Network for Text-based Visual Question Answering. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 4060-4069). online: ACM.
    DOI Scopus50 WoS25
    2020 Hong, Y., Rodriguez-Opazo, C., Qi, Y., Wu, Q., & Gould, S. (2020). Language and visual entity relationship graph for agent navigation. In Advances in Neural Information Processing Systems Vol. 2020-December (pp. 1-12). online: NIPS.
    Scopus70
    2020 Liao, Z., Liu, L., Wu, Q., Teney, D., Shen, C., Van Den Hengel, A., & Verjans, J. (2020). Medical data inquiry using a question answering model. In Proceedings: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI 2020) Vol. 2020-April (pp. 1490-1493). online: IEEE.
    DOI Scopus8 WoS3
    2020 Wang, H., Wu, Q., & Shen, C. (2020). Soft Expert Reward Learning for Vision-and-Language Navigation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 12354 LNCS (pp. 126-141). Switzerland: Springer Nature.
    DOI Scopus21
    2020 Tang, R., Ma, C., Zhang, W. E., Wu, Q., & Yang, X. (2020). Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 12364 LNCS (pp. 437-453). Switzerland: Springer International Publishing.
    DOI Scopus28
    2020 Deng, C., Ding, N., Tan, M., & Wu, Q. (2020). Length-Controllable Image Captioning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 12358 LNCS (pp. 712-729). Switzerland: Springer International Publishing.
    DOI Scopus39
    2020 Qi, Y., Pan, Z., Zhang, S., van den Hengel, A., & Wu, Q. (2020). Object-and-Action Aware Model for Visual Language Navigation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 12355 LNCS (pp. 303-317). Switzerland: Springer International Publishing.
    DOI Scopus53
    2020 Jiang, X., Yu, J., Sun, Y., Qin, Z., Zhu, Z., Hu, Y., & Wu, Q. (2020). DAM: Deliberation, abandon and memory networks for generating detailed and non-repetitive responses in visual dialogue. In IJCAI International Joint Conference on Artificial Intelligence Vol. 2021-January (pp. 687-693). online: AAAI Press.
    Scopus8 WoS1
    2020 Zhu, Z., Yu, J., Wang, Y., Sun, Y., Hu, Y., & Wu, Q. (2020). Mucko: Multi-layer cross-modal knowledge reasoning for fact-based visual question answering. In IJCAI International Joint Conference on Artificial Intelligence Vol. 2021-January (pp. 1097-1103). online: AAAI Press.
    Scopus76 WoS36
    2020 Chen, Z., Wang, P., Ma, L., Wong, K. Y. K., & Wu, Q. (2020). Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension. In Proceedings of the 2020 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 10083-10092). online: IEEE.
    DOI Scopus50
    2020 Qi, Y., Wu, Q., Anderson, P., Wang, X., Wang, W. Y., Shen, C., & Van Den Hengel, A. (2020). Reverie: Remote embodied visual referring expression in real indoor environments. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9979-9988). online: IEEE.
    DOI Scopus200
    2020 Liao, Z., Wu, Q., Shen, C., Van Den Hengel, A., & Verjans, J. (2020). AIML at VQA-Med 2020: Knowledge inference via a skeleton-based sentence mapping approach for medical domain visual question answering. In L. Cappellato, C. Eickhoff, N. Ferro, & A. Névéol (Eds.), Proceedings of the 11th International Conference of the CLEF Initiative (CLEF 2020), as published in CEUR Workshop Proceedings Vol. 2696 (pp. 1-14). online: CEUR-WS.
    Scopus7
    2020 Abbasnejad, M., Abbasnejad, I., Wu, Q., Shi, Q., & Van Den Hengel, A. (2020). Gold seeker: Information gain from policy distributions for goal-oriented vision-and-langauge reasoning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 13447-13456). online: IEEE.
    DOI Scopus3
    2020 Chen, S., Jin, Q., Wang, P., & Wu, Q. (2020). Say as you wish: Fine-grained control of image caption generation with abstract scene graphs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 9959-9968). online: IEEE.
    DOI Scopus195
    2020 Chen, S., Zhao, Y., Jin, Q., & Wu, Q. (2020). Fine-grained video-text retrieval with hierarchical graph reasoning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 10635-10644). online: IEEE.
    DOI Scopus256
    2020 Chen, Q., Wu, Q., Tang, R., Wang, Y., Wang, S., & Tan, M. (2020). Intelligent home 3D: Automatic 3D-house design from linguistic descriptions only. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12622-12631). online: IEEE.
    DOI Scopus32
    2019 Duan, X., Wu, Q., Gan, C., Zhang, Y., Huang, W., Van Den Hengel, A., & Zhu, W. (2019). Watch, reason and code: Learning to represent videos using program. In Proceedings of the 27th ACM International Conference on Multimedia (ACM Multimedia 2019), MM '19 (pp. 1543-1551). online: Association for Computing Machinery.
    DOI Scopus5 WoS1
    2019 Abbasnejad, E., Wu, Q., Shi, Q., & Van Den Hengel, A. (2019). What's to know? uncertainty as a guide to asking goal-oriented questions. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 4150-4159). online: IEEE.
    DOI Scopus17 WoS5
    2019 Zhang, J., Wu, Q., Zhang, J., Shen, C., & Lu, J. (2019). Mind your neighbours: Image annotation with metadata neighbourhood graph co-attention networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 2951-2959). online: IEEE.
    DOI Scopus20 WoS10
    2019 Wang, P., Wu, Q., Cao, J., Shen, C., Gao, L., & Hengel, A. V. D. (2019). Neighbourhood watch: Referring expression comprehension via language-guided graph attention networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2019-June (pp. 1960-1968). online: IEEE.
    DOI Scopus240 WoS99
    2018 Cao, I., Guo, Y., Wu, Q., Shen, C., Huang, J., & Tan, M. (2018). Adversarial learning with local coordinate coding. In 35th International Conference on Machine Learning, ICML 2018 Vol. 2 (pp. 1104-1117). online: PMLR.
    Scopus22 WoS1
    2018 Zhuang, Z., Tan, M., Zhuang, B., Liu, J., Guo, Y., Wu, Q., . . . Zhu, J. (2018). Discrimination-aware Channel Pruning for Deep Neural Networks. In Advances in Neural Information Processing Systems Vol. 2018-December (pp. 875-886). online: NIPS.
    Scopus410 WoS29
    2018 Zhang, J., Zhang, J., Wu, Q., Wu, Q., Xu, J., Lu, J., . . . Tang, Z. (2018). Historical image annotation by exploring the tag relevance. In Proceedings - 4th Asian Conference on Pattern Recognition, ACPR 2017 (pp. 646-651). Nanjing, PEOPLES R CHINA: IEEE.
    DOI Scopus1 WoS1
    2018 Zhuang, B., Wu, Q., Shen, C., Reid, I., & Van Den Hengel, A. (2018). HCVRD: A benchmark for large-scale human-centered visual relationship detection. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 7631-7638). New Orleans: Association for the Advancement of Artificial Intelligence.
    Scopus33 WoS15
    2018 Zhang, J., Wu, Q., Zhang, J., Shen, C., & Lu, J. (2018). Kill two birds with one stone: Weakly-supervised neural network for image annotation and tag refinement. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 7550-7557). New Orleans: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE.
    Scopus9 WoS5
    2018 Wu, Q., Wang, P., Shen, C., Reid, I., & Hengel, A. (2018). Are you talking to me? Reasoned visual dialog generation through adversarial learning. In Proceedings: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018) (pp. 6106-6115). Salt Lake City, UT: IEEE.
    DOI Scopus104 WoS54
    2018 Deng, C., Wu, Q., Wu, Q., Hu, F., Lyu, F., & Tan, M. (2018). Visual Grounding via Accumulated Attention. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 7746-7755). online: IEEE.
    DOI Scopus159 WoS79
    2018 Anderson, P., Das, A., & Wu, Q. (2018). Connecting language and vision to actions. In ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference Tutorial Abstracts (pp. 10-14). Melbourne: Association for Computational Linguistics.
    DOI
    2018 Huang, Y., Wu, Q., Song, C., & Wang, L. (2018). Learning Semantic Concepts and Order for Image and Sentence Matching. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 6163-6171). online: IEEE.
    DOI Scopus307 WoS192
    2018 Ma, C., Shen, C., Dick, A., Wu, Q., Wang, P., Van Den Hengel, A., & Reid, I. (2018). Visual Question Answering with memory-augmented network. In Proceedings: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018) (pp. 6975-6984). Salt Lake City, Utah: IEEE.
    DOI Scopus90 WoS62
    2018 Anderson, P., Wu, Q., Teney, D., Bruce, J., Johnson, M., Sünderhauf, N., . . . Hengel, A. V. D. (2018). Vision-and-language navigation: interpreting visually-grounded navigation instructions in real environments. In Proceedings: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018) Vol. abs/1711.07280 (pp. 3674-3683). Salt Lake City, UT: IEEE.
    DOI Scopus823 WoS1140
    2018 Zhang, J., Xie, Y., Wu, Q., & Xia, Y. (2018). Skin lesion classification in dermoscopy images using synergic deep learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 11071 LNCS (pp. 12-20). Switzerland: Springer.
    DOI Scopus43 WoS24
    2018 Zhang, J., Wu, Q., Shen, C., Zhang, J., Lu, J., & van den Hengel, A. (2018). Goal-oriented visual question generation via intermediate rewards. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Computer Vision - ECCV 2018: Proceedings, Part V Vol. Lecture Notes in Computer Science; vol. 11209 (pp. 189-204). Munich: Springer.
    DOI Scopus13 WoS14
    2018 Zhuang, B., Wu, Q., Shen, C., Reid, I., & van den Hengel, A. (2018). Parallel attention: a unified framework for visual object discovery through dialogs and queries. In Proceedings: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018) (pp. 4252-4261). Salt Lake City, UT: IEEE.
    DOI Scopus119 WoS52
    2018 Wang, C., Zhao, R., Yang, X., & Wu, Q. (2018). Research of UAV Target Detection and Flight Control Based on Deep Learning. In 2018 International Conference on Artificial Intelligence and Big Data (ICAIBD) (pp. 170-174). online: IEEE.
    DOI WoS10
    2018 Wu, Q., Wang, P., Liu, E., Fan, Y., Duan, D., Wang, Z., & Cai, S. (2018). Design and Implementation of Learning Management Platform for Aviation Flight Training Based on SCORM/AICC Standard-A Case Study of K Airline Company Flight Training Learning Platform. In ADVANCED SCIENCE LETTERS Vol. 24 (pp. 5194-5198). INDONESIA, Bandung: AMER SCIENTIFIC PUBLISHERS.
    DOI WoS1
    2017 Wang, Q., Chen, W., & Wu, Q. (2017). The research and application of an real-time embedded measurement and control system for the river discharge. In S. Li, Y. Dai, & Y. Cheng (Eds.), 2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE) (pp. 1295-1298). Changsha, PEOPLES R CHINA: IEEE.
    DOI
    2017 Wang, P., Wu, Q., Shen, C., & van den Hengel, A. (2017). The VQA-machine: learning how to use existing vision algorithms to answer new questions. In Proceedings: 30th IEEE Conference on Computer Vision and Pattern Recognition Vol. 2017-January (pp. 3909-3918). Honolulu: IEEE.
    DOI Scopus68 WoS34
    2017 Wang, P., Wu, Q., Shen, C., Dick, A., & Van Den Hengel, A. (2017). Explicit knowledge-based reasoning for visual question answering. In C. Sierra (Ed.), Proceedings of the twenty-sixth International Joint Conference on Artificial Intelligence Vol. 0 (pp. 1290-1296). online: IJCAI.
    DOI Scopus136 WoS60
    2016 Wu, Q., Wang, P., Shen, C., Dick, A., & Van Den Hengel, A. (2016). Ask me anything: free-form visual question answering based on knowledge from external sources. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2016-December (pp. 4622-4630). Las Vegas, NV: IEEE.
    DOI Scopus299 WoS183
    2016 Wu, Q., Shen, C., Liu, L., Dick, A., & Van Den Hengel, A. (2016). What value do explicit high level concepts have in vision to language problems?. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Vol. 2016-December (pp. 203-212). Las Vegas, NV: IEEE.
    DOI Scopus411 WoS278
    2016 Wu, Q., Wang, C., Li, A., & Huang, B. (2016). Integral sliding mode controller design for near space vehicle with input constraints. In 2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC) (pp. 187-191). PEOPLES R CHINA, Nanjing: IEEE.
    WoS2
    2016 Tang, J., Guo, Y., Lai, X., Liu, Y., & Wu, Q. (2016). Study on the Correlation between Fe<SUP>2+</SUP> and Peridot's Yellow Green Color and Quality Evaluation of Color Based on CIE1976 L*a*b* Uniform Color Space. In X. Xiao, & P. Han (Eds.), PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ENVIRONMENT, MATERIALS, CHEMISTRY AND POWER ELECTRONICS Vol. 84 (pp. 599-604). PEOPLES R CHINA, Zhengzhou: ATLANTIS PRESS.
    WoS1
    2016 Gao, G., Yang, H., Wu, Q., Mao, S. -J., & Yin, W. -L. (2016). A Wideband and Low Cross Polarization Slot Antenna Based on Differential-Feed. In INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND NETWORK ENGINEERING (WCNE 2016) (pp. 4 pages). PEOPLES R CHINA, Beijing: DESTECH PUBLICATIONS, INC.
    2016 Gao, G., Yang, H., Jin, Z., & Wu, Q. (2016). A Broadband Dual-polarization Slot Antenna Based on Substrate-integrated Cavity. In 2016 PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS) (pp. 1994-1998). PEOPLES R CHINA, Shanghai: IEEE.
    2016 Wu, Q., Yang, H., Jin, Z., Gao, G., & Cao, D. (2016). A Design of Band-pass Filter with Steep Stopband Attenuation Based on Transmission Zeros. In 2016 PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS) (pp. 3482-3486). PEOPLES R CHINA, Shanghai: IEEE.
    2016 Wang, X., Wu, Q., & Yang, J. (2016). Extended PGA Processing of High Resolution Airborne SAR Imagery Reconstructed via Backprojection Algorithm. In 2016 CIE INTERNATIONAL CONFERENCE ON RADAR (RADAR) (pp. 3 pages). PEOPLES R CHINA, Guangzhou: IEEE.
    2016 Wu, Q., Yang, H., Gao, G., Gu, L., & Zhao, F. (2016). A Design of High Gain Archimedean Spiral Antenna. In INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND NETWORK ENGINEERING (WCNE 2016) (pp. 4 pages). PEOPLES R CHINA, Beijing: DESTECH PUBLICATIONS, INC.
    2016 Macedo, G. S., Burke, K. A., Piscuoglio, S., Ng, C. K. Y., Geyer, F. C., Martelotto, L. G., . . . Reis-Filho, J. S. (2016). The landscape of somatic genetic alterations in BRCA1 and BRCA2 breast cancers. In CANCER RESEARCH Vol. 76 (pp. 4 pages). LA, New Orleans: AMER ASSOC CANCER RESEARCH.
    DOI
    2016 Weigelt, B., Piscuoglio, S., Burke, K., Ng, C. K. Y., Papanastasiou, A. D., Geyer, F. C., . . . Reis-Filho, J. S. (2016). Uterine Adenosarcomas Are Mesenchymal Neoplasms. In LABORATORY INVESTIGATION Vol. 96 (pp. 314A). WA, Seattle: NATURE PUBLISHING GROUP.
    2015 Wu, Q., Wu, Q., Zhao, S., Wei, M., & Wang, F. L. (2015). Knowledge Communication Analysis Based on Clustering and Association Rules Mining. In A. Liu, Y. Ishikawa, T. Qian, S. Nutanong, & M. A. Cheema (Eds.), DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015 Vol. 9052 (pp. 66-75). VIETNAM, Hanoi: SPRINGER-VERLAG BERLIN.
    DOI
    2015 Wu, Q., Vogt, A., Briins, H. -D., Gronwald, F., & Schuster, C. (2015). Numerical and Experimental Evaluation of Electromagnetic Coupling between Radiating Antenna Structures inside a Computer Casing. In 2015 IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (EMC) (pp. 328-333). GERMANY, Dresden: IEEE.
    2015 Cai, H., Wu, Q., & Hall, P. (2015). Beyond Photo-Domain Object Recognition: Benchmarks for the Cross-Depiction Problem. In Proceedings of the IEEE International Conference on Computer Vision Vol. 2015-February (pp. 74-79). Santigo: IEEE.
    DOI Scopus2 WoS3
    2015 Wu, Q., Chen, F. -C., & Huang, R. -Y. (2015). Detecting Temporal Community from Dynamic Heterogeneous Networks. In PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015) (pp. 610-613). Harbin, PEOPLES R CHINA: IEEE.
    2014 Wu, Q., Cai, H., & Hall, P. (2014). Learning graphs to model visual objects across different depictive styles. In D. Fleet, T. Pajdia, B. Schiele, & T. Tuytelaars (Eds.), Proceedings of the 13th European Conference on Computer Vision Vol. VII (pp. 313-328). Zurich, Switzerland: Springer.
    DOI Scopus17 WoS5
    2013 Wu, Q., & Hall, P. (2013). Modelling visual objects Invariant to depictive style. In T. Burghardt, D. Damen, W. Mayol-Cuevas, & M. Mirmehdi (Eds.), Proceedings of the British Machine Vision Conference (pp. 23.1-23.12). Bristol, UK: BMVA Press.
    DOI Scopus4
    2013 Hao, Y., Wu, Q., & Liu, B. (2013). Literature Review on the Impact of Income Distribution Gap on Consumer Demand. In G. Lee (Ed.), PSYCHOLOGY, MANAGEMENT AND SOCIAL SCIENCE Vol. 18 (pp. 65-70). PEOPLES R CHINA, Shenzhen: INFORMATION ENGINEERING RESEARCH INST, USA.
    2012 Wu, Q., Fu, X., & Shen, X. (2012). Automatic micro-expression analysis. In INTERNATIONAL JOURNAL OF PSYCHOLOGY Vol. 47 (pp. 144-145). JOHN WILEY & SONS LTD.
    2012 Wu, Q., & Hall, P. (2012). Prime shapes in natural images. In R. Bowden, J. Collomosse, & K. Mikolajcczk (Eds.), Proceedings of the British Machine Vision Conference (pp. 45-1-45-12). Surrey, UK: BMVA Press.
    DOI Scopus4 WoS2
    2011 Hoffman, J., Wang, L. -M., Wu, Q., & Morton, K. (2011). Uptake of 2-deoxyglucose analogs by thrombotically activated cells. In JOURNAL OF NUCLEAR MEDICINE Vol. 52 (pp. 2 pages). SOC NUCLEAR MEDICINE INC.
    2008 Liu, Y., Yin, Y., Teng, Z., Wu, Q., & Li, G. (2008). Activities prediction of drug molecules by using the optimal ensemble based on uniform design. In D. S. Huang, D. C. Wunsch, D. S. Levine, & K. H. Jo (Eds.), ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS Vol. 5226 (pp. 106-+). PEOPLES R CHINA, Shanghai: SPRINGER-VERLAG BERLIN.
    WoS1
    2007 Wu, Q., Shao, T. -C., & Chen, T. (2007). Robust self-calibration from single image using RANSAC. In G. Bebis, R. Boyle, B. Parvin, D. Koracin, N. Paragios, S. M. Tanveer, . . . T. Malzbender (Eds.), ADVANCES IN VISUAL COMPUTING, PT I Vol. 4841 (pp. 230-+). NV, Lake Tahoe: SPRINGER-VERLAG BERLIN.
    WoS5
    2006 Wu, Q., Song, M., Bu, J., & Chen, C. (2006). EigenExpress approach in recognition of facial expression using GPU. In T. S. Huang, N. Sebe, M. S. Lew, V. Pavlovic, M. Kolsch, A. Galata, & B. Kisacanin (Eds.), COMPUTER VISION IN HUMAN-COMPUTER INTERACTION Vol. 3979 (pp. 12-20). AUSTRIA, Graz: SPRINGER-VERLAG BERLIN.
    WoS1
  • Report for External Bodies

  • Preprint

    Year Citation
    2024 Phan, V. M. H., Xie, Y., Qi, Y., Liu, L., Liu, L., Zhang, B., . . . Verjans, J. W. (2024). Decomposing Disease Descriptions for Enhanced Pathology Detection: A
    Multi-Aspect Vision-Language Pre-training Framework.
    2024 Zhou, G., Hong, Y., Wang, Z., Wang, X. E., & Wu, Q. (2024). NavGPT-2: Unleashing Navigational Reasoning Capability for Large
    Vision-Language Models.
    2024 Zhou, G., Hong, Y., Wang, Z., Zhao, C., Bansal, M., & Wu, Q. (2024). SAME: Learning Generic Language-Guided Visual Navigation with
    State-Adaptive Mixture of Experts.
    2024 Wei, Y., Fu, S., Jiang, W., Zhang, Z., Zeng, Z., Wu, Q., . . . Zhang, Y. (2024). GITA: Graph to Visual and Textual Integration for Vision-Language Graph
    Reasoning.
    2024 Chen, Q., Zhang, B., Wang, G., & Wu, Q. (2024). Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with
    Situation Puzzles.
    2024 Chen, Q., Zhao, R., Wang, S., Phan, V. M. H., Hengel, A. V. D., Verjans, J., . . . Wu, Q. (2024). A Survey of Medical Vision-and-Language Applications and Their
    Techniques.
    2023 Chen, Q., Pitawela, D., Zhao, C., Zhou, G., Chen, H. -T., & Wu, Q. (2023). WebVLN: Vision-and-Language Navigation on Websites.
    2021 Chen, Q., Li, Y., Qi, Y., Zhou, J., Tan, M., & Wu, Q. (2021). V2C: Visual Voice Cloning.
    2021 Moghaddam, M. K., Abbasnejad, E., Wu, Q., Shi, J., & Hengel, A. V. D. (2021). Learning for Visual Navigation by Imagining the Success.
    2019 Parvaneh, A., Abbasnejad, E., Wu, Q., & Shi, J. (2019). Show, Price and Negotiate: A Hierarchical Attention Recurrent Visual Negotiator..
  • MyIP-7370, CERA grants, Anton van den Hengel, Anthony Dick, Qi Wu, Answer Me Why:Explainability is Critical if We are to Trust Automated Decision Making, 98,000 AUD

  • MyIP-7370, CERA grants, Anton van den Hengel, Anthony Dick, Qi Wu, Robust long-term Autonomous Navigation, 98,000 AUD

  • Facebook’s Research and Academic Relations Program, Peter Anderson, Qi Wu, Damien Teney, Niko Sunderhauf, Stephen Gould, Anton van den Hengel, Treasure Hunt: Natural Language N

  • Computer Vision

  • Machine Learning

  • Algorithms and Data Structure Analysis

  • Research Methods

  • Advanced Topics in Computer Science

  • Current Higher Degree by Research Supervision (University of Adelaide)

    Date Role Research Topic Program Degree Type Student Load Student Name
    2024 Principal Supervisor Vision-language Pre-training in Medical Domain Doctor of Philosophy Doctorate Full Time Ms Sinuo Wang
    2024 Principal Supervisor Distillation knowledge from Large Foundation Models For Vision-and-Language Navigation Master of Philosophy Master Full Time Mr Zerui Li
    2024 Principal Supervisor Direct Fitting 3D Generative Models Using Volume Rendering Master of Philosophy Master Full Time Mr Jian Zhou
    2024 Principal Supervisor Parameter-efficient Tuning Large Vision-Language Models Doctor of Philosophy Doctorate Full Time Mr Shuai Fu
    2023 Principal Supervisor Vision-and-Language in the Wild Doctor of Philosophy Doctorate Full Time Mr Zheng Yu
    2023 Principal Supervisor Efficient Video Foundation Model Doctor of Philosophy Doctorate Full Time Mr Feng Chen
    2022 Principal Supervisor Vision-and-Language Methods in Clinical Applications Doctor of Philosophy Doctorate Full Time Mr Chaohan Wang
    2022 Co-Supervisor MUDE: Mixed-reality Unified Development Environment for Context-Aware AI Automation Tasks Doctor of Philosophy Doctorate Full Time Miss Xiaoyan Wei
    2022 Principal Supervisor Spatiotemporal Multimodal Learning in Embodied AI Doctor of Philosophy Doctorate Full Time Mr Gengze Zhou
    2021 Co-Supervisor Recognizing Individual and Collective Activity Using Videos Doctor of Philosophy Doctorate Full Time Mr Bahram Mohammadi
  • Past Higher Degree by Research Supervision (University of Adelaide)

    Date Role Research Topic Program Degree Type Student Load Student Name
    2022 - 2023 Principal Supervisor Vision-and-Language Navigation in the Real-World Master of Philosophy Master Full Time Mr Chongyang Zhao
    2021 - 2024 Principal Supervisor Multi-modal Generation, Synergy and Evaluation Doctor of Philosophy Doctorate Full Time Mr Qi Chen
    2020 - 2023 Principal Supervisor General Vision and Language Methods in Real Applications: A Focus on Vision-and-Language Navigation Doctor of Philosophy Doctorate Full Time Miss Yanyuan Qiao
    2020 - 2024 Principal Supervisor Language-based Visual Understanding Doctor of Philosophy Doctorate Full Time Mr Chaorui Deng
    2019 - 2022 Co-Supervisor Towards Optimistic, Imaginative, and Harmonious Reinforcement Learning in
    Single-Agent and Multi-Agent Environments
    Doctor of Philosophy Doctorate Full Time Mr Mahdi Kazemi Moghaddam
    2018 - 2021 Co-Supervisor Fully Convolutional Instance-level Visual Recognition Doctor of Philosophy Doctorate Full Time Mr Zhi Tian
    2018 - 2022 Co-Supervisor 3D Scene Reconstruction from A Monocular Image Doctor of Philosophy Doctorate Full Time Mr Wei Yin
    2018 - 2021 Co-Supervisor Multi-modality Data Analysis Using Deep Reinforcement Learning Doctor of Philosophy Doctorate Full Time Mr Hu Wang
    2018 - 2022 Co-Supervisor Efficient Deep Networks for Image Matting Doctor of Philosophy Doctorate Full Time Ms Yutong Dai
    2017 - 2018 Co-Supervisor Text Detection and Recognition in Natural Scene Images Doctor of Philosophy Doctorate Full Time Mrs Hui Li
  • Position: Associate Prof/Reader
  • Phone: 83132134
  • Email: qi.wu01@adelaide.edu.au
  • Campus: North Terrace
  • Building: Australian Institute for Machine Learning, floor 1
  • Org Unit: Computer Science

Connect With Me
External Profiles

Other Links