Abstract
This paper describes our method for the Medical Domain Visual Question Answering (VQA-Med) Task of ImageCLEF 2019. The aim is to build a model that is able to answer questions about medical images. Our proposed model consists of sub-models, each specializing in answering a specific type of questions. Specifically, the sub-models we have are: “plane” model, “organ systems” model, “modality” models, and “abnormality” models. All of these models are basically image classification models based on pre-trained VGG16 network. We do not rely on the questions for the answers prediction since the questions on each type are repetitive. However, we do rely on them to determine the suitable model to be used for producing the answers and determine the suitable answer format. Our best model achieves 57% accuracy and 0.591 BLEU score.
| Original language | English |
|---|---|
| Journal | CEUR Workshop Proceedings |
| Volume | 2380 |
| State | Published - 2019 |
| Externally published | Yes |
| Event | 20th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2019 - Lugano, Switzerland Duration: 9 Sep 2019 → 12 Sep 2019 |
Keywords
- ImageCLEF 2019
- Medical Image Interpretation
- Medical Questions and Answers
- VGG Network
- Visual Question Answering
Fingerprint
Dive into the research topics of 'Just at ImageCLef 2019 visual question answering in the medical domain'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver