PhDinfo Seminar: Pietro Bongini, “Visual Question Answering: an overview”
Abstract: Visual Question Answering (VQA) is an emerging topic which aims at automatically answering questions referred to a specific image. Together with image captioning, VQA is the main point of contact between the two heterogeneous communities of Natural Language Processing (NLP) and Computer Vision. VQA datasets contain different types of questions which require reasoning about […]