Exploring the role of Text in Visual Question Answering on Natural Scenes and Documents
Rubèn Pérez Tito successfully defended his dissertation on Computer Science on November 13, 2023, and he is now Doctor of Philosophy by the Universitat Autònoma de Barcelona. What is the thesis about? Visual Question Answering (VQA) is the task where given an image and a natural language question, the objective is to generate a natural … Read more