Literate models for computer vision: Combining vision, language and reading
Dimosthenis Karatzas (email@example.com), Ernest Valveny (firstname.lastname@example.org), Lluis Gomez (email@example.com), Ali Biten, Andres Mafla, Ruben Perez, Sergi Garcia
Computer Vision Centre, Barcelona, Spain
CONTENT AND ORGANIZATION:
Written information in the world around us is a fundamental cue for a multitude of everyday tasks. From shopping at the supermarket to finding our destination in an unknown urban space, written text helps us perform many tasks that would otherwise be much more complex.
Computer vision systems, on the other hand, have been practically illiterate for the first half century of their lifetime. Specific research on reading systems has been going on for decades, but the semantic information that image text conveys was not incorporated into higher-level computer vision tasks until very recently. This is gradually changing, enabled by the great success achieved in the field of scene text recognition in recent years.
Through this short interactive course, doctoral students will have a chance to get up to date with the state of the art in reading systems, especially scene text recognition, and explore how image text enables us to tackle new and exciting computer vision tasks such as fine-grained image classification, cross-modal retrieval, captioning and visual question answering.
REGISTRATION: Free of charge
WHEN: April 4, 2022 – 17:00-19:00 CET and April 6, 2022 – 17:00-19:00 CET
WHERE: The course will take place in hybrid mode. Onsite attendance is possible at the Computer Vision Centre, Barcelona (limited availability). For ONLINE participants, a link will be provided by the organisers after registration/enrollment.
HOW TO REGISTER and ENROLL:
Both AIDA and non-AIDA students are encouraged to participate in this short course.
If you are an AIDA Student* already, please:
Step (a): Register for the course by filling in the form on the course website
Step (b): Enroll in the same course on the AIDA course site https://www.i-aida.org/course/vision-and-language-reading-systems-and-multi-modal-representations/ using the “Enroll on this Course” button, which you can find there, so that this course is added to your AIDA Certificate of Course Attendance.
If you are not an AIDA Student, do only step (a).
*The International AI Doctoral Academy (AIDA) has 73 members, which are top AI Universities, Research centers and Industries: https://www.i-aida.org/
AIDA Students should already be registered in the AIDA system (they are PhD students or postdocs who belong to one of the AIDA Members listed on the Members page)