The Annual Catalan Meeting on Computer Vision (ACMCV) brings together the Computer Vision community of Catalonia, connecting research, talent generation and industry in one day. This meeting aims to strengthen the links among the Catalan Computer Vision actors, to disseminate within the community the most relevant works that have already been published abroad and to allow students from the Master’s Degree in Computer Vision to meet with members of the Catalan computer vision community and prospective employers.
Dates & Venue
Submission deadline | September 11, 2024 Registration deadline | September 16, 2024 ACMCV 2023 | September 18, 2024 Computer Vision Center & UAB School of Engineering
Multimodal learning has seen unprecedented improvements over the last decade. The availability of large scale data combined with modeling and compute improvements have been the perfect environment for machine learning models to excel at understanding modalities beyond text. In this talk, we will do a brief review of the history of multimodal learning with special focus on techniques involving vision and audio – from early self-supervised learning techniques such as MMV to large language models such as Gemini that are able to process text, vision and audio combined.