Reinforcement Learning of Visual Descriptors for Object Recognition

Reinforcement Learning of Visual Descriptors for Object Recognition

Place: Large Lecture Room

Affiliation: Universitat Autònoma de Barcelona, Computer Vision Centre, Barcelona. Spain.

Committee members:

Dr. Petia Ivanova Radeva, Universitat de Barcelona, Spain.
Dr. Bogdan Raducanu, Computer Vision Center, Barcelona, Spain.
Dr. Arnau Ramisa Ayats, Institut de Robòtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Spain.  

ABSTRACT:

The human visual system is able to recognize the object in an image even if the object is partially occluded, from various points of view, in different colors, or with independence of the distance to the object. To do this, the eye obtains an image and extracts features that are sent to the brain, and then, in the brain the object is recognized. In computer vision, the object recognition branch tries to learns from the human visual system behaviour to achieve its goal. Hence, an algorithm is used to identify representative features of the scene (detection), then another algorithm is used to describe these points (descriptor) and finally the extracted information is used for classifying the object in the scene. The selection of  this set of algorithms is a very complicated task and thus, a very active research field. In this thesis we are focused on the selection/learning of the best descriptor for a given image. In the state of the art there are several descriptors but we do not know how to choose the best descriptor because depends on scenes that we will use (dataset) and the algorithm chosen to do the classification. We propose a framework based on reinforcement learning and bag of features to choose the best descriptor according to the given image. The system can analyse the behaviour of different learning algorithms and descriptor sets. Furthermore the proposed framework for improving the classification/recognition ratio can be used with minor changes in other computer vision fields, such as video retrieval.

Thesis Monica Piñol