Handwritten Document Analysis, Recognition and Interpretation

Stage of development

TRL 4-5

Business Sector

Tourism, Culture, Music, Audiovisual, Financial Services & Insurance

Research Line

Intelligent Reading Systems

Principal Resercher:

Technology description:

Our technology encompasses a range of methodologies dedicated to the analysis, recognition, and interpretation of handwritten documents, spanning textual manuscripts, ciphers featuring rare scripts, and music scores.
 
Recognizing handwritten documents presents significant challenges due to the wide variability in handwriting styles, diverse layouts, languages, and domains. Moreover, the lack of enough annotated data poses a major obstacle, particularly in the realm of Deep Learning.
 
To address these challenges, we have developed a multifaceted approach that integrates several innovative strategies. First, few-shot learning enables us to learn effectively with limited data, making it particularly useful for recognizing manuscripts with rare scripts, unfamiliar alphabets, or distinct handwriting styles. Second, self-supervised learning can learn representations from unlabeled data, which can be effectively transferred to document recognition tasks. Third, synthetic document generation provides an alternative approach by generating labeled data to train deep learning models.

Use cases:

Technology and citizen innovation for building historical social networks to understand the demographic past. 

More information

Computer Vision techniques to preserve, catalogue, and disseminate historic musical documents and to evolve towards the digital processing of musical information.

More information

Applications:

IP Transfer:

Interested in this technology? Contact us!

Technology Transfer & Industry Partnerships Department:

transferencia@cvc.uab.cat