Skip to Main content Skip to Navigation
Journal articles

Handwriting-OOV word-recognition using web resources

Abstract : Handwriting recognition systems rely on predefined dictionaries. Small and static dictionaries are often exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words is not handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits web resources. After an IV-OOV classification, Wikipedia is used to create OOV sequence-adapted dynamic dictionaries. A second decoding is done the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on the RIMES dataset using a BLSTM recognizer. Results show that improvements are obtained compared to handwriting recognition with static dictionary.
Complete list of metadata

Cited literature [33 references]  Display  Hide  Download
Contributor : Bruno Savelli Connect in order to contact the contributor
Submitted on : Monday, October 21, 2019 - 10:59:02 AM
Last modification on : Saturday, June 25, 2022 - 9:13:03 PM
Long-term archiving on: : Wednesday, January 22, 2020 - 2:54:04 PM


Publisher files allowed on an open archive




Cristina Oprean, Chafic Mokbel, Laurence Likforman-Sulem, Adrian Popescu. Handwriting-OOV word-recognition using web resources. Document Numérique, Lavoisier, 2014, 17 (3), pp.77 - 96. ⟨10.3166/DN.17.3.77-96⟩. ⟨cea-01822860⟩



Record views


Files downloads