research-article

Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models

Authors:

Alessandro Vinciarelli,

Horst BunkeAuthors Info & Claims

IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 26, Issue 6

Pages 709 - 720

https://doi.org/10.1109/TPAMI.2004.14

Published: 01 June 2004 Publication History

Abstract

Abstract--This paper presents a system for the offline recognition of large vocabulary unconstrained handwritten texts. The only assumption made about the data is that it is written in English. This allows the application of Statistical Language Models in order to improve the performance of our system. Several experiments have been performed using both single and multiple writer data. Lexica of variable size (from 10,000 to 50,000 words) have been used. The use of language models is shown to improve the accuracy of the system (when the lexicon contains 50,000 words, the error rate is reduced by \sim 50 percent for single writer data and by \sim 25 percent for multiple writer data). Our approach is described in detail and compared with other methods presented in the literature to deal with the same problem. An experimental setup to correctly deal with unconstrained text recognition is proposed.

References

[1]

T. Steinherz E. Rivlin and N. Intrator, “Off-Line Cursive Script Word Recognition-A Survey,” Int'l J. Document Analysis and Recognition, vol. 2, no. 2, pp. 1-33, Feb. 1999.

[2]

R. Plamondon and S.N. Srihari, “Online and Offline Handwriting Recognition: A Comprehensive Survey,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 63-84, Jan. 2000.

Digital Library

[3]

A. Vinciarelli, “A Survey on Off-Line Cursive Word Recognition,” Pattern Recognition, vol. 35, no. 7, pp. 1433-1446, June 2002.

[4]

U.V. Marti and H. Bunke, “The IAM-Database: An English Sentence Database for Offline Handwriting Recognition,” Int'l J. Document Analysis and Recognition, vol. 5, no. 1, pp. 39-46, Jan. 2002.

[5]

A.W. Senior and A.J. Robinson, “An Off-Line Cursive Handwriting Recognition System,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 3, pp. 309-321, Mar. 1998.

Digital Library

[6]

M. Zimmermann and H. Bunke, “Automatic Segmentation of the IAM Off-Line Database for Handwritten English Text,” Proc. 16th Int'l Conf. Pattern Recognition, vol. IV, pp. 35-39, 2002.

[7]

R. Rosenfeld, “Two Decades of Statistical Language Modeling: Where Do We Go from Here?” Proc. IEEE, vol. 88, no. 8, pp. 1270-1278, Aug. 2000.

[8]

F. Jelinek, Statistical Aspects of Speech Recognition. MIT Press 1998.

Digital Library

[9]

G. Kim V. Govindaraju and S.N. Srihari, “An Architecture for Handwritten Text Recognition Systems,” Pattern Recognition, vol. 2, pp. 37-44, 1999.

[10]

U.V. Marti and H. Bunke, “Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System,” Int'l J. Pattern Recognition and Artificial Intelligence, vol. 15, no. 1, pp. 65-90, 2001.

[11]

A. Vinciarelli S. Bengio and H. Bunke, “Offline Recognition of Large Vocabulary Cursive Handwritten Text,” Proc. Int'l Conf. Document Analysis and Recognition, 2003.

Digital Library

[12]

T. Paquet and Y. Lecourtier, “Recognition of Handwritten Sentences Using a Restricted Lexicon,” Pattern Recognition, vol. 26,no. 3, pp. 391-407, Mar. 1993.

[13]

D. Guillevic and C.Y. Suen, “Recognition of Legal Amounts on Bank Cheques,” Pattern Analysis and Applications, vol. 1, no. 1, 1998.

[14]

E. Cohen J.J. Hull and S.N. Srihari, “Control Structure for Interpreting Handwritten Addresses,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 16, no. 10, pp. 1049-1055, Oct. 1994.

Digital Library

[15]

J. Park and V. Govindaraju, “Use of Adaptive Segmentation in Handwritten Phrase Recognition,” Pattern Recognition, vol. 35, pp. 245-252, 2002.

[16]

G. Kim and V. Govindaraju, “Handwritten Phrase Recognition as Applied to Street Name Images,” Pattern Recognition, vol. 31, no. 1, pp. 41-51, Jan. 1998.

[17]

M. El Yacoubi M. Gilloux and J.M. Bertille, “A Statistical Approach for Phrase Location and Recognition within a Text Line: An Application to Street Name Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24,no. 2 pp. 172-188, Feb. 2002.

Digital Library

[18]

A. El Yacoubi J.M. Bertille and M. Gilloux, “Conjoined Location of Street Names within a Postal Address Delivery Line,” Proc. Int'l Conf. Document Analysis and Recognition, vol. 2, pp. 1024-1027, 1995.

Digital Library

[19]

G. Kim V. Govindaraju and S.N. Srihari, “An Architecture for Handwritten Text Recognition Systems,” Int'l J. Document Analysis and Recognition, vol. 2, pp. 37-44, 1999.

[20]

R.K. Srihari and C.M. Baltus, “Incorporating Syntactic Constraints in Recognizing Handwritten Sentences,” Proc. Int'l Joint Conf. Artificial Intelligence, pp. 1262-1267, 1993.

[21]

R.K. Srihari, “Use of Lexical and Syntactic Techniques in Recognizing Handwritten Text,” Proc. ARPA Workshop Human Language Technology, pp. 403-407, 1994.

Digital Library

[22]

F. Jelinek, “Self-Organized Language Modeling for Speech Recognition,” Readings in Speech Recognition, A. Waibel and L. Kai-Fu, eds., pp. 450-506, Palo Alto, Calif.: Morgan Kaufmann, 1989.

Digital Library

[23]

A.J. Viterbi, “Error Bounds for Convolutional Codes and an Asymptotically Optimal Decoding Algorithm,” IEEE Trans. Information Theory, vol. 13, pp. 260-269, 1967.

Digital Library

[24]

S. Chen and R. Rosenfeld, “A Survey of Smoothing Techniques for ME Models,” IEEE Trans. Speech and Audio Processing, vol. 8, no. 1, pp. 37-50, Jan. 2000.

[25]

S.M. Katz, “Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer,” IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 35, no. 3, pp. 400-401, 1987.

[26]

R. Rosenfeld, “A Maximum Entropy Approach to Adaptive Statistical Language Modeling,” Computer Speech and Language, vol. 10, pp. 187-228, 1996.

[27]

D. Klakow and J. Peters, “Testing the Correlation of Word Error Rate and Perplexity,” Speech Comm., vol. 38, pp. 19-28, 2002.

Digital Library

[28]

A. Vinciarelli and J Luttin, “Off-Line Cursive Script Recognition Based on Continuous Density HMM,” Proc. Seventh Int'l Workshop Frontiers in Handwriting Recognition, pp. 493-498, 2000.

[29]

A. Vinciarelli and S. Bengio, “Offline Cursive Word Recognition Using Continuous Density Hmms Trained with PCA or ICA Features,” Proc. 16th Int'l Conf. Pattern Recognition, pp. 493-498, 2002.

Digital Library

[30]

A. Vinciarelli and J. Luttin, “A New Normalization Technique for Cursive Handwritten Words,” Pattern Recognition Letters, vol. 22, no. 9, pp. 1043-1050, 2001.

[31]

L. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition,” Readings in Speech Recognition, A. Waibel and L. Kai-Fu, eds., pp. 267-296, Palo Alto, Calif.: Morgan Kaufmann, 1989.

Digital Library

[32]

L.E. Baum and T. Petrie, “Statistical Inference for Probabilistic Functions of Finite State Markov Chains,” Annals of Math. Statistics, vol. 37, pp. 1554-1563, 1966.

[33]

L.E. Baum T. Petrie G. Soules and N. Weiss, “A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains,” Annals of Math. Statistics, vol. 41,no. 1, pp. 164-171, 1970.

[34]

L.E. Baum, “An Inequality and Associated Maximization Technique in Statistical Estimation for Probabilistic Functions of Markov Processes,” Inequalities, vol. 3, pp. 1-8, 1972.

[35]

R. Bellman, Adaptive Control Processes: A Guided Tour. Princeton Univ. Press, 1991.

[36]

F. Sebastiani, “Machine Learning in Automated Text Categorization,” ACM Computing Surveys, vol. 34, no. 1, pp. 1-47, 2002.

Digital Library

[37]

D. Graff C. Cieri S. Strassel and N. Martey, “The TDT-3 Text and Speech Corpus,” Proc. Topic Detection and Tracking Workshop, 2000.

Cited By

Sánchez JVidal EBosch VQuirós L(2024)Ground-truth generation through crowdsourcing with probabilistic indexesNeural Computing and Applications10.1007/s00521-024-10188-036:30(18879-18895)Online publication date: 1-Oct-2024
https://dl.acm.org/doi/10.1007/s00521-024-10188-0
Singh HSharma RSingh V(2023)Language model based suggestions of next possible Gurmukhi character or word in online handwriting recognition systemMultimedia Tools and Applications10.1007/s11042-023-14654-082:30(47271-47289)Online publication date: 1-Dec-2023
https://dl.acm.org/doi/10.1007/s11042-023-14654-0
Vidal EToselli APuigcerver J(2023)Lexicon-based probabilistic indexing of handwritten text imagesNeural Computing and Applications10.1007/s00521-023-08620-y35:24(17501-17520)Online publication date: 10-May-2023
https://dl.acm.org/doi/10.1007/s00521-023-08620-y
Show More Cited By

Index Terms

Recommendations

Language models for online handwritten Tamil word recognition
DAR '12: Proceeding of the workshop on Document Analysis and Recognition

N-gram language models and lexicon-based word-recognition are popular methods in the literature to improve recognition accuracies of online and offline handwritten data. However, there are very few works that deal with application of these techniques on ...
Offline handwritten word recognition in Hindi
DAR '12: Proceeding of the workshop on Document Analysis and Recognition

This paper discusses the Hindi offline handwritten word recognizer (HWR) that we are developing. For the purpose of training and testing the offline HWR, we have created a Hindi handwritten word and character database from 100 writers. In our HWR we use ...
Offline recognition of handwritten Bangla characters: an efficient two-stage approach

The present work deals with recognition of handwritten characters of Bangla, a major script of the Indian sub-continent. The main contributions presented here are (a) generation of a database of handwritten basic characters of Bangla and (b) development ...

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Pattern Analysis and Machine Intelligence

IEEE Transactions on Pattern Analysis and Machine Intelligence Volume 26, Issue 6

June 2004

150 pages

ISSN:0162-8828

Issue’s Table of Contents

Copyright © Copyright © 2004 IEEE. All Rights Reserved.

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 June 2004

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

79
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sánchez JVidal EBosch VQuirós L(2024)Ground-truth generation through crowdsourcing with probabilistic indexesNeural Computing and Applications10.1007/s00521-024-10188-036:30(18879-18895)Online publication date: 1-Oct-2024
https://dl.acm.org/doi/10.1007/s00521-024-10188-0
Singh HSharma RSingh V(2023)Language model based suggestions of next possible Gurmukhi character or word in online handwriting recognition systemMultimedia Tools and Applications10.1007/s11042-023-14654-082:30(47271-47289)Online publication date: 1-Dec-2023
https://dl.acm.org/doi/10.1007/s11042-023-14654-0
Vidal EToselli APuigcerver J(2023)Lexicon-based probabilistic indexing of handwritten text imagesNeural Computing and Applications10.1007/s00521-023-08620-y35:24(17501-17520)Online publication date: 10-May-2023
https://dl.acm.org/doi/10.1007/s00521-023-08620-y
Kumari LSingh SRathore VSharma A(2023)A Comprehensive Handwritten Paragraph Text Recognition System: LexiconNetDocument Analysis and Recognition – ICDAR 2023 Workshops10.1007/978-3-031-41501-2_16(226-241)Online publication date: 21-Aug-2023
https://dl.acm.org/doi/10.1007/978-3-031-41501-2_16
Rakshit PChatterjee SHalder CSen SObaidullah SRoy K(2022)Comparative study on the performance of the state-of-the-art CNN models for handwritten Bangla character recognitionMultimedia Tools and Applications10.1007/s11042-022-13909-682:11(16929-16950)Online publication date: 2-Nov-2022
https://dl.acm.org/doi/10.1007/s11042-022-13909-6
Sánchez JVidal EBosch V(2022)Effective Crowdsourcing in the EDT Project with Probabilistic IndexesDocument Analysis Systems10.1007/978-3-031-06555-2_20(291-305)Online publication date: 22-May-2022
https://dl.acm.org/doi/10.1007/978-3-031-06555-2_20
Nurseitov DBostanbekov KKurmankhojayev DAlimova AAbdallah ATolegenov R(2021)Handwritten Kazakh and Russian (HKR) database for text recognitionMultimedia Tools and Applications10.1007/s11042-021-11399-680:21-23(33075-33097)Online publication date: 1-Sep-2021
https://dl.acm.org/doi/10.1007/s11042-021-11399-6
Kaur HKumar M(2021)Offline handwritten Gurumukhi word recognition using eXtreme Gradient Boosting methodologySoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-020-05455-w25:6(4451-4464)Online publication date: 1-Mar-2021
https://dl.acm.org/doi/10.1007/s00500-020-05455-w
Bhattacharya RMalakar SSchwenker FSarkar R(2021)Fuzzy-Based Pseudo Segmentation Approach for Handwritten Word Recognition Using a Sequence to Sequence Model with AttentionPattern Recognition. ICPR International Workshops and Challenges10.1007/978-3-030-68790-8_45(582-596)Online publication date: 10-Jan-2021
https://dl.acm.org/doi/10.1007/978-3-030-68790-8_45
Zhang CGupta AZisserman A(2020)Adaptive Text Recognition Through Visual MatchingComputer Vision – ECCV 202010.1007/978-3-030-58517-4_4(51-67)Online publication date: 23-Aug-2020
https://dl.acm.org/doi/10.1007/978-3-030-58517-4_4
Show More Cited By

View Options

View options

Media

Figures

Other

Tables

View Issue’s Table of Contents