EASTER: Efficient and Scalable Text Recognizer

Chaudhary, Kartik; Bali, Raghav

doi:10.21428/594757db.65eda33c

Computer Science > Computer Vision and Pattern Recognition

arXiv:2008.07839 (cs)

[Submitted on 18 Aug 2020 (v1), last revised 19 Aug 2020 (this version, v2)]

Title:EASTER: Efficient and Scalable Text Recognizer

Authors:Kartik Chaudhary, Raghav Bali

View PDF

Abstract:Recent progress in deep learning has led to the development of Optical Character Recognition (OCR) systems which perform remarkably well. Most research has been around recurrent networks as well as complex gated layers which make the overall solution complex and difficult to scale. In this paper, we present an Efficient And Scalable TExt Recognizer (EASTER) to perform optical character recognition on both machine printed and handwritten text. Our model utilises 1-D convolutional layers without any recurrence which enables parallel training with considerably less volume of data. We experimented with multiple variations of our architecture and one of the smallest variant (depth and number of parameter wise) performs comparably to RNN based complex choices. Our 20-layered deepest variant outperforms RNN architectures with a good margin on benchmarking datasets like IIIT-5k and SVT. We also showcase improvements over the current best results on offline handwritten text recognition task. We also present data generation pipelines with augmentation setup to generate synthetic datasets for both handwritten and machine printed text.

Comments:	9 pages, fixed typos and minor edits
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2008.07839 [cs.CV]
	(or arXiv:2008.07839v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2008.07839
Journal reference:	Proceedings of the Canadian Conference on Artificial Intelligence 2021
Related DOI:	https://doi.org/10.21428/594757db.65eda33c

Submission history

From: Raghav Bali [view email]
[v1] Tue, 18 Aug 2020 10:26:03 UTC (4,738 KB)
[v2] Wed, 19 Aug 2020 14:02:50 UTC (4,802 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:EASTER: Efficient and Scalable Text Recognizer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:EASTER: Efficient and Scalable Text Recognizer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators