Showing 1–1 of 1 results for author: Islam, S B

  1. arXiv:2308.10647  [pdf, other]

    cs.CV

    bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

    Authors: Imam Mohammad Zulkarnain, Shayekh Bin Islam, Md. Zami Al Zunaed Farabe, Md. Mehedi Hasan Shawon, Jawaril Munshad Abedin, Beig Rajibul Hasan, Marsia Haque, Istiak Shihab, Syed Mobassir, MD. Nazmuddoha Ansary, Asif Sushmit, Farig Sadeque

    Abstract: Despite the existence of numerous Optical Character Recognition (OCR) tools, the lack of comprehensive open-source systems hampers the progress of document digitization in various low-resource languages, including Bengali. Low-resource languages, especially those with an alphasyllabary writing system, suffer from the lack of large-scale datasets for various document OCR components such as word-lev…

    Submitted 21 August, 2023; v1 submitted 21 August, 2023; originally announced August 2023.