• Jul 19, 2022 News!Vol.14, No. 3 has been published with online version.   [Click]
  • Apr 25, 2022 News!News | Vol.14, No. 2 has been published with online version.   [Click]
  • Dec 24, 2021 News!Vol. 13, no. 1 & no. 2 has been indexed by INSPEC!   [Click]
General Information
    • ISSN: 1793-8236 (Online)
    • Abbreviated Title Int. J. Eng. Technol.
    • Frequency:  Quarterly 
    • DOI: 10.7763/IJET
    • Executive Editor: Ms.Yoyo Y. Zhou
    • Abstracting/ Indexing: Chemical Abstracts Services (CAS) EBSCO, Google Scholar, Ulrich Periodicals Directory, Crossref, ProQuest, Index CopernicusINSPECCNKI.
    • E-mail: ijet@vip.163.com
Prof. T. Hikmet Karakoc
Anadolu University, Faculty of Aeronautics and Astronautics, Turkey

IJET 2011 Vol.3(4): 392-395 ISSN: 1793-8236
DOI: 10.7763/IJET.2011.V3.258

Algorithm to Detect and Segment Gurmukhi Handwritten Text into Lines, Words and Characters

Rajiv Kumar and Amardeep Singh
Abstract—The output of a scanner is a non editable scanned text image. Though the text is visible but one can neither edit it nor make any change, if required. This provides a basis for the optical character recognition (OCR) theory. OCR consists of generally three major phase; pre processing after image acquisition, segmentation and recognition. The segmentation process is the most crucial phase. The output of this phase decides the outcome of recognition phase. If this output is right then recognition phase would give the right output otherwise not. In this paper, we provide an algorithm which is used to segment the scanned document image as a lines, words and characters. The coordinates of line detected are used to find the word position present in that line. Finally, these words position coordinates are used to find characters present in the word. To detect lines and words, one module is proposed which is used to find both. For character detection, the reverse engineering is used, i.e. one part is extracted from the word present in the line. This extracted part is checked whether it has some meaningful symbol (as per Gurmukhi script). If it has then the extracted part is marked and written in the file, otherwise the extracted part is readjusted to find the symbol. This overall concept was implemented, and got encouraging results.

Index Terms—OCR, Segmentation, Gurmukhi, Handwritten, Feature, Water Reservoir, Line, Word.

Rajiv Kumar, Thapar University,(email: rajiv.patiala@gmail.com)
Amardeep Singh, Pbi University (email: amardeep_dhiman@yahoo.com)


Cite: Rajiv Kumar and Amardeep Singh, "Algorithm to Detect and Segment Gurmukhi Handwritten Text into Lines, Words and Characters," International Journal of Engineering and Technology vol. 3, no. 4, pp. 392-395, 2011.

Copyright © 2008-2022. International Journal of Engineering and Technology. All rights reserved. 
E-mail: ijet@vip.163.com