• Mar 26, 2024 News!Vol.16, No. 1 has been published with online version.   [Click]
  • Jan 02, 2024 News!All papers in IJET will be publihsed article by article staring from 2024.
  • Nov 03, 2023 News!News | Vol.15, No. 4 has been published with online version.   [Click]
General Information
    • ISSN: 1793-8236 (Online)
    • Abbreviated Title Int. J. Eng. Technol.
    • Frequency:  Quarterly 
    • DOI: 10.7763/IJET
    • APC: 500 USD
    • Managing Editor: Ms. Jennifer Zeng
    • Abstracting/ Indexing: Inspec (IET), CNKI Google Scholar, EBSCO, ProQuest, Crossref, Ulrich Periodicals Directory, Chemical Abstracts Services (CAS), etc.
    • E-mail: ijet_Editor@126.com
IJET 2011 Vol.3(4): 392-395 ISSN: 1793-8236
DOI: 10.7763/IJET.2011.V3.258

Algorithm to Detect and Segment Gurmukhi Handwritten Text into Lines, Words and Characters

Rajiv Kumar and Amardeep Singh

Abstract—The output of a scanner is a non editable scanned text image. Though the text is visible but one can neither edit it nor make any change, if required. This provides a basis for the optical character recognition (OCR) theory. OCR consists of generally three major phase; pre processing after image acquisition, segmentation and recognition. The segmentation process is the most crucial phase. The output of this phase decides the outcome of recognition phase. If this output is right then recognition phase would give the right output otherwise not. In this paper, we provide an algorithm which is used to segment the scanned document image as a lines, words and characters. The coordinates of line detected are used to find the word position present in that line. Finally, these words position coordinates are used to find characters present in the word. To detect lines and words, one module is proposed which is used to find both. For character detection, the reverse engineering is used, i.e. one part is extracted from the word present in the line. This extracted part is checked whether it has some meaningful symbol (as per Gurmukhi script). If it has then the extracted part is marked and written in the file, otherwise the extracted part is readjusted to find the symbol. This overall concept was implemented, and got encouraging results.

Index Terms—OCR, Segmentation, Gurmukhi, Handwritten, Feature, Water Reservoir, Line, Word.

Rajiv Kumar, Thapar University,(email: rajiv.patiala@gmail.com)
Amardeep Singh, Pbi University (email: amardeep_dhiman@yahoo.com)


Cite: Rajiv Kumar and Amardeep Singh, "Algorithm to Detect and Segment Gurmukhi Handwritten Text into Lines, Words and Characters," International Journal of Engineering and Technology vol. 3, no. 4, pp. 392-395, 2011.

Copyright © 2008-2024. International Journal of Engineering and Technology. All rights reserved. 
E-mail: ijet_Editor@126.com