conversion of text image document in southern … · conversion of text image document in southern...
TRANSCRIPT
http://www.iaeme.com/IJCET/index.asp 73 [email protected]
International Journal of Computer Engineering & Technology (IJCET) Volume 9, Issue 4, July-Aug 2018, pp. 73–84, Article IJCET_09_04_008
Available online at http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=9&IType=4
Journal Impact Factor (2016): 9.3590(Calculated by GISI) www.jifactor.com
ISSN Print: 0976-6367 and ISSN Online: 0976–6375
© IAEME Publication
CONVERSION OF TEXT IMAGE DOCUMENT
IN SOUTHERN INDIAN LANGUAGES INTO
BRAILLE FOR VISUALLY CHALLENGED
PEOPLE
Dr. G.Gayathri Devi
Department of Computer Science,
SDNB Vaishnav College for Women, Chennai, TamilNadu, India
G.Sathyanarayanan
Senior Professional Project Management,
DXC Technology,
ABSTRACT
Speech and text is the significant intermediate for human communication. The
Braille encoding system represents textual documents in a readable format for the
visually challenged person. This paper projects a model to assist visually impaired or
blind person in reading the text present on southern Indian Language text image by
converting the text into corresponding Braille script. The experimentation of the
algorithms was carried out on southern Indian Language text image dataset.
Experimental results show that the projected method has a good performance on
converting the extracted text regions in an image to corresponding Braille Script.
Key words: Dravidian Language, Southern Indian Language, Braille conversion, Braille
System, Visually challenged people, Image Processing.
Cite this Article: Dr. G.Gayathri Devi and G.Sathyanarayanan, Conversion of Text
Image Document In Southern Indian Languages Into Braille For Visually Challenged
People. International Journal of Computer Engineering & Technology, 9(4), 2018, pp.
73–84.
http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=9&IType=4
1. INTRODUCTION
The Braille encoding system represents textual documents in a readable format for the visually
challenged persons. As there is a shortage of Braille compatible reading materials, visually
challenged people face trouble in necessities like education and employment. Reading text
documents is difficult for visually challenged people in various circumstances. Visually
impaired persons can read only by use of Braille script. The majority of printed works does not
include Braille or speech versions. There is a need of a system for automatic recognition of text
documents to Braille and speech to reduce communication gap between the written text systems
Dr. G.Gayathri Devi and G.Sathyanarayanan
http://www.iaeme.com/IJCET/index.asp 74 [email protected]
used by sighted persons and access mechanisms through which visually impaired people can
communicate
2. BRAILLE SYSTEM
Braille [1,2] is a tactile writing system used by visually challenged people. Braille is a system
of raised dots that can be read with the fingers by people who have low vision or blind. Braille
is named after Louis Braille, the French man who designed. Braille symbols are formed within
units of space known as Braille cells. A complete Braille cell comprises of six raised dots
arranged in two parallel rows each having three dots. The locations of dot are recognized by
numbers from one through six and it is shown in the figure 1. Sixty-four combinations (2^6)
are possible using one or more of these six dots. A single cell can be used to denote an alphabet
letter, number, punctuation mark, or even an entire word.
Fig 1 Braille Cell
2.1. Southern Indian Language
The Dravidian languages are a language spoken mainly in Tamil Nadu, Kerala, Telangana,
Andhra, Karnataka and as well as in Bangladesh , Bhutan , Mauritius, Sri Lanka, some parts
of Pakistan, Burma, southern Afghanistan, Nepal, Malaysia, Africa, Indonesia and Singapore.
The Dravidian languages with the most speakers are Tamil, Kannada , Telugu and Malayalam.
Vowels and Consonants of Indian Languages Indian languages are phonetic in nature. All
the languages have their own set of Vowels and Consonants. They have a good mapping from
one language to the other with respect to the features. It is observed that the fundamental vowels
and consonants for all the four languages are same besides some specialty with respect to the
consonants for each of the languages and moreover the absence of some of the consonants in
the typical Dravidian language Tamil. Consonants and vowels are combined to form Composite
characters.
The Tamil script has 12 vowels 18 consonants and one special character, the ayudha
ezhuthu. The complete script, therefore, consists of the 31 letters in their independent form and
an additional 216 composite letters, for a total of 247 combinations of a consonant and a vowel,
a mute consonant, or a vowel alone.
The Telugu script consists of 60 symbols – 16 vowels, 3 vowel modifiers, and 41
consonants. The Malayalam alphabet has 13 vowel letters, 36 consonant letters, and a few other
symbols. Kannada language uses 49 phonemic letters and it is divided into 3-groups,15 Vowels
, 34 Consonants and modifier glyphs (Half-letter) .There are 510 (34 * 15)composite
characters.
2.2. Southern Indian Language Braille System
Braille is used by many people all over the world in their mother tongue, and gives a means of
knowledge for all. Bharati Braille [4] or Indian Braille is a largely unified Braille script for
writing the Indian languages. Initially 11 Braille scripts were in usage in various areas of the
Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually
Challenged People
http://www.iaeme.com/IJCET/index.asp 75 [email protected]
India and for different languages. Bharati Braille had become a nationwide standard Braille
script and it has been followed by Bangladesh., Sri Lanka, Nepal. Across the different Indian
languages there are up to sixteen vowels and about forty consonants. Figure 2, Figure 3, Figure
4 and Figure 5 shows the Tamil, Telugu, Malayalam and Kannada Braille alphabets sheets
respectively. The alphabets of Indian languages are divided into vowels and consonants. Across
the languages, fifteen vowels and thirty three consonants are common (with the exception of
Tamil) and hence the basic assignment in Bharati Braille is same. Bharati Braille assigns an
individual cell to each vowel and a consonant.
Fig 2 Tamil Braille Alphabets Sheet Fig 3 Telugu Braille Alphabets Sheet
Fig 4 Malayalam Braille Alphabets Sheet Fig 5Kannada Braille Alphabets Sheet
3. PROPOSED METHODLOGY
Gayathri et.al [4] discussed research work projected earlier for recognition the Southern Indian
Braille script from a Braille document.
This research proposes a successful conversion system from southern Indian Languages to
Braille and it contains i) Image Pre-processing, ii) OCR iii) Unicode Conversion, iv) Grade 1
Braille script conversion The flow of proposed system is shown in fig 5.
Dr. G.Gayathri Devi and G.Sathyanarayanan
http://www.iaeme.com/IJCET/index.asp 76 [email protected]
Fig 5 Flow of the Work
3.1. Work Flow
Mapping Table Creation - A database is created where it stores the alphabets of Southern
Indian Languages and its corresponding Braille Unicode and its Braille equivalent symbol.
Input - The system accepts scanned document image and the language as input
Stage 1: Preprocessing - Preprocessing is necessary for efficient recovery of the text
information from scanned image. A text extraction algorithm [5,6] was presented to preserve
text area and remove non-text area. The method was based on gamma correction method and
positional connected component labeling algorithm [7,8].
Stage 2: OCR Conversion – The language of scanned text document image is selected by the
user. OCR System is a system that automates the process of getting and processing the images
of text document into editable text file. The output of stage 1 is given to the corresponding free
open OCR Engine for text conversion.
Stage 3: Unicode Conversion system – The editable text is converted in to its expanded form
and those expanded text is converted into its corresponding Unicode equivalent.
Stage 4: Braille Conversion System – The expanded text Unicode is searched in look up table
and its corresponding Braille Unicode is identified and it is converted to its Braille Symbol.
3.2. Algorithm
The steps of the algorithm are as follows:
Algorithm – Text Image to Braille Conversion
Input: Scanned Text Image and the language of Scanned Image
Output: Braille Equivalent
1. The Scanned Image is Preprocessed and converted into editable String SLangChar[] using
appropriate OCR Engine.
2. Create a Mapping look-up database (Slang2SlangUni) where it stores Southern Indian language
alphabet and its corresponding Unicode
3. Create a Mapping look-up database (Charuni2BrailleUni where it stores Southern Indian
language Unicode and its corresponding Braille Unicode.
4. Create a Mapping look-up database (BrailleUni2BrailleSym) where it stores Braille Unicode
and its corresponding Symbol.
5. Buni [] = BSym[]= SLanguni[]=SLangCharExtend[]=NULL
6. Read the text string SLangChar[]
7. For i =1 to len of SLangChar
Is SLangChar[i] a Vowel or Consonant
Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually
Challenged People
http://www.iaeme.com/IJCET/index.asp 77 [email protected]
SLangCharExtend[]+= SLangChar[i]
else
Is SLangChar[i] a Syllable
Break the corresponding Syllable in to Consonant[s] +Vowel and strore in
temp[]
SLangCharExtend[]+=temp
End
End for
For i =1 to len of SLangCharExtend
Find the Match for SLangCharExtend[i] in Mapping look-up
database(Slang2SlangUni) and store in Tlang
SLanguni[]+=Tlang
End for 8. For i =1 to len of SLanguni
Find the Match for SLanguni[i] in Mapping look-up database
(Charuni2BrailleUni database) and store in Tuni
Buni[]+=Tuni
End for 9. For i =1 to len of Buni[]
Find the Match for Buni[i] in Mapping look-up database
(BrailleUni2BrailleSym ) and store in Tsym
Bsym[]+=Tsym
End for 10. Return BSym[].
Indian Language Braille follows the expanded format for representing the vowel-consonant
combined character. For ex, the Braille notation for த and இ/◌ி are written together to
represent the combined character தி. The normal text is converted in to expanded form using
look up table. The expanded form is converted in to Unicode form. Then the Unicode form is
converted in to its corresponding Braille Unicode version using look up table. At last, Braille
Unicode version is converted in to Braille version using look up table. Some of the examples
is shown below
Example 1
Tamil Word -
Expanded form - =
Tamil Unicode Version -
Braille Equivalent Unicode –
���பாைவ
த+ இ+ ர+ உ+ ப+ ◌+ ப+ ஆ+ வ+ ஐ
0BA4 0B87 0BB0 0B89 0BAA 0BCD 0BAA 0B86 0BB5 0B90
281E 280A 2817 2825 280F 2808 280F 281C 2827 280C
Dr. G.Gayathri Devi and G.Sathyanarayanan
http://www.iaeme.com/IJCET/index.asp 78 [email protected]
Tamil Braille Version -
Example 2
Telugu Word -
Expanded form - =
Telugu Unicode Version -
Braille Equivalent Unicode -
Telugu Braille Version -
Example 3
Kannada Word -
Expanded form - =
Kannada Unicode Version -
Braille Equivalent Unicode -
Kannada Braille Version -
Example 4
Malayalam Word -
Expanded form - =
Malayalam Unicode Version -
Braille Equivalent Unicode -
⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧ ⠌
�ర�
త ఇ ర ఉ ప ◌ ప ఆ వ ఐ
0C24 0C07 0C30 0C09 0C2A 0C4D 0C2A 0C06 0C35 0C10
281E 280A 2817 2825 280F 2808 280F 281C 2827
⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧
തിര�ാൈവ
ത ഇ ര എ പ ◌ പ ആ വ ഐ
0D24 0D3F 0D30 0D41 0D2A 0D4D 0D2A 0D3E 0D35
281E 280A 2817 2825 280F 2808 280F 281C 2827
�ರು�ಾ��ೖ
ತ ಇ ರ ಉ ಪ ◌ ಪ ಆ ವ ಐ
0CA4 0CBF 0CB0 0CC1 0CAA 0CCD 0CAA 0CBE 0CB5
281E 280A 2817 2825 280F 2808 280F 281C 2827
⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧
Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually
Challenged People
http://www.iaeme.com/IJCET/index.asp 79 [email protected]
Malayalam Braille Version -
4. EXPERIMENTAL RESULTS
The experimentation of the algorithms was carried out on Tamil, Telugu, Malayalam, Kannada
text Image. Recognition rate achieved is 90% and the accuracy rate of converting Braille from
recognized text is 100%. Some of the experimental results are shown below.
Input - Tamil Text Image
⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧
Expanded Tamil Text
வ ஆ ர ண ம ஆ ய இ ர ம ◌ ச ஊ ழ வ ல ம ◌ ச எ ய ◌ த உ
ந ஆ ர ண ந ம ◌ ப இ ந ட க ◌ க இ ன ◌ ற ஆ ன எ ன ◌ ற எ த இ ர ◌
ப ஊ ர ண ப ஒ ற ◌ க உ ட ம ◌ வ ஐ த ◌ த உ ப ◌ ப உ ற ம எ ங ◌ க உ ம ◌
த ஓ ர ண ம ◌ ந ஆ ட ◌ ட க ◌ க ன ஆ க ◌ க ண ◌ ட ஏ ன ◌ த ஓ ழ ஈ ந ஆ ன ◌
Tamil Unicode Equivalent
0BB5 0B86 0BB0 0BA3 0020 0BAE 0B86 0BAF 0B87 0BB0 0BAE 0BCD 0020 0B9A 0B8A 0BB4 0BB5 0020 0BB2 0BAE
0BCD 0B9A 0B8E 0BAF 0BCD 0BA4 0B89 000A
0BA8 0B86 0BB0 0BA3 0020 0BA8 0BAE 0BCD 0BAA 0B87 0020 0BA8 0B9F 0B95 0BCD 0B95 0B87 0BA9 0BCD 0BB1 0B86 0020
0BA9 0B8E 0BA9 0BCD 0BB1 0B8E 0BA4 0B87 0BB0 0BCD 000A
0BAA 0B8A 0BB0 0BA3 0020 0BAA 0B92 0BB1 0BCD 0B95 0B89 0B9F 0BAE 0BCD 0020 0BB5 0B90 0BA4 0BCD 0BA4 0B89 0BAA
0BCD 0020 0BAA 0B89 0BB1 0BAE 0B8E 0B99 0BCD 0B95 0B89 0BAE 0BCD 000A
0BA4 0B93 0BB0 0BA3 0BAE 0BCD 0020 0BA8 0B86 0B9F 0BCD 0B9F 0B95 0BCD 0020 0B95 0BA9 0B86 0B95 0BCD 0B95 0BA3
0BCD 0B9F 0B8F 0BA9 0BCD 0020 0BA4 0B93 0BB4 0B88 0BA8 0B86 0BA9 0BCD
Braille Unicode Mapping
2827 281C 2817 283C 0020 280D 281C 283D 280A 2817 280D 2808 0020 2809 2833 2837 2827 0020 2807 280D 2808 2809 2822
283D 2808 281E 2825 000A
281D 281C 2817 283C 0020 281D 280D 2808 280F 280A 0020 281D 283E 2805 2808 2805 280A 2830 2808 283B 281C 0020 2830
2822 2830 2808 283B 2822 281E 280A 2817 2808 000A
280F 2833 2817 283C 0020 280F 282D 283B 2808 2805 2825 283E 280D 2808 0020 2827 280C 281E 2808 281E 2825 280F 2808
0020 280F 2825 283B 280D 2822 282C 2808 2805 2825 280D 2808 000A
281E 2815 2817 283C 280D 2808 0020 281D 281C 283E 2808 283E 2805 2808 0020 2805 2830 281C 2805 2808 2805 283C 2808
283E 2821 2830 2808 0020 281E 2815 2837 2814 281D 281C 2830 2808
Editable Tamil Text after applying Preprocessing Technique & OCR
வாரண மாய�ர !ழவ ல ெச#$
நாரண ந ப� நட%கி&றா ென&ெறதி'
(ரண ெபா)*ட ைவ,$- .றெம/*
ேதாரண நா1ட% கனா%க2ேட& ேதாழ3நா&
Dr. G.Gayathri Devi and G.Sathyanarayanan
http://www.iaeme.com/IJCET/index.asp 80 [email protected]
Input – Telugu Text Image
Tamil Braille Equivalent
⠧⠜⠗⠼ ⠍⠜⠽⠊⠗⠍⠈ ⠉⠳⠷⠧ ⠇⠍⠈⠉⠢⠽⠈⠞⠥
⠝⠜⠗⠼ ⠝⠍⠈⠏⠊ ⠝⠾⠅⠈⠅⠊⠰⠈⠻⠜ ⠰⠢⠰⠈⠻⠢⠞⠊⠗⠈
⠏⠳⠗⠼ ⠏⠭⠻⠈⠅⠥⠾⠍⠈ ⠧⠌⠞⠈⠞⠥⠏⠈ ⠏⠥⠻⠍⠢⠬⠈⠅⠥⠍⠈
⠞⠕⠗⠼⠍⠈ ⠝⠜⠾⠈⠾⠅⠈ ⠅⠰⠜⠅⠈⠅⠼⠈⠾⠡⠰⠈ ⠞⠕⠷⠔⠝⠜⠰⠈
Expanded Telugu Text
చఇట�ఇ చఇలకమఆ అమ కఒట�ఇందఆ
తఓటకఎల�ఆవఆ పండఉ తఎచ�ఆవఆ
గఊట� ఓ పఎట�ఆవఆ గఉటఉక�ఉన మఇంగఆవవ ◌
Telugu Unicode Equivalent
0C1A 0C07 0C1F 0C4D 0C1F 0C07 0020 0C1A 0C07 0C32 0C15 0C2E 0C4D 0C2E 0C06 0020 0C05 0C2E 0C4D 0C2E 0020
0C15 0C12 0C1F 0C4D 0C1F 0C07 0C02 0C26 0C06 000A
0C24 0C13 0C1F 0C15 0C0E 0C32 0C4D 0C32 0C06 0C35 0C06 0020 0C2A 0C02 0C21 0C09 0020 0C24 0C0E 0C1A 0C4D
0C1A 0C06 0C35 0C06 000A
0C17 0C0A 0C1F 0C4D 0C32 0C13 0020 0C2A 0C0E 0C1F 0C4D 0C1F 0C06 0C35 0C06 0020 0C17 0C09 0C1F 0C09 0C15
0C4D 0C15 0C09 0C28 0020 0C2E 0C07 0C02 0C17 0C06 0C35 0C06
Braille Unicode Mapping
2809 280A 283E 2808 283E 280A 0020 2809 280A 2807 2805 280D 2808 280D 281C 0020 2801
280D 2808 280D 0020 2805 282D 283E 2808 283E 280A 2818 2819 281C 000A
281E 2815 283E 2805 2822 2807 2808 2807 281C 2827 281C 0020 280F 2818 282B 2825 0020
281E 2822 2809 2808 2809 281C 2827 281C 000A
281B 2833 283E 2808 2807 2815 0020 280F 2822 283E 2808 283E 281C 2827 281C 0020 281B
2825 283E 2825 2805 2808 2805 2825 281D 0020 280D 280A 2818 281B 281C 2827 281C
Editable Telugu Text after applying Preprocessing Technique & OCR
�ట�� �లకమ అమ !"ట�� ం#$
%&ట!'ల � ( పండ) %+$�(
గ,ట-� ./ట0� ( గ1ట2క3�న 4ం5(
Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually
Challenged People
http://www.iaeme.com/IJCET/index.asp 81 [email protected]
Input -Kannada Text Image
Telugu Braille Equivalent
⠉⠊⠾⠈⠾⠊ ⠉⠊⠇⠅⠍⠈⠍⠜ ⠁⠍⠈⠍ ⠅⠭⠾⠈⠾⠊⠘⠙⠜
⠞⠕⠾⠅⠢⠇⠈⠇⠜⠧⠜ ⠏⠘⠫⠥ ⠞⠢⠉⠈⠉⠜⠧⠜
⠛⠳⠾⠈⠇⠕ ⠏⠢⠾⠈⠾⠜⠧⠜ ⠛⠥⠾⠥⠅⠈⠅⠥⠝ ⠍⠊⠘⠛⠜⠧⠜
Expanded Kannada Text
ತ ಓ ಟ ಕ ಎ ಹ ಓ ಗ ಓ ತ ಇ ಮ ◌ ಮ
ತ ಓ ಳ ಬ ◌ಂ ದ ಈ ತ ಮ ◌ ಮ
ಹ ಸ ಉ ಮ ಐ ಸ ಓ ತ ಇ ಮ ◌ ಮ
ಹ ಸ ಉ ಹ ಆ ದ ಈ ತ ಮ ◌ ಮ
Kannada Unicode Equivalent
0CA4 0C93 0C9F 0C95 0C8E 0020 0CB9 0C93 0C97 0C93 0020 0CA4 0C87 0CAE 0CCD 0CAE 000A
0CA4 0C93 0CB3 0020 0CAC 0C82 0CA6 0C88 0CA4 0CAE 0CCD 0CAE 000A
0CB9 0CB8 0C89 0020 0CAE 0C90 0CB8 0C93 0020 0CA4 0C87 0CAE 0CCD 0CAE 000A
0CB9 0CB8 0C89 0020 0CB9 0C86 0CA6 0C88 0CA4 0CAE 0CCD 0CAE
Braille Unicode Mapping
281E 2815 283E 2805 2822 0020 2813 2815 281B 2815 0020 281E 280A 280D 2808 280D 000A
281E 2815 2838 0020 2803 2818 2819 2814 281E 280D 2808 280D 000A
2813 280E 2825 0020 280D 280C 280E 2815 0020 281E 280A 280D 2808 280D 000A
2813 280E 2825 0020 2813 281C 2819 2814 281E 280D 2808 280D
Editable Kannada Text after applying Preprocessing Technique & OCR
!ೂೕಟ$ %ೂೕ&ೂೕ �ಮ'
!ೂೕಳ ಬಂ(ೕತಮ'
ಹಸು )ೖ*ೂೕ �ಮ'
ಹಸು %ಾ(ೕತಮ'
Dr. G.Gayathri Devi and G.Sathyanarayanan
http://www.iaeme.com/IJCET/index.asp 82 [email protected]
Input - Malayalam Text Image
Kannada Braille Equivalent
⠞⠕⠾⠅⠢ ⠓⠕⠛⠕ ⠞⠊⠍⠈⠍
⠞⠕⠸ ⠃⠘⠙⠔⠞⠍⠈⠍
⠓⠎⠥ ⠍⠌⠎⠕ ⠞⠊⠍⠈⠍
⠓⠎⠥ ⠓⠜⠙⠔⠞⠍⠈⠍
Expanded Malayalam Text
കഒച ◌ചഉ പഊച ◌ച കഉഞ ◌ഞഇനഒരഉ കഒച ◌ചമളഇ പറ ◌റഇ കആച ◌ചഇ വച ◌ച ചഊട ◌ പആല ◌ ഓടഇ ചഎന ◌ന ◌ നക ◌കഇ കഉഞ ◌ഞഉ നആവ ◌ പഒള ◌ളഇപ ◌പഓയഇ കഉഞ ◌ഞഇ പഊച ◌ച കഏണഉ മ ◌യആവഉ മ ◌യആവഉ മ ◌യആവഉ മ ◌യആവഉ
Malayalam Unicode Equivalent
0D15 0D12 0D1A 0D4D 0D1A 0D09 0020 0D2A 0D0A 0D1A 0D4D 0D1A 0020 0D15 0D09 0D1E 0D4D 0D1E 0D07 0D28 0D12 0D30
0D09 0020 0D15 0D12 0D1A 0D4D 0D1A 0D2E 0D33 0D07 0020 0D2A 0D31 0D4D 0D31 0D07 000A
0D15 0D06 0D1A 0D4D 0D1A 0D07 0020 0D35 0D1A 0D4D 0D1A 0020 0D1A 0D0A 0D1F 0D4D 0020 0D2A 0D06 0D32 0D4D 0020
0D13 0D1F 0D07 0020 0D1A 0D0E 0D28 0D4D 0D28 0D4D 0020 0D28 0D15 0D4D 0D15 0D07 0020 000A
0D15 0D09 0D1E 0D4D 0D1E 0D09 0020 0D28 0D06 0D35 0D4D 0020 0D2A 0D12 0D33 0D4D 0D33 0D07 0D2A 0D4D 0D2A 0D13
0D2F 0D07 0020 0D15 0D09 0D1E 0D4D 0D1E 0D07 0020 0D2A 0D0A 0D1A 0D4D 0D1A 0020 0D15 0D0F 0D23 0D09 0020 000A
0D2E 0D4D 0D2F 0D06 0D35 0D09 0020 0D2E 0D4D 0D2F 0D06 0D35 0D09 0020 0D2E 0D4D 0D2F 0D06 0D35 0D09 0020 0D2E 0D4D
0D2F 0D06 0D35 0D09
Editable Malayalam Text after applying Preprocessing Technique & OCR
െകാ"# പ" ക%ിെനാര െകാ"മളി പ&ി കാ"ി വ" ച' പാ( ഓടി െച) ന*ി ക% നാ+ െപാ,ിേ�ായി ക%ി പ" േകണ മ.ാവ മ.ാവ മ.ാവ മ.ാവ
Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually
Challenged People
http://www.iaeme.com/IJCET/index.asp 83 [email protected]
CONCLUSION
This proposed approach presents an algorithm to assist visually impaired or blind person in
reading the Southern Indian languages text present on printed image. Experimental results show
that the proposed method has a good performance on converting text regions in an image into
Braille Script. The research work successfully converts Tamil, Telugu, Kannada, Malayalam
languages to Braille script and this work can be further developed to recognize Indian language
present in an image and convert it to corresponding Braille system.
ACKNOWLEDGMENT
This work is a part of UGC Minor Research Project. Reference Number-No.F MRP-
6538/16(SERO/UGC), Dated: 30/06/2017.Authors thank the Management and Principal of
Shrimathi Devkunvar Nanalal Bhatt Vaishnav College for Women for extending their support
for this project.
REFERENCES
[1] Working with Braille: A study of Braille as a medium of communication BARRY
HAMPSHIRE,the UNESCO Press, 1981
[2] https://en.wikipedia.org/wiki/Braille
[3] http://www.acb.org/tennessee/braille.html
[4] G.Gayathri Devi,G.Sathyanarayanan , Braille Document Recognition in Southern Indian
Languages – A Review"", EEE Fourth International Conference on Advances in Electrical,
Electronics, Information, Communication And Bioinformatics at Prathyusha Engineering
College, Chennai , Feb 21-22 2018,
Braille Unicode Mapping
2805 282D 2809 2808 2809 2825 0020 280F 2833 2809 2808 2809 0020 2805 2825 2812 2808 2812 280A
281D 282D 2817 2825 0020 2805 282D 2809 2808 2809 280D 2838 280A 0020 280F 283B 2808 283B
280A 000A
2805 281C 2809 2808 2809 280A 0020 2827 2809 2808 2809 0020 2809 2833 283E 2808 0020 280F 281C
2807 2808 0020 2815 283E 280A 0020 2809 2822 281D 2808 281D 2808 0020 281D 2805 2808 2805
280A 0020 000A
2805 2825 2812 2808 2812 2825 0020 281D 281C 2827 2808 0020 280F 282D 2838 2808 2838 280A 280F
2808 280F 2815 283D 280A 0020 2805 2825 2812 2808 2812 280A 0020 280F 2833 2809 2808 2809 0020
2805 2821 283C 2825 0020 000A
280D 2808 283D 281C 2827 2825 0020 280D 2808 283D 281C 2827 2825 0020 280D 2808 283D 281C 2827
2825 0020 280D 2808 283D 281C 2827 2825
Malayalam Braille Equivalent
⠅⠭⠉⠈⠉⠥ ⠏⠳⠉⠈⠉ ⠅⠥⠒⠈⠒⠊⠝⠭⠗⠥ ⠅⠭⠉⠈⠉⠍⠸⠊ ⠏⠻⠈⠻⠊
⠅⠜⠉⠈⠉⠊ ⠧⠉⠈⠉ ⠉⠳⠾⠈ ⠏⠜⠇⠈ ⠕⠾⠊ ⠉⠢⠝⠈⠝⠈ ⠝⠅⠈⠅⠊
⠅⠥⠒⠈⠒⠥ ⠝⠜⠧⠈ ⠏⠭⠸⠈⠸⠊⠏⠈⠏⠕⠽⠊ ⠅⠥⠒⠈⠒⠊ ⠏⠳⠉⠈⠉ ⠅⠡⠼⠥
⠍⠈⠽⠜⠧⠥ ⠍⠈⠽⠜⠧⠥ ⠍⠈⠽⠜⠧⠥ ⠍⠈⠽⠜⠧⠥
Dr. G.Gayathri Devi and G.Sathyanarayanan
http://www.iaeme.com/IJCET/index.asp 84 [email protected]
[5] Dr.C.P.Sumathi, G.Gayathri Devi, 2014, “Automatic Text Extraction from Complex
Colored Images Using Gamma Correction Method”, Journal of Computer Science, pages
706-715. (Scopus Indexed)
[6] G.Gayathri Devi, Dr.C.P.Sumathi, “Text Extraction from Images using Gamma Correction
Method and different Text Extraction Methods – A Comparative Analysis”, International
Conference on Information Communication and Embedded Systems ICICES 2014, S.A.
Engineering College, Chennai, Feb 27-28, 2014. (Scopus Indexed)
[7] G. Gayathri Devi, C. P. Sumathi, "Positional Connected Component Labeling Algorithm",
Indian Journal of Science & Technology, Volume 7, Issue 3, March 2014, pp 306-311.
[8] G Gayathri Devi, G Sathyanarayanan ,“A Connected Components Labeling Algorithm for
4-Connectivity Based On Position Matrix”, International Journal of Scientific Research in
Computer Science, Engineering and Information Technology Volume (2) : Issue (6) : Dec
2017 , ISSN:2456-3307 pg 50 to 58