We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Printed Text to Audio Converter using OCR

Published on June 2015 by Kalyani Mangale, Hemangi Mhaske, Priyanka Wankhade
National Conference on Emerging Trends in Advanced Communication Technologies
Foundation of Computer Science USA
NCETACT2015 - Number 2
June 2015
Authors: Kalyani Mangale, Hemangi Mhaske, Priyanka Wankhade
c93a803f-ae0f-40c6-a6ea-c4b7e76ee836

Kalyani Mangale, Hemangi Mhaske, Priyanka Wankhade . Printed Text to Audio Converter using OCR. National Conference on Emerging Trends in Advanced Communication Technologies. NCETACT2015, 2 (June 2015), 27-30.

@article{
author = { Kalyani Mangale, Hemangi Mhaske, Priyanka Wankhade },
title = { Printed Text to Audio Converter using OCR },
journal = { National Conference on Emerging Trends in Advanced Communication Technologies },
issue_date = { June 2015 },
volume = { NCETACT2015 },
number = { 2 },
month = { June },
year = { 2015 },
issn = 0975-8887,
pages = { 27-30 },
numpages = 4,
url = { /proceedings/ncetact2015/number2/20989-2025/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Emerging Trends in Advanced Communication Technologies
%A Kalyani Mangale
%A Hemangi Mhaske
%A Priyanka Wankhade
%T Printed Text to Audio Converter using OCR
%J National Conference on Emerging Trends in Advanced Communication Technologies
%@ 0975-8887
%V NCETACT2015
%N 2
%P 27-30
%D 2015
%I International Journal of Computer Applications
Abstract

For many blind users educational choices are made based on which material can be accessed and which cannot. These people are dependent solely on Braille books & audio recordings provided by NGOs. The presented work will provide visually impaired people, an opportunity to have an audio material of their own choice of any printed material. The framework consists of two parts. One is Optical Character Recognition (OCR) which includes operations like grayscaling, thresholding, filtering, thinning, segmentation, cropping, etc. on a character in the image and other part is text to speech conversion using Microsoft's API which will convert the text into speech (audio).

References
  1. Gonzalez Rafael . C "Digital Image Processing" Pearson Education Second Edition, Upper Saddle River New Jersey, USA, 2002
  2. Chapman Stephen . J "MATLAB programming for Engineering" second edition, 2002
  3. Forsyth David . A, Ponce Jean "Computer Vision-A Modern Approach" Pearson Education, First Edition, Upper Saddle River New Jersey, USA, 2003
  4. Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav, "Optical Character Recognition using MATLAB" International Journal of Advanced Research (2013), Volume 1, Issue 9, 757-767
  5. Oi-Mean Foong and Nurul Safwanah Bt Mohd Razali, "Signage Recognition Framework for Visually Impaired People", In. proc. of International Conference on Computer Communication and Management, Vol. 5, 2011.
  6. Azadeh Nazemi and Iain Murray,"An open source reading system for print disabilities", International Conference on Computing, E-Learning and Emerging Technology & International Conference on Advances in Computer , Electrical and Electronic Engineering - Sydney, Australia, (ISSN : 2091-1610 ) , Volume No : 12 Issue No : 2
Index Terms

Computer Science
Information Sciences

Keywords

Braille Books Ocr Api