Amrita International Conference of Women in Computing - 2013
Foundation of Computer Science USA
AICWIC - Number 3
January 2013
Authors: D. Sasirekha, E. Chandra
D. Sasirekha, E. Chandra. Text Extraction from PDF document. Amrita International Conference of Women in Computing - 2013. AICWIC, 3 (January 2013), 17-19.
PDF is now widely regarded as a universal document format. Converting a PDF document to speech involves many steps, and text extraction is the primary step on which all further processing depends. This paper begins with a brief discussion of the steps involved in extracting text from PDF documents. Its aim is to introduce some basic concepts of PDF and of text extraction, which will be useful to readers who are less familiar with this area of research.