International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 44 - Number 22 |
Year of Publication: 2012 |
Authors: I Bhuvana Chandra, K Nagarjunavarma, Gireesh Kumar |
10.5120/6413-8852 |
I Bhuvana Chandra, K Nagarjunavarma, Gireesh Kumar . Foreground Estimation in a Degraded Text document. International Journal of Computer Applications. 44, 22 ( April 2012), 31-37. DOI=10.5120/6413-8852
In this paper an attempt is made to retrieve the text region alone from a degraded text document. For doing that, four different filters are used for noise removal in the text document. Later document binarization is done using thresholding. Three different thresholding techniques are implemented for foreground-background separation. Then candidate region is selected and features are extracted. The features are then fed to an SVM to classify text and non-text regions. The proposed approach is implemented and tested on various hand written and machine printed degraded text documents.