International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 102 - Number 3 |
Year of Publication: 2014 |
Authors: Jan-Hendrik Worch, Bjoern Gottfried |
10.5120/17792-8585 |
Jan-Hendrik Worch, Bjoern Gottfried . Choosing Shape Features by means of Genetic Algorithms for Gylph-clustering of Historical Documents. International Journal of Computer Applications. 102, 3 ( September 2014), 1-6. DOI=10.5120/17792-8585
The solution for a feature selection problem is presented in the field of document image processing. The choice of shape features for describing glyphs of historical documents is a non-trivial task since the variations of glyphs in different documents is innumerable. Hence, the manual selection of shape features would be a cumbersome task. To select a subset of features from a given set a genetic algorithm is used which optimises the result of a clustering process by x-means. The result of x-means is evaluated by using different quality measures. The optimisation methodology is illustrated within a case study, in which the selection of an appropriate set of features is a crucial part of the system. The intended application supports a user who is transcribing historical documents by showing him similar occurrences of a given glyph.