International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 1 - Number 3 |
Year of Publication: 2010 |
Authors: Sobhana N.V, Pabitra Mitra, S.K. Ghosh |
10.5120/72-166 |
Sobhana N.V, Pabitra Mitra, S.K. Ghosh . Conditional Random Field Based Named Entity Recognition in Geological text. International Journal of Computer Applications. 1, 3 ( February 2010), 119-125. DOI=10.5120/72-166
The paper describes about the development of a Named Entity Recognition (NER) system for Geological text using Conditional Random Fields (CRFs). The system makes use of the different contextual information of the words along with the variety of features that are helpful in predicting the various named entity (NE) classes. The NE tagged geological corpus was developed from the collection of scientific reports and articles on the geology of the Indian subcontinent has been used to build up the system. The training set consists of more than 2 lakh words and has been manually annotated with a NE tag set of seventeen tags. The system is able to recognize 17 classes of NEs with 75.8% F-measure.