International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 86 - Number 18 |
Year of Publication: 2014 |
Authors: Pawan Kumar, Raj Kumar Goel, Prem Sagar Sharma |
10.5120/15087-3360 |
Pawan Kumar, Raj Kumar Goel, Prem Sagar Sharma . Domain Specific Named Entity Recognition (DSNER) from Web Documents. International Journal of Computer Applications. 86, 18 ( January 2014), 24-29. DOI=10.5120/15087-3360
Named entity recognition is a tool, which use process natural language tasks such as, text categorization, speech translation, and document classification. The Web data promotes the idea, that more and more data can be interconnected. A step towards this goal is to bring more structured annotations to existing documents using common vocabularies or ontology. Semi-structured texts such as scientific, medical, forum and blog posts can hence be semantically annotated. Named Entity (NE) extractors play a key role for extracting structured information by identifying features, also called entities, and by linking them to other web resources by means of typed inferences. Earlier many systems have been developed named entity recognition with substantial success save for the problem of being domain specific and making it difficult to use the different systems across domains. In this paper we introduce specific domain like science, medical and news, named entity recognition. This paper presents a system to recognize the Named Entity from web documents using ontology.