International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 106 - Number 15 |
Year of Publication: 2014 |
Authors: Ramesh Thakur |
10.5120/18597-9856 |
Ramesh Thakur . Context-free Grammar Learning from Text Document using Sequential Pattern. International Journal of Computer Applications. 106, 15 ( November 2014), 23-26. DOI=10.5120/18597-9856
The World-Wide-Web and information system has gained significant achievements over the last two decades as expressed their dominance in various business and scientific applications. As estimated by Blumberg and Atre more than 85% of all business information exists in the form of unstructured and semi-structured document, typically formatted for human viewing, not for system processing. Extracting information from these document are challenging task. Extracting grammar rules from these documents is interesting idea. Grammar rules can be used to create structural descriptions of text documents. In this paper I propose grammatical inference using sequential pattern to infer formal language (context free grammar), which describes the given sample set.