International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 110 - Number 1
Year of Publication: 2015
Authors: S. Nagaprasad, P. Vijayapal Reddy, A. Vinaya Babu
DOI: 10.5120/19277-0686
S. Nagaprasad, P. Vijayapal Reddy, A. Vinaya Babu. Authorship Attribution based on Data Compression for Telugu Text. International Journal of Computer Applications. 110, 1 (January 2015), 1-5. DOI=10.5120/19277-0686
Authorship attribution (AA) is the task of inferring characteristics of a document's author from the textual characteristics of the document itself. In this paper we evaluate compression-based models for AA on Telugu text. We consider six compressors, namely Zip, BZip, GZip, LZW, PPM, and PPMd, in combination with three compression distance measures: Normalized Compression Distance (NCD), Compression-based Dissimilarity Measure (CDM), and Conditional Complexity of Compression (CCC). The results show that compression models are a good alternative to feature-based classification models for authorship attribution.
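To make the three distance measures concrete, the following is a minimal sketch and not the authors' implementation. It assumes the commonly cited formulations, with C(.) denoting compressed size in bytes: NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), CDM(x, y) = C(xy) / (C(x) + C(y)), and CCC(q | r) = C(rq) - C(r). Python's standard `zlib` (DEFLATE, the algorithm behind Zip and GZip) stands in for the compressors named in the abstract; LZW, PPM, and PPMd are not in the standard library. The names `csize`, `ncd`, `cdm`, `ccc`, and `attribute` are illustrative only.

```python
import zlib
from typing import Callable, Dict

# A compressor is any function mapping bytes to compressed bytes.
Compressor = Callable[[bytes], bytes]


def csize(data: bytes, compress: Compressor = zlib.compress) -> int:
    """Compressed size of `data` in bytes, i.e. C(data)."""
    return len(compress(data))


def ncd(x: bytes, y: bytes, compress: Compressor = zlib.compress) -> float:
    """Normalized Compression Distance (assumed standard formulation)."""
    cx, cy = csize(x, compress), csize(y, compress)
    cxy = csize(x + y, compress)
    return (cxy - min(cx, cy)) / max(cx, cy)


def cdm(x: bytes, y: bytes, compress: Compressor = zlib.compress) -> float:
    """Compression-based Dissimilarity Measure: C(xy) / (C(x) + C(y))."""
    return csize(x + y, compress) / (csize(x, compress) + csize(y, compress))


def ccc(query: bytes, reference: bytes,
        compress: Compressor = zlib.compress) -> int:
    """Conditional complexity of `query` given `reference`: C(rq) - C(r)."""
    return csize(reference + query, compress) - csize(reference, compress)


def attribute(query: bytes, profiles: Dict[str, bytes],
              distance=ncd, compress: Compressor = zlib.compress) -> str:
    """Return the author whose known text is closest to `query`.

    `profiles` maps an author name to that author's concatenated
    training text (a hypothetical data layout for this sketch).
    """
    return min(profiles,
               key=lambda author: distance(query, profiles[author], compress))
```

For Telugu text, each document would first be encoded to bytes, for example with `text.encode("utf-8")`. Another standard-library compressor such as `bz2.compress` can be passed as `compress` to compare compressors in the way the abstract describes.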