International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 60 - Number 7 |
Year of Publication: 2012 |
Authors: andreia Dal Ponte Novelli, Jose Maria Parente De Oliveira |
10.5120/9703-4151 |
andreia Dal Ponte Novelli, Jose Maria Parente De Oliveira . A Method for Measuring Semantic Similarity of Documents. International Journal of Computer Applications. 60, 7 ( December 2012), 17-22. DOI=10.5120/9703-4151
With the documents increasing amount available in local or Web repositories, the comparison methods have to analyze large documents sets with different types and terminologies to obtain a response with minimum documents and with as much useful content to the user. For large documents sets where each document can contain many pages, it is impossible to compute the similarity using the entire document, to require creating solutions to analyze a few meaningful terms, in summary form. This article presents TextSSimily, a method that compares documents semantically considering only short text for comparison (text summary), using semantics to improve the set of responses and summaries to improve time to obtain results for large sets of documents.