CFP last date
20 January 2025
Reseach Article

Comparison of IR Models for Text Classification

by Priyanka Desai, G. R. Kulkarni
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 61 - Number 22
Year of Publication: 2013
Authors: Priyanka Desai, G. R. Kulkarni
10.5120/10228-4713

Priyanka Desai, G. R. Kulkarni . Comparison of IR Models for Text Classification. International Journal of Computer Applications. 61, 22 ( January 2013), 20-25. DOI=10.5120/10228-4713

@article{ 10.5120/10228-4713,
author = { Priyanka Desai, G. R. Kulkarni },
title = { Comparison of IR Models for Text Classification },
journal = { International Journal of Computer Applications },
issue_date = { January 2013 },
volume = { 61 },
number = { 22 },
month = { January },
year = { 2013 },
issn = { 0975-8887 },
pages = { 20-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume61/number22/10228-4713/ },
doi = { 10.5120/10228-4713 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:10:18.594391+05:30
%A Priyanka Desai
%A G. R. Kulkarni
%T Comparison of IR Models for Text Classification
%J International Journal of Computer Applications
%@ 0975-8887
%V 61
%N 22
%P 20-25
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

As there is availability of large amount of data on the web, but due to constraints web is only used for browsing and searching. Traditional IE uses NLP techniques such as lexicons, grammars, whereas web applies machine learning and pattern mining techniques to exploit the syntactical patterns or layout structures of the template-based documents. Information Retrieval is the art of presentation, storage, organization of and access to information items. IR now–-days mainly deals with retrieving information based on user queries. The paper deals with basic understanding of IR and IR models and shows Support Vector Machines is a good technique fir classification of huge data sets.

References
  1. Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, 2009,Introduction to Information Retrieval, Cambridge University Press.
  2. M. Mitra, B. B. Chaudhuri,2000,Information Retrieval from Documents: A Survey,ACM,May 2000, Volume 2, Issue 2-3, pp 141-163
  3. Baeza-Yates, R. and Ribeiro-Neto, B. (2011). ,Modern Information Retrieval - the concepts and technology behind search. Addison Wesley.
  4. Croft, Metzler, and Strohman, 2010,"Search Engines: Information Retrieval in Practice," According to Information Retrieval in Practice by Croft, Metzler, and Strohman
  5. Djoerd Hiemstra,University of Twente ,Goker, A. , and Davies, J. ,November 2009, Information Retrieval: Searching in the 21st,Century. John Wiley and Sons, Ltd. , ISBN-13: 978-0470027622
  6. C. Zhai and J. Lafferty. , 2004,A Study of Smoothing Methods for Language Models Applied to Information Retrieval. ACM Transactions on Information Systems (TOIS): 22(2)
  7. SALTON, G. , and C. BUCKLEY ,1990. "Improving Retrieval Performance by Relevance Feedback. " Journal of the American Society for Information Science, 41(4), 288-97.
  8. H. R. Turtleand,W. B. Croft. ,1991,Evaluationof aninferencenetwork-basedretrievalmodel. ACM Trans. Inf. Syst. ,9(3):187–222,July1991
  9. Metzler, D. and Croft, W. B. ,2004 "Combining the Language Model and Inference Network Approaches to Retrieval," Information Processing and Management Special Issue on Bayesian Networks and Information Retrieval, 40(5), 735-750, 2004
  10. Metzler, D. , and Manmatha, R. , ,2004"An Inference Network Approach to Image Retrieval," Proceedings of the International Conference on Image and Video Retrieval (CIVR 2004), 42-50, 2004
  11. Tie-Yan Liu (2009) "Learning to Rank for Information Retrieval", Foundations and Trends® in Information Retrieval: Vol. 3: No 3, pp 225-331. http:/dx. doi. org/10. 1561/1500000016
  12. Wang, L. , Lin, J. , and Metzler, D. ,2010,"Learning to Efficiently Rank," in the Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2010), 2010.
  13. Hang LI Oct 2011,A Short Introduction to Learning to Rank, Special Section on Information-Based Induction Sciences and Machine Learning, IEICE TRANS. INF. & SYST. , VOL. E94–D, NO. 10
  14. G. Salton,1989, Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison Wesley
  15. V. Bush, 1945,'As we may think', The Atlantic Monthly, vol. 176, no. 1, pp. 101-10,.
  16. C. N. Mooers,1950, 'The theory of digital handling of non-numerical information and its implications to machine economics', in Association for Computing Machinery Conference, Rutger University.
  17. C. Cortes, V. Vapnik, Support Vector Networks in Machine Learning ,1995,20, pp. 273-297
  18. Christopher J. C. Burges, 1998, "A Tutorial on Support Vector Machines for Pattern Recognition", Data Mining and Knowledge Discovery, 2, 121-167, Kluwer Academic Publishers
  19. Tristan Fletcher, March 1, 2009 "Support Vector Machines Explained ", ucl
  20. Dustin Boswell, August 6-2002, Introduction to Support Vector Machines,
  21. J. P. Lewis, Dec-2004, A Short Support Vector Machine Tutorial,CGIT Lab/IMSC,U. South California
  22. John C. Platt,1999, Fast Training of Support Vector Machines using Sequential Minimal Optimization,Advances in kernel methods ,Pages 185 - 208, MIT Press Cambridge, MA, USA ©1999,table of contents ISBN:0-262-19416-3
  23. M. IkonomakisS, S. Kotsiantis, V. Tampakas, August 2005, Text Classification Using Machine Learning Techniques, WSEAS Transactions on Computers, Issue 8, Volume 4, pp. 966-974
  24. James G. Shanahan,Norbert ,2003,Improving SVM Text Classification Performance through Threshold Adjustment ,ML:ECML 2003,Volume 2837,pp 361-372
  25. Prasanna Kothalkar,Priyanka Desai,March ,2012, mplementing Social Group Shopping using Support Vector Machines, IJCA Journal, icwet2012 - Number 1IJCA Proceedings on International Conference and workshop on Emerging Trends in Technology (ICWET 2012) icwet(1):35-40,
  26. B. G. Kermani, I. Kozlov, P. Melnyk, C. Zhao, J. Hachmann, D. Barker, and M. Lebl , 2007 ,Using Support Vector Machine Regression to Model the Retention of Peptides in Immobilized Metal-affinity Chromatography, ns Actuators B Chem. 2007 July 16; 125(1): 149–157.
Index Terms

Computer Science
Information Sciences

Keywords

Boolean retrieval Vector Space model BM25 Language models Inference networks Learning to Rank SVM