CFP last date
20 December 2024
Reseach Article

Automated Extractive Text Summarization using Genetic and Simulated Annealing Algorithms and their Hybridization

by Moheb R. Girgis, Marina Esam, Mamdouh M. Gomaa
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 185 - Number 33
Year of Publication: 2023
Authors: Moheb R. Girgis, Marina Esam, Mamdouh M. Gomaa
10.5120/ijca2023923105

Moheb R. Girgis, Marina Esam, Mamdouh M. Gomaa . Automated Extractive Text Summarization using Genetic and Simulated Annealing Algorithms and their Hybridization. International Journal of Computer Applications. 185, 33 ( Sep 2023), 34-43. DOI=10.5120/ijca2023923105

@article{ 10.5120/ijca2023923105,
author = { Moheb R. Girgis, Marina Esam, Mamdouh M. Gomaa },
title = { Automated Extractive Text Summarization using Genetic and Simulated Annealing Algorithms and their Hybridization },
journal = { International Journal of Computer Applications },
issue_date = { Sep 2023 },
volume = { 185 },
number = { 33 },
month = { Sep },
year = { 2023 },
issn = { 0975-8887 },
pages = { 34-43 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume185/number33/32904-2023923105/ },
doi = { 10.5120/ijca2023923105 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:27:41.986440+05:30
%A Moheb R. Girgis
%A Marina Esam
%A Mamdouh M. Gomaa
%T Automated Extractive Text Summarization using Genetic and Simulated Annealing Algorithms and their Hybridization
%J International Journal of Computer Applications
%@ 0975-8887
%V 185
%N 33
%P 34-43
%D 2023
%I Foundation of Computer Science (FCS), NY, USA
Abstract

With the growing world of information, the increase in on-line publishing, and prevalent access to the Internet, huge volume of electronic documents are currently available on-line. Automatic text summarization (ATS) has attracted great interest to assist users and computer systems to process vast amount of texts and extract relevant knowledge in a more efficient way. An ATS system can generate a summary of a document, i.e. short text that includes the main information in it. The aim of this work is to study the performance of ATS systems that utilize metaheuristic and heuristic algorithms in automated extractive text summarization. To this end, this paper proposes Genetic Algorithm (GA)-based, Simulated Annealing (SA)-based, and hybrid GA-SA-based methods for solving the single document summarization (SDS) problem. The objective of these methods is generating a high-quality summary that contains the main information of a given document. In these methods, to assess the quality of solutions (summaries) being generated, an objective function is used that will be maximized. This objective function is represented as a weighted sum that combines five features: sentence position, similarity with title, sentence length, cohesion, and coverage. The paper presents the results of the experiments that have been conducted to evaluate the quality of the summaries generated by the proposed SDS algorithms by applying them to sample articles from the CNN corpus, using co-occurrence statistical metrics (ROUGE metrics) and three content-based metrics (Fitness, Readability and Cohesion).

References
  1. Gambhir, M. and Gupta, V. 2017 Recent automatic text summarization techniques: a survey. Artificial Intelligence Review 47, 1–66.
  2. Widyassari, A. P., Rustad, S., Shidik, G. F., Noersasongko, E., Syukur ,A., Affandy, A., Setiadi, D. I. M. 2022 Review of automatic text summarization techniques & methods. Journal of King Saud University – Computer and Information Sciences 34, 1029–1046.
  3. Verma, P., Om, H. 2019 MCRMR: Maximum coverage and relevancy with minimal redundancy based multi-document summarization. Expert Systems with Applications 120 (5 April 2019), 43-56.
  4. Nikoo, M. D., Faraahi, A., Hashemi, S. M., and Erfani, S. H. 2012 A method for text summarization by bacterial foraging optimisation algorithm. IJCSI Int. J. Comput. Sci. 9 (4), 36–40.
  5. Sarkar, K. 2013 Automatic single document text summarization using key concepts in documents. Journal of Information Processing Systems (JIPS) 9, 602-620.
  6. Mendoza, M., Bonilla, S., Noguera, C., Cobos, C., and Leon, E. 2014 Extractive single-document summarization based on genetic operators and guided local search. Expert Systems with Applications 41, 4158–4169.
  7. Mahdipour, E. and Bagheri, M. 2014 Automatic Persian Text Summarizer Using Simulated Annealing and Genetic Algorithm. International Journal of Intelligent Information Systems. Special Issue: Research and Practices in Information Systems and Technologies in Developing Countries 3 (6-1), 84-90.
  8. Kikuchi, Y., Hirao, T., Takamura, H., Okumura, M., and Nagata, M. 2014 Single document summarization based on nested tree structure. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics 2 (June 23-25), 315–320, Baltimore, Maryland, USA.
  9. Asgari, H., Masoumi, B., and Sheijani, O. S. 2014 Automatic text summarization based on multi-agent particle swarm optimization. In Proceedings of Iranian Conference on Intelligent Systems (ICIS). IEEE, 1–5.
  10. Mirshojaei, S. H. and Masoomi, B. 2015 Text summarization using cuckoo search optimization algorithm. J. Comput. Robot. 8 (2), 19–24.
  11. Parveen, D. and Strube, M. 2015 Integrating importance, non-redundancy and coherence in graph-based extractive summarization. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015). AAAI Press, 1298–1304.
  12. Hassan, O. F. 2015 Text Summarization using Ant Colony Optimization Algorithm. Doctoral dissertation. Sudan University of Science and Technology.
  13. Christian, H., Agus, M. P., and Suhartono, D. 2016 Summarization using term frequency inverse document frequency (TF-IDF). ComTech, 7 (4), 285-294.
  14. Sinha, A., Yadav, A.,  and Gahlot, A. 2018 Extractive Text Summarization using Neural Networks. arXiv:1802.10137 [cs.CL].
  15. Alguliyev, R. M., Aliguliyev, R. M., Isazade, N. R., Abdi, A., and Idris, N. 2019 COSUM: Text summarization based on clustering and optimization. Expert Systems, 36(1), 1–17.
  16. Xu, J. and Durrett, G. 2019 Neural Extractive Text Summarization with Syntactic Compression. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 3292–3303, Hong Kong, China. Association for Computational Linguistics.
  17. Hernández-Castañeda, Á., García-Hernández, R. A., Ledeneva, Y. and Millán-Hernández, C. E. 2020 Extractive Automatic Text Summarization Based on Lexical-Semantic Keywords, IEEE ACCESS 8, 49896-49907.
  18. El-Kassas, W. S., Salama, C. R., Rafea, A. A., and Mohamed, H. K. 2020 EdgeSumm: Graph-based framework for automatic text summarization. Information Processing and Management 57(6), 102264.
  19. Heidary, E., Parvïn, H., Nejatian, S., Bagherifard, K., Rezaie, V., Mansor, Z., and Pho, K. 2021 Automatic Text Summarization Using Genetic Algorithm and Repetitive Patterns, Computers, Materials & Continua (CMC) 67 (1), 1085-1101.
  20. Belwal, R. C., Rai, S., and Gupta, A. 2021 A new graph-based extractive text summarization using keywords or topic modeling. Journal of Ambient Intelligence and Humanized Computing 12(10), 8975–8990.
  21. He, J., Kryscinski, W., McCann, B., Rajani, N., and Xiong, C. 2022 CTRLsum: Towards generic controllable text summarization. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (December 2022), Abu Dhabi, United Arab Emirates, 5879—5915.
  22. Anand, D.  and Wagh, R.  2022 Effective deep learning approaches for summarization of legal texts. Journal of King Saud University - Computer and Information Sciences 34 (5), 2141-2150.
  23. Manning, C. D., Raghavan, P., and Schtze, H. 2008 Introduction to information retrieval. Cambridge University Press.
  24. Bossard, A., Genereux, M., and Poibeau, T. 2008 Description of the LIPN systems at TAC 2008: Summarizing information and opinions. In Proceedings of Notebook papers and results, text analysis conference (TAC-2008).
  25. Shareghi, E. and Hassanabadi, L. S. 2008 Text summarization with harmony search algorithm-based sentence extraction. In Proceedings of the 5th international conference on soft computing as transdisciplinary science and technology, 226–231. Cergy-Pontoise, France: ACM.
  26. Gupta, V., Chauhan, P., and Garg, S. 2012 An Statistical Tool for Multi-Document Summarization. International Journal of Scientific and Research Publications 2 (5).
  27. NLTK, https://www.tutorialspoint.com/natural_ language_toolkit/index.htm, 20/12/2021
  28. Goldberg, D. E. 1989 Genetic Algorithms in Search, Optimization, and Machine Learning, Reading, MA: Addison-Wesley.
  29. Metropolis, N., Rosenbluth, Rosenbluth, A. M., Teller, A., Teller, E. 1953 Equation of State Calculations by Fast Computing Machines. Journal of Chemical Physics 21 (6), 1087–1092.
  30. Lins, R. D., Simske, S. J., de Souza Cabral, L., de França Silva, G., Lima, R., Mello, R. F., and Favaro, L. 2012 A multi-tool scheme for summarizing textual documents. In Proceedings of the 11st IADIS international conference www/internet 2012, 409–414.
  31. Lin C. 2004 ROUGE: A package for automatic evaluation of summaries. Annual Meeting of the Association for Computational Linguistics (25 July 2004).
Index Terms

Computer Science
Information Sciences

Keywords

Single-Document Summarization Automatic Text Summarization Extractive Summarization Genetic Algorithm Simulated Annealing ROUGE Readability Cohesion CNN Corpus.