CFP last date
20 February 2025
Reseach Article

An Empirical Analysis of Different Big Data-based AI Integrated Tools in Multidisciplinary Fields

by Le Trung Min, Sharmila Mathivanan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 62
Year of Publication: 2025
Authors: Le Trung Min, Sharmila Mathivanan
10.5120/ijca2025924430

Le Trung Min, Sharmila Mathivanan . An Empirical Analysis of Different Big Data-based AI Integrated Tools in Multidisciplinary Fields. International Journal of Computer Applications. 186, 62 ( Jan 2025), 20-33. DOI=10.5120/ijca2025924430

@article{ 10.5120/ijca2025924430,
author = { Le Trung Min, Sharmila Mathivanan },
title = { An Empirical Analysis of Different Big Data-based AI Integrated Tools in Multidisciplinary Fields },
journal = { International Journal of Computer Applications },
issue_date = { Jan 2025 },
volume = { 186 },
number = { 62 },
month = { Jan },
year = { 2025 },
issn = { 0975-8887 },
pages = { 20-33 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number62/an-empirical-analysis-of-different-big-data-based-ai-integrated-tools-in-multidisciplinary-fields/ },
doi = { 10.5120/ijca2025924430 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-01-28T19:07:12+05:30
%A Le Trung Min
%A Sharmila Mathivanan
%T An Empirical Analysis of Different Big Data-based AI Integrated Tools in Multidisciplinary Fields
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 62
%P 20-33
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The exponential data growth in today's world necessitates efficient and intelligent data management solutions. Machine learning has emerged as a key technology for addressing the challenges posed by big data, offering the potential to automate tasks, optimize processes, and extract valuable insights from massive datasets. This research explores the role of machine learning in data management across various fields, examining its applications, benefits, and potential drawbacks. The study also delves into the ethical considerations surrounding AI adoption, such as bias, fairness, and transparency. A comparative analysis of five prominent AI-powered data management tools is conducted, evaluating their performance, scalability, and resource utilization. The findings provide insights into the strengths and weaknesses of each tool, aiding in informed decision-making for organizations seeking to leverage AI for efficient and responsible data management in the era of big data.

References
  1. M. A. Khan and A. Sharma. 2023. Deep Overview of Virtualization Technologies Environment and Cloud Security. In Proceedings of the 2023 2nd International Conference for Innovation in Technology (INOCON). IEEE, Bangalore, India, 1-6. https://doi.org/10.1109/INOCON57975.2023.10101349
  2. Mohd Amaan Khan and Ranjan Walia. 2024. Intelligent Data Management in Cloud Using AI. In Proceedings of the 2024 3rd International Conference for Innovation in Technology (INOCON). IEEE, Bangalore, India, 1-6. https://doi.org/10.1109/INOCON60754.2024.10511932
  3. Ru Jia, Yun Yang, John Grundy, Jacky Keung and Hao Li. 2019. A Highly Efficient Data Locality Aware Task Scheduler for Cloud-Based Systems. In Proceedings of the 2019 IEEE 12th International Conference on Cloud Computing (CLOUD). IEEE, Milan, Italy, 496-498. https://doi.org/10.1109/CLOUD.2019.00089
  4. Santosh Gopalkrishnan, Ann Reddipogu. 2023. Exploring Artificial Intelligence (AI) Impact on Businesses: Perspectives from Big Data and Security. In Proceedings of the 2023 International Conference On Cyber Management And Engineering (CyMaEn). IEEE, Bangkok, Thailand, 12-17. https://doi.org/10.1109/CyMaEn57228.2023.10051065
  5. Jaideep Visave. 2024. AI in Emergency Management: Ethical Considerations and Challenges. Journal of Emergency Management and Disaster Communications 05 (01): 165–83. https://doi.org/10.1142/S268998092450009X
  6. Visave, Jaideep. 2024. AI in Emergency Management: Ethical Considerations and Challenges. Journal of Emergency Management and Disaster Communications 05, 01 (May 2024), 168-183. https://doi.org/10.1142/S268998092450009X
  7. Ahmad Chaddad, Qizong Lu, Jiali Li, Yousef Katib, Reem Kateb, Camel Tanougast. 2023. Explainable, Domain-Adaptive, and Federated Artificial Intelligence in Medicine. IEEE 10, 4 (April 2023), 859-876. https://doi.org/10.1109/JAS.2023.123123
  8. Sanjeev Kumar Marimekala, John Lamb, Robert Epstein, Vasundhara Bhupathi. 2024. Using AI and Big Data in the HealthCare Sector to help build a Smarter and more Intelligent HealthCare System. In Proceedings of the 2024 IEEE World AI IoT Congress (AIIoT). IEEE, Seattle, WA, USA, 356-362. https://doi.org/10.1109/AIIoT61789.2024.10578989
  9. Limata, S. 2024. AI: Balancing Revolutionary Potential with Overhyped Expectations and Dubious Claims. Retrieved October 15, 2024 from https://dlglearningcenter.com/ai-balancing-revolutionary-potential-with-overhyped-expectations-and-dubious-claims/
  10. Jiafu Wan, Xiaomin Li, Hong-Ning Dai, Andrew Kusiak, Miguel Martínez-García and Di Li. 2020. Artificial-Intelligence-Driven Customized Manufacturing Factory: Key Technologies, Applications, and Challenges. IEEE 109, 4 (April 2021), 377 – 398. https://doi.org/10.1109/JPROC.2020.3034808
  11. Nana Yang. 2020. AI Assisted Internet Finance Intelligent Risk Control System Based on Reptile Data Mining and Fuzzy Clustering. In Proceedings of the 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC). IEEE, Palladam, India, 533-536. https://doi.org/10.1109/I-SMAC49090.2020.9243608
  12. MarkAxis. 2024. Candidsky strengthens SEO team with three new hires. Retrieved October 18, 2024 from https://www.markaxis.com/candidsky-strengthens-seo-team-with-three-new-hires/
  13. Takyar, A. 2023. AI in education: Use cases, solution and implementation. Retrieved October 18, 2024 from https://www.leewayhertz.com/ai-use-cases-in-education
  14. Rihaab Mowlana. 2023. Artificial Intelligence From Innovation to Ethical Dilemmas. Retrieved October 18, 2024 from https://www.dailymirror.lk/print/life/Artificial-Intelligence-From-Innovation-to-Ethical-Dilemmas/243-260928
  15. Romila Pradhan, Aditya Lahiri, Sainyam Galhotra, Babak Salimi. 2022. Explainable AI: Foundations, Applications, Opportunities for Data Management Research. In Proceedings of the 2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, Kuala Lumpur, Malaysia, 3209 – 3212. https://doi.org/10.1109/ICDE53745.2022.00300
  16. Dibyendu Datta. 2024. What Is Azure Data Factory? How It Works and Use Cases. Retrieved October 17, 2024 from https://www.cdata.com/blog/what-is-azure-data-factory
  17. Kettner, B. and Geisler, F. 2022. Azure Data Factory in Pro Serverless Data Handling with Microsoft Azure. Berkeley, CA.
  18. Pathipati, V. 2024. Simplifying data ingestion with azure blob storage. Retrieved October 18, 2024 from https://www.linkedin.com/pulse/simplifying-data-ingestion-azure-blob-storage-venkatesh-pathipati--ftxec
  19. Stedman, C. 2024. What is Data Preparation? An In-Depth Guide, Business Analytics. Retrieved October 18, 2024 from https://www.techtarget.com/searchbusinessanalytics/definition/data-preparation
  20. Macintyre, F. and McGuire, J. 2024. Machine learning app development, Pulsion Technology. Retrieved October 18, 2024 from https://www.pulsion.co.uk/machine-learning-app-development
  21. Seema Yelne, Minakshi Chaudhary, Karishma Dod, Akhtaribano Sayyad, Ranjana Sharma. 2023. Harnessing the power of AI: A comprehensive review of its impact and challenges in nursing science and healthcare. Cureus 15, 11 (November 2023), e49252. https://doi.org/10.7759/cureus.49252
  22. Amazon.com. No date. AWS Glue DataBrew, Retrieved October 18, 2024 from https://aws.amazon.com/glue/features/databrew
  23. Amazon.com. No date. What is AWS Glue DataBrew?, Retrieved October 18, 2024 from https://docs.aws.amazon.com/databrew/latest/dg/what-is.html
  24. Rezvan, H. 2024. From idea to execution: Embarking on data science projects. Retrieved October 18, 2024 from https://www.linkedin.com/pulse/from-idea-execution-embarking-data-science-projects-rezvan-heydari-ugsfe
  25. Alteryx. 2023. Data enrichment. Retrieved October 18, 2024 from https://www.alteryx.com/glossary/data-enrichment
  26. Amazon.com. No date. Using crawlers to populate the Data Catalog. Retrieved October 18, 2024 from https://docs.aws.amazon.com/glue/latest/dg/add-crawler.html
  27. Daniel Rozo and Maurits de Groot. 2021. Enrich datasets for descriptive analytics with AWS Glue DataBrew. Retrieved October 18, 2024 from https://aws.amazon.com/blogs/big-data/enrich-datasets-for-descriptive-analytics-with-aws-glue-databrew/
  28. Google Cloud. No date. From data warehouse to a unified, AI-ready data platform. Retrieved October 18, 2024 from https://cloud.google.com/bigquery
  29. Gupta, D. 2021. Google BigQuery: An introduction to big data analytics platform. Retrieved October 18, 2024 from https://blog.knoldus.com/google-bigquery-an-introduction-to-big-data-analytics-platform/
  30. Google Cloud. No date. Introduction to AI and ML in BigQuery. Retrieved October 18, 2024 from https://cloud.google.com/bigquery/docs/bqml-introduction
  31. Awati, R. 2021. What are Lossless and Lossy Compression?. Retrieved October 18, 2024 from https://www.techtarget.com/whatis/definition/lossless-and-lossy-compression
  32. Chand, M. 2024. Unlocking the power of big data with Google BigQuery. Retrieved October 18, 2024 from https://medium.com/@mehar.chand.cloud/unlocking-the-power-of-big-data-with-google-bigquery-fd6c3a9f2ca6
  33. Google Cloud. No date. Create machine learning models in BigQuery ML. Retrieved October 18, 2024 from https://cloud.google.com/bigquery/docs/create-machine-learning-model
  34. Talend Data Fabric. 2021. Talend - A Leader in Data Integration & Data Integrity. Retrieved October 18, 2024 from https://www.talend.com/products/data-fabric/
  35. Ashwani, K. 2023. What is Talend Data Fabric and use cases of Talend Data Fabric?. Retrieved October 18, 2024 from https://www.devopsschool.com/blog/what-is-talend-data-fabric-and-use-cases-of-talend-data-fabric/
  36. Talend Data Integration. 2021. Talend - A Leader in Data Integration & Data Integrity. Retrieved October 18, 2024 from https://www.talend.com/products/integrate-data
  37. Sheldon, R. and Stedman, C. 2024. Data quality, Data Management. TechTarget. Retrieved October 18, 2024 from https://www.techtarget.com/searchdatamanagement/definition/data-quality
  38. Talend Team. 2020. Revealing the Intelligence in your Data with Talend Winter’20. Retrieved October 18, 2024 from https://www.talend.com/blog/revealing-the-intelligence-in-your-data-with-talend-winter20-part-1
  39. Pratibha, K.J. 2024. Empowering Data Governance with AI & ML: Automation, Efficiency, and advanced technologies. Retrieved October 18, 2024 from https://www.linkedin.com/pulse/empowering-data-governance-ai-ml-automation-efficiency-jha-m41wc
  40. Zarikar, S. 2024. Unlocking the power of data with AI data catalogs: The future of metadata management. Retrieved October 18, 2024 from https://www.linkedin.com/pulse/unlocking-power-data-ai-catalogs-future-metadata-sunil-zarikar-4xhkc
  41. Restack.io. 2024. H2O open source AI platform. Retrieved October 18, 2024 from https://www.restack.io/p/h2o-open-source-ai-answer-no-code-ai-development-cat-ai
  42. H2o.ai. No date. Product Brief H2O MLOps. Retrieved October 18, 2024 from https://h2o.ai/resources/product-brief/h2o-mlops
  43. Adlibsoftware.com. 2024. Leading AI experts advice on data preparation for AI deployment. Retrieved October 18, 2024 from https://www.adlibsoftware.com/news/leading-ai-experts-advice-on-data-preparation-for-ai-deployment
  44. Mailchimp. No date. AI transparency: Building trust in AI. Retrieved October 18, 2024 from https://mailchimp.com/resources/ai-transparency
Index Terms

Computer Science
Information Sciences

Keywords

Data Management Big Data Artificial Intelligence (AI) Data Integration Multidisciplinary Field Cloud Computing Ethical Considerations.