International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 176 - Number 24
Year of Publication: 2020
Authors: Pallavi Gupta, Nitin V. Tawar |
DOI: 10.5120/ijca2020920215
Pallavi Gupta, Nitin V. Tawar. The Impact and Importance of Statistics in Data Science. International Journal of Computer Applications. 176, 24 (May 2020), 10-14. DOI=10.5120/ijca2020920215
With massive amounts of data pouring in, data science has become one of the most challenging yet promising fields, tasked with handling this volume of data and extracting quality information for strategic business decisions. The path to data science begins with the collection of large amounts of data, which must be managed well enough to be processed and analyzed. Statistics plays a vital role at almost every step of data science, from molding data into the required format to the final presentation of results, making it easier to carry out operations on the data. In this paper, we demonstrate how important statistics is in providing the tools and methods needed to handle data and gain deep insights into it, and how useful it is for the quantification and analysis of data. We discuss various statistical tools and techniques used in data science, from measures of dispersion to advanced tools for visualizing results, in order to clarify the role and importance of statistical approaches in data processing and analysis.