Google News
logo
Data Science - Interview Questions
Please explain the role of data cleaning in data analysis.
Data cleaning can be a daunting task due to the fact that with the increase in the number of data sources, the time required for cleaning the data increases at an exponential rate.
 
This is due to the vast volume of data generated by additional sources. Also, data cleaning can solely take up to 80% of the total time required for carrying out a data analysis task.
 
Nevertheless, there are several reasons for using data cleaning in data analysis. Two of the most important ones are:
 
Cleaning data from different sources helps in transforming the data into a format that is easy to work with
Data cleaning increases the accuracy of a machine learning model
Advertisement