Data Understanding in Analytics
a hypothesis, and then testing it using statistical methods. When there are hundreds or thousands of variables, you first need to understand the data set. Without an initial understanding of the data, it is impossible to determine where to focus an investigation. One major exploration technique is visualization: in many cases, a simple graph conveys considerable information about the data. With visualization and data summaries, you can gain real insight into the variables and identify the issues that need additional exploration. Data mining should always begin with a good look at the data and an investigation of its properties. Without such an understanding, it is difficult to take the next step of investigating patterns and relationships in the data.
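As a rough illustration of a first look at a data set, the sketch below computes a per-variable summary (counts, missing values, basic statistics) and draws a crude text histogram for one variable. It is a minimal sketch using only the Python standard library. The function names `summarize` and `text_histogram` and the toy records are hypothetical, chosen for illustration, and in practice a plotting library would likely replace the text histogram.

```python
import statistics


def summarize(rows):
    """Return a per-variable summary for a list of dict records.

    Numeric variables get min, max, mean, and median; all other
    variables get a count of distinct values. None marks a missing value.
    """
    columns = sorted({key for row in rows for key in row})
    summary = {}
    for col in columns:
        values = [row.get(col) for row in rows]
        present = [v for v in values if v is not None]
        info = {"count": len(present), "missing": len(values) - len(present)}
        numeric = [
            v for v in present
            if isinstance(v, (int, float)) and not isinstance(v, bool)
        ]
        if present and len(numeric) == len(present):
            info.update(
                min=min(numeric),
                max=max(numeric),
                mean=statistics.mean(numeric),
                median=statistics.median(numeric),
            )
        else:
            info["distinct"] = len(set(present))
        summary[col] = info
    return summary


def text_histogram(values, bins=5, width=30):
    """Return text lines forming a crude equal-width histogram."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero when all values are equal
    counts = [0] * bins
    for v in values:
        idx = min(int((v - lo) / span * bins), bins - 1)
        counts[idx] += 1
    peak = max(counts)
    lines = []
    for i, c in enumerate(counts):
        left = lo + span * i / bins  # left edge of this bin
        bar = "#" * round(c / peak * width)
        lines.append(f"{left:8.2f} | {bar} {c}")
    return lines


# Hypothetical toy records, invented for this illustration only.
records = [
    {"age": 23, "income": 31000, "region": "north"},
    {"age": 35, "income": 52000, "region": "south"},
    {"age": 41, "income": None, "region": "north"},
    {"age": 29, "income": 47000, "region": "east"},
    {"age": 62, "income": 58000, "region": "south"},
]

summary = summarize(records)
for name, info in summary.items():
    print(name, info)
print("\n".join(text_histogram([r["age"] for r in records])))
```

Even on this small example, the summary surfaces the kind of issue that calls for further exploration: one `income` value is missing, and the `age` histogram shows a single record sitting well apart from the rest.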