Data Understanding in Analytics

Data mining techniques focus on pattern recognition while statistical methods focus on hypothesis testing. To understand large datasets, initial visualization techniques are important to gain insight before further exploration. Data mining should always begin with understanding the properties of the data through visualization and summaries in order to determine where to focus investigation of patterns and relationships. Basic issues in data understanding include acquiring, integrating, describing, and assessing data quality.

Uploaded by

esjai

Data mining techniques tend to focus on estimation and pattern recognition; statistical methods tend to focus on inference, that is, formulating a hypothesis and then testing it. When a data set has hundreds or thousands of variables, you first need to understand it. Without an initial understanding of the data, it is impossible to determine where to focus an investigation. One major estimation technique is visualization. In many cases, a simple graph conveys considerable information about the data. With visualization and data summaries, you can gain real insight into the variables and identify the issues that need additional exploration. Data mining should always begin with a good look at the data and an investigation of their properties. Without such an understanding, it is difficult to take the next step and investigate patterns and relationships in the data.
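As a rough illustration of the data summaries and simple graphs described above, the following Python sketch computes basic statistics for one numeric variable and draws a crude text histogram. The variable (`ages`), its values, and the helper names `summarize` and `text_histogram` are all made up for this example; they do not come from the original text, and a real project would more likely use a plotting or data-frame library.

```python
import statistics
from collections import Counter


def summarize(values):
    """Return basic summary statistics for a list of numbers."""
    ordered = sorted(values)
    return {
        "count": len(ordered),
        "min": ordered[0],
        "max": ordered[-1],
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        # Sample standard deviation needs at least two values.
        "stdev": statistics.stdev(ordered) if len(ordered) > 1 else 0.0,
    }


def text_histogram(values, bin_width):
    """Build a crude text histogram: one row per bin, one '#' per value."""
    bins = Counter((v // bin_width) * bin_width for v in values)
    return [
        f"{start:>4}-{start + bin_width - 1:<4} {'#' * bins[start]}"
        for start in sorted(bins)
    ]


# Hypothetical sample: ages of ten customers (made-up values).
ages = [23, 25, 31, 35, 35, 38, 41, 44, 52, 67]
summary = summarize(ages)
histogram = text_histogram(ages, bin_width=10)

print(summary)
print("\n".join(histogram))
```

Even this small summary shows where most values sit and flags the one unusually high value, which is the kind of issue that would merit additional exploration.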

Basic Issues That Must Be Resolved in Data Understanding


Following are some of the basic issues you will encounter in the pursuit of an understanding of your data, and the activity associated with each:

How do I find the data I need for modeling? (Data Acquisition)
How do I integrate data I find in multiple disparate data sources? (Data Integration)
What do the data look like? (Data Description)
How clean is the data set? (Data Assessment)
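To make the integration and assessment activities more concrete, here is a minimal Python sketch that joins two sources on a shared key and then counts missing values and duplicate records. Everything in it is an assumption made for illustration: the two sources (`crm_rows`, `billing_rows`), their fields and values, and the helper names `integrate` and `assess`. It shows one simple approach, not a prescribed method.

```python
# Two hypothetical sources keyed by customer id (made-up records).
crm_rows = [
    {"id": 1, "name": "Ana", "age": 34},
    {"id": 2, "name": "Ben", "age": None},
    {"id": 2, "name": "Ben", "age": None},  # duplicate record
    {"id": 3, "name": "Cho", "age": 51},
]
billing_rows = [
    {"id": 1, "balance": 120.0},
    {"id": 3, "balance": None},
    {"id": 4, "balance": 75.5},
]


def integrate(left, right, key):
    """Left join: attach fields from `right` to each row of `left` by `key`."""
    lookup = {row[key]: row for row in right}
    return [{**row, **lookup.get(row[key], {})} for row in left]


def assess(rows):
    """Count missing values per field and exact duplicate rows."""
    fields = sorted({field for row in rows for field in row})
    # A field counts as missing when it is absent or explicitly None.
    missing = {
        field: sum(1 for row in rows if row.get(field) is None)
        for field in fields
    }
    unique = {tuple(sorted(row.items())) for row in rows}
    return {
        "rows": len(rows),
        "missing": missing,
        "duplicates": len(rows) - len(unique),
    }


merged = integrate(crm_rows, billing_rows, key="id")
report = assess(merged)
print(report)
```

A left join keeps every record from the first source, so customers without a billing record show up as missing balances rather than silently disappearing, which makes the gap visible during assessment.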
