Data Mining Myths Data mining … provides instant solutions/predictions is not yet viable for business applications requires a separate, dedicated database can only be done by those with advanced degrees is only for large firms that have lots of customer data is another name for the good-old statistics
mining is a business process-a way of finding patterns in your data that provide insight you can use to conduct your business more effectively. Data mining also makes predictions to guide customer interactions and other business decisions
Myth #1: Data mining is all about algorithms Generally people form the impression that data mining is all about advanced data analysis algorithms. This misconception might be summarized as follows: "All you need for data mining is good algorithms. The better your algorithms, the better your data mining;
While data mining is not all about data analysis
algorithms, there is a part of data mining that is about algorithms. This raises the question, "How can you judge the quality of an algorithm?" You might think that the main criterion would be the predictive accuracy of the models it generates. This view, however, misrepresents the role of algorithms in the data mining process.
Myth #3: Data mining requires a data warehouse………...the reality…. It is true that data mining can benefit from warehoused data that is well organized, relatively clean, and easy to access This is particularly true if the warehouse has been constructed with data mining specifically in mind. However, the warehoused data may be less useful for data mining than the source or operational data. warehoused data may be completely useless if only summary data are stored.
Myth #4: Data mining is all about vast quantities of data Early explanations of data mining often began with statements like, "We now collect more data than ever, yet how are we to benefit from these vast data stores?" Focusing on the size of data stores provided a convenient introduction to the topic of data mining, but subtly misrepresented its nature.
Myth #4: Data mining is all about vast quantities of data…………..the reality… While there are many large datasets that organizations can benefit from mining, it would be a mistake to believe that these should be the sole focus of data mining. Many useful data mining projects are performed on small or medium-sized datasets-some, for example, containing only a few hundreds or thousands of records
Myth #5: Data mining should be done by a technology expert Data mining uses advanced technology, and its workings, particularly those of modeling techniques, are unlikely to be understood by the wider IT community. Does this mean that data mining should be conducted only by those who understand every nuance of the technology that is involved?