Four General Phases of Data Mining
Data mining can be divided into four general phases: data preparation, data analysis and classification, knowledge acquisition, and prognosis. We can represent the data mining phases as follows.
Data sources: operational database, data warehouse
--Data preparation phase: identify data set, clean data set, integrate data set
--Data analysis and classification phase: classification analysis, clustering and sequence analysis, link analysis, trend and deviation analysis
--Knowledge acquisition phase: select and apply algorithms (neural nets, inductive logic, decision trees, classification and regression tree, nearest neighbor, visualization, etc.)
--Prognosis phase: prediction, forecasting, modeling
We can study each phase of data mining in more detail as follows.
--Data preparation phase: In the data preparation phase, the main data sets to be used by the data mining operation are identified and cleansed of any data impurities. The data warehouse is usually the target set for data mining operations, because its data are already integrated and filtered.
--Data analysis and classification phase: This phase studies the data to identify common data characteristics or patterns. The data mining tool applies specific algorithms to find data groupings, classifications, clusters, or sequences; data dependencies, links, or relationships; and data patterns, trends, and deviations.
--Knowledge acquisition phase: The knowledge acquisition phase uses the results of the data analysis and classification phase. During this phase, the data mining tool selects the appropriate modeling or knowledge acquisition algorithms.
--Prognosis phase: Many data mining tools stop at the knowledge acquisition phase, but some continue to the prognosis phase, in which the data mining findings are used to predict future behavior and forecast business outcomes.
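To make the four phases concrete, here is a minimal Python sketch that walks a tiny data set through each of them. Everything in it is an assumption made for illustration: the customer records, the segment labels, and the function names (`prepare`, `analyze`, `acquire_model`, `prognose`) are hypothetical, and the model is a nearest-centroid rule, a simple relative of the nearest neighbor method named above, chosen only because it fits in a few lines. A real data mining tool would work on a data warehouse and offer far richer algorithms.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical raw records: (monthly_visits, avg_purchase, segment label).
# None marks a missing value; one row is an exact duplicate.
RAW_RECORDS = [
    (2, 15.0, "occasional"),
    (3, 20.0, "occasional"),
    (3, 20.0, "occasional"),   # duplicate row
    (12, 80.0, "frequent"),
    (10, None, "frequent"),    # missing value
    (14, 95.0, "frequent"),
]


def prepare(records):
    """Data preparation: drop rows with missing values and exact duplicates."""
    seen = set()
    cleaned = []
    for row in records:
        if None in row or row in seen:
            continue
        seen.add(row)
        cleaned.append(row)
    return cleaned


def analyze(records):
    """Data analysis and classification: group rows by label and
    average each feature to get one profile (centroid) per group."""
    groups = defaultdict(list)
    for *features, label in records:
        groups[label].append(features)
    return {
        label: tuple(mean(column) for column in zip(*rows))
        for label, rows in groups.items()
    }


def acquire_model(centroids):
    """Knowledge acquisition: turn the group profiles into a
    nearest-centroid classification rule."""
    def classify(features):
        def squared_distance(label):
            return sum((a - b) ** 2 for a, b in zip(features, centroids[label]))
        return min(centroids, key=squared_distance)
    return classify


def prognose(model, new_cases):
    """Prognosis: apply the model to unseen cases to predict their segment."""
    return [model(case) for case in new_cases]


cleaned = prepare(RAW_RECORDS)
centroids = analyze(cleaned)
model = acquire_model(centroids)
predictions = prognose(model, [(4, 25.0), (11, 70.0)])
print(predictions)
```

Each function consumes the output of the previous one, which mirrors how the knowledge acquisition phase builds on the results of the analysis phase and how prognosis is an optional final step that some tools omit.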