Top 10 Open Source Data Mining Tools: A Brief Look at Mining Tasks
Pre-processing: This covers the preliminary tasks that prepare data for any of the actual
mining tasks. It can include removing anomalies and noise from the data that is about to be
mined, filling in missing values, normalising the data, or compressing the data using
techniques like generalisation and aggregation.
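
Two of the pre-processing steps named above, filling in missing values and normalising, can be sketched in a few lines of Python. This is a minimal illustration only: the helper names `fill_missing` and `min_max_normalise` and the sample readings are hypothetical, and mean imputation and min-max scaling are just one possible choice for each step.

```python
from statistics import mean


def fill_missing(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def min_max_normalise(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


raw = [10.0, None, 30.0, 20.0]  # made-up readings with one missing value
filled = fill_missing(raw)
scaled = min_max_normalise(filled)
print(filled)
print(scaled)
```

Other imputation strategies (median, most frequent value) or scaling schemes (z-score) would slot into the same two steps.
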
Associative analysis brings out hidden relationships among data items in a large data set.
This can help predict the occurrence of a particular item in a transaction or an event
whenever some other item is present. You can think of this as a conditional probability: the
likelihood of seeing one item given that another is present.
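
The conditional-probability view can be made concrete by counting transactions. The sketch below estimates how often one item appears given that another is present, a quantity usually called the confidence of an association rule. The function names and the shopping baskets are invented for illustration and do not come from any particular tool.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(1 for t in transactions if items <= t) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate P(consequent | antecedent) from transaction counts."""
    base = support(transactions, antecedent)
    if base == 0:
        raise ValueError("antecedent never occurs in the transactions")
    both = support(transactions, set(antecedent) | set(consequent))
    return both / base


# Made-up shopping baskets, one set of items per transaction.
baskets = [
    {"bread", "butter"},
    {"bread", "butter", "milk"},
    {"bread", "milk"},
    {"milk"},
]

# Bread appears in 3 baskets and butter accompanies it in 2 of them.
rule_confidence = confidence(baskets, {"bread"}, {"butter"})
print(round(rule_confidence, 3))
```
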
Regression predicts the values of a dependent variable by constructing a model, or
mathematical function, from the independent variables.
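
As a small illustration, the sketch below fits a straight line to one independent variable using ordinary least squares and then uses it to predict a new value. The function name `fit_line` and the data points are assumptions made for the example. Real regression models often use several independent variables and other functional forms.

```python
from statistics import mean


def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Made-up observations that lie exactly on y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
predicted = slope * 5.0 + intercept  # predict the dependent variable at x = 5
print(slope, intercept, predicted)
```
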
Summarisation produces a compact description of the whole data set.
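
One simple form of compact description is a handful of descriptive statistics. The sketch below is only an example of that idea: the `summarise` helper and the sample values are hypothetical, and summarisation in practice can also mean rules, visualisations or aggregated reports.

```python
from statistics import mean, median


def summarise(values):
    """Describe a list of numbers with a few summary statistics."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        "median": median(values),
    }


summary = summarise([4, 8, 15, 16, 23, 42])  # made-up data
print(summary)
```
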
Data mining combines various techniques such as pattern recognition, statistics and machine
learning. There is a good amount of overlap between machine learning and data mining, since
the two go hand in hand and machine learning algorithms are used for mining data, but this
article is restricted to tools specialised for data mining.
https://www.softwaretestinghelp.com/data-mining-tools/
https://opensourceforu.com/2017/03/top-10-open-source-data-mining-tools/
Data Mining Tools
1. Sisense: Licensed
2. SSDT (SQL Server Data Tools): Licensed
3. Oracle Data Mining: Proprietary License
4. IBM Cognos: Proprietary License
5. IBM SPSS Modeler: Proprietary License
6. SAS Data Mining: Proprietary License
1. MOA
Massive Online Analysis (MOA)
2. KEEL
KEEL (Knowledge Extraction for Evolutionary Learning)