Weka 3.6 Tutorial: (Waikato Environment For Knowledge Analysis)
Weka 3.6 Tutorial: (Waikato Environment For Knowledge Analysis)
6 Tutorial
(Waikato Environment for Knowledge Analysis)
WEKA
• It’s a data mining/machine learning tool developed
by University of Waikato.
• Main Features:
– 49 data preprocessing tools
– 76 classification/regression algorithms
– 8 clustering algorithms
– 3 algorithms for finding association rules
– 15 attribute/subset evaluators + 10 search algorithms for
feature selection
Starting WEKA
• 4 Options
– Explorer
– Experimenter
– Knowledge Flow
– Simple CLI
Weka Simple CLI
Commands here
Preprocessing Data
• Data can be imported from a file in various
formats: ARFF, CSV etc.
• Data can also be read from a URL or from an
SQL database (using JDBC)
• Pre-processing tools in WEKA are called
“filters”
CSV (Comma Separated) File
Roll,Name,Percentage,Passed
1,ABC,72,y
2,abc,30.8,n
Result.csv
3,xyz,44.3,n
4,XYZ,52.3,y
@relation Result
Numeric
@data
1,ABC,72,y
2,abc,30.8,n weka.core.Instance
3,xyz,44.3,n
4,XYZ,52.3,y
Weka.Classifiers
• A simple example
Classifier: NaiveBayes