Introduction To Weka

Weka is a collection of machine learning algorithms for data mining tasks. It contains tools for data preprocessing, classification, regression, clustering, association rules, and visualization. Datasets are stored in ARFF files which describe the attributes and contain the data values. Weka contains many classifiers like J48 decision trees and Naive Bayes. It also has filters for preprocessing data and tools like the Explorer for classification and clustering and Experimenter for running multiple experiments.

Uploaded by

sandyguru05

Introduction to Weka

Overview


What is Weka?

Where to find Weka?

Command Line Vs GUI

Datasets in Weka

ARFF Files

Classifiers in Weka

Filters
What is Weka?


Weka is a collection of machine learning
algorithms for data mining tasks. The
algorithms can either be applied directly to a
dataset or called from your own Java code.
Weka contains tools for data pre-processing,
classification, regression, clustering,
association rules, and visualization. It is also
well-suited for developing new machine
learning schemes.
Where to find Weka


Weka website (Latest version 3.6):
– http://www.cs.waikato.ac.nz/ml/weka/


Weka Manual:
− http://transact.dl.sourceforge.net/sourceforge/weka/WekaManual-3.6.0.pdf
CLI Vs GUI


Command line (CLI):
− Recommended for in-depth usage
− Offers some functionality not available via the GUI

GUI:
− Explorer
− Experimenter
− Knowledge Flow
Datasets in Weka


Each entry in a dataset is an instance of the
Java class:
− weka.core.Instance

Each instance consists of a number of
attributes
Attributes


Nominal: one of a predefined list of values
− e.g. red, green, blue

Numeric: A real or integer number

String: Enclosed in “double quotes”

Date

Relational
ARFF Files


The external representation of an Instances
class

Consists of:
− A header: Describes the attribute types
− Data section: Comma separated list of data
ARFF File Example

[Figure: annotated ARFF file with callouts for the dataset name, a comment, the attributes, the target / class variable, and the data values.]
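
The header and data sections described above can be illustrated with a small sketch. The ARFF text below is a made-up fragment modelled on the weather data used later in these slides, and the reader is a hand-rolled illustration (the class and method names are hypothetical), not Weka's own loader. Treating the last attribute as the class is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: a tiny hand-rolled reader, not Weka's ARFF loader.
class ArffSketch {
    // A small made-up ARFF document: comment, relation name, header, data.
    static final String ARFF = String.join("\n",
        "% Comment lines start with a percent sign",
        "@relation weather",
        "",
        "@attribute outlook {sunny, overcast, rainy}",
        "@attribute temperature numeric",
        "@attribute windy {TRUE, FALSE}",
        "@attribute play {yes, no}",
        "",
        "@data",
        "sunny,85,FALSE,no",
        "overcast,83,FALSE,yes",
        "rainy,65,TRUE,no");

    // Names of the attributes declared in the header, in order.
    static List<String> attributeNames(String arff) {
        List<String> names = new ArrayList<>();
        for (String line : arff.split("\n")) {
            String t = line.trim();
            if (t.toLowerCase().startsWith("@attribute")) {
                names.add(t.split("\\s+")[1]);
            }
        }
        return names;
    }

    // Rows of the data section, each split on commas.
    static List<String[]> dataRows(String arff) {
        List<String[]> rows = new ArrayList<>();
        boolean inData = false;
        for (String line : arff.split("\n")) {
            String t = line.trim();
            if (t.isEmpty() || t.startsWith("%")) continue;
            if (t.equalsIgnoreCase("@data")) { inData = true; continue; }
            if (inData) rows.add(t.split(","));
        }
        return rows;
    }

    public static void main(String[] args) {
        List<String> names = attributeNames(ARFF);
        System.out.println("Attributes: " + names);
        // Assumption of this sketch: the last attribute is the class.
        System.out.println("Class attribute: " + names.get(names.size() - 1));
        System.out.println("Instances: " + dataRows(ARFF).size());
    }
}
```

The sketch ignores quoting, sparse data, and date or relational attributes, which a real ARFF reader would have to handle.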
Assignment ARFF Files


Credit-g

Heart-c

Hepatitis

Vowel

Zoo


http://www.cs.auckland.ac.nz/~pat/weka/
ARFF Files


Basic statistics and validation by running:
− java weka.core.Instances data/soybean.arff
Classifiers in Weka

Learning algorithms in Weka are derived from
the abstract class:
− weka.classifiers.Classifier

Simple classifier: ZeroR
− Just determines the most common class
− Or the mean (in the case of a numeric class)
− Tests how well the class can be predicted without considering other attributes
− Can be used as a lower bound on performance
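
The baseline idea behind ZeroR can be sketched in plain Java for a nominal class. This is an illustration only: it does not use Weka's classes, the class and method names are hypothetical, and the label array is made up for the example.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Plain-Java sketch of the ZeroR idea; not Weka's implementation.
class ZeroRSketch {
    // Returns the most frequent label (the first one seen wins on ties).
    static String majorityClass(String[] labels) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String label : labels) counts.merge(label, 1, Integer::sum);
        String best = null;
        int bestCount = -1;
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            if (e.getValue() > bestCount) {
                best = e.getKey();
                bestCount = e.getValue();
            }
        }
        return best;
    }

    // Accuracy on the given labels when always predicting the majority class.
    static double baselineAccuracy(String[] labels) {
        String majority = majorityClass(labels);
        int hits = 0;
        for (String label : labels) if (label.equals(majority)) hits++;
        return (double) hits / labels.length;
    }

    public static void main(String[] args) {
        // Hypothetical class column with 9 "yes" and 5 "no" values.
        String[] play = {"no", "no", "yes", "yes", "yes", "no", "yes",
                         "no", "yes", "yes", "yes", "yes", "yes", "no"};
        System.out.println("Majority class: " + majorityClass(play));
        System.out.println("Baseline accuracy: " + baselineAccuracy(play));
    }
}
```

Any classifier that looks at the other attributes should beat this accuracy; if it does not, the attributes are probably not helping.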
Classifiers in Weka


Simple Classifier Example
− java weka.classifiers.rules.ZeroR -t data/weather.arff
− java weka.classifiers.trees.J48 -t data/weather.arff

Help Command
− java weka.classifiers.trees.J48 -h
Classifiers in Weka


Soybean.arff split into train and test set
– soybean-train.arff
– soybean-test.arff

Input command:
– java weka.classifiers.trees.J48 -t soybean-train.arff -T soybean-test.arff -i
– -t: Training data
– -T: Test data
– -i: Provides more detailed output
Soybean Results

• True Positive (TP) rate

– Number correctly classified as class x / Actual total in class x
– Equivalent to Recall
• False Positive (FP) rate
– Number incorrectly classified as class x / Actual total of all classes, except x
Soybean Results (cont...)

• Precision:
– Number classified as class x which truly have class x / Total classified as class x
• F-measure:
– 2*Precision*Recall / (Precision + Recall)
– i.e. a combined measure for precision and recall
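
The per-class measures defined above can be computed from a confusion matrix, as in the following sketch. It assumes rows hold the actual class and columns the predicted class. The class name, method names, and the 2x2 matrix are hypothetical and are not taken from the soybean output.

```java
// Sketch: per-class measures from a confusion matrix
// (assumed layout: rows = actual class, columns = predicted class).
class ClassMetrics {
    // TP rate (recall): correctly classified as x / actual total in x.
    static double tpRate(int[][] m, int x) {
        int actual = 0;
        for (int j = 0; j < m.length; j++) actual += m[x][j];
        return actual == 0 ? 0.0 : (double) m[x][x] / actual;
    }

    // FP rate: incorrectly classified as x / actual total of all other classes.
    static double fpRate(int[][] m, int x) {
        int falsePositives = 0, notX = 0;
        for (int i = 0; i < m.length; i++) {
            if (i == x) continue;
            falsePositives += m[i][x];
            for (int j = 0; j < m.length; j++) notX += m[i][j];
        }
        return notX == 0 ? 0.0 : (double) falsePositives / notX;
    }

    // Precision: correctly classified as x / total classified as x.
    static double precision(int[][] m, int x) {
        int predicted = 0;
        for (int i = 0; i < m.length; i++) predicted += m[i][x];
        return predicted == 0 ? 0.0 : (double) m[x][x] / predicted;
    }

    // F-measure: 2 * Precision * Recall / (Precision + Recall).
    static double fMeasure(int[][] m, int x) {
        double p = precision(m, x), r = tpRate(m, x);
        return p + r == 0 ? 0.0 : 2 * p * r / (p + r);
    }

    public static void main(String[] args) {
        int[][] matrix = {{8, 2}, {1, 9}}; // made-up two-class example
        System.out.println("TP rate:   " + tpRate(matrix, 0));
        System.out.println("FP rate:   " + fpRate(matrix, 0));
        System.out.println("Precision: " + precision(matrix, 0));
        System.out.println("F-measure: " + fMeasure(matrix, 0));
    }
}
```

For class 0 in this made-up matrix, the TP rate is 8/10, the FP rate is 1/10, the precision is 8/9, and the F-measure is 16/19.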
Soybean Results (cont...)
[Figure: results output for class h annotated with "Total Actual h", "Total Classified as h", and "Total Correct".]

Filters


weka.filters package

Transform datasets

Support for data preprocessing
− e.g. Removing/adding attributes
− e.g. Discretizing numeric attributes into nominal ones

More info in Weka Manual p. 15 & 16.
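
To make the discretization idea concrete, here is a minimal plain-Java sketch that maps numeric values to bin indices by equal-width binning. Equal-width binning is only one possible strategy, chosen here for illustration. The class and method names are hypothetical, and the sketch does not call the weka.filters package.

```java
import java.util.Arrays;

// Sketch of discretizing a numeric attribute; not Weka's filter code.
class DiscretizeSketch {
    // Equal-width binning: maps each value to a bin index in [0, bins).
    static int[] equalWidth(double[] values, int bins) {
        double min = values[0], max = values[0];
        for (double v : values) {
            min = Math.min(min, v);
            max = Math.max(max, v);
        }
        double width = (max - min) / bins;
        int[] out = new int[values.length];
        for (int i = 0; i < values.length; i++) {
            // If all values are equal, everything goes into bin 0.
            int b = width == 0 ? 0 : (int) ((values[i] - min) / width);
            out[i] = Math.min(b, bins - 1); // the maximum falls in the last bin
        }
        return out;
    }

    public static void main(String[] args) {
        double[] temperature = {64, 72, 85}; // made-up values
        // Each index can be treated as one value of a nominal attribute.
        System.out.println(Arrays.toString(equalWidth(temperature, 3)));
    }
}
```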
More Classifiers
Explorer

• Preprocess
• Classify
• Cluster
• Associate
• Select attributes
• Visualize
Preprocess

• Load Data
• Preprocess Data
• Analyse Attributes
Classify

• Select test options, e.g.:

– Use Training Set
– % Split
– Cross Validation...
• Run classifiers
• View results
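
As a rough illustration of the cross-validation test option, the sketch below partitions instance indices into k folds; each fold is used once for testing while the remaining folds are used for training. The round-robin assignment, the class and method names, and the sizes are assumptions of this sketch, and it does no shuffling or stratification.

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of k-fold partitioning; not the Explorer's own procedure.
class CrossValidationSketch {
    // Assigns instance indices 0..n-1 to k folds in round-robin order.
    static List<List<Integer>> folds(int n, int k) {
        List<List<Integer>> result = new ArrayList<>();
        for (int f = 0; f < k; f++) result.add(new ArrayList<>());
        for (int i = 0; i < n; i++) result.get(i % k).add(i);
        return result;
    }

    public static void main(String[] args) {
        int n = 14, k = 5; // hypothetical: 14 instances, 5 folds
        List<List<Integer>> parts = folds(n, k);
        for (int f = 0; f < k; f++) {
            int testSize = parts.get(f).size();
            System.out.println("Fold " + f + ": test on " + parts.get(f)
                + ", train on the other " + (n - testSize) + " instances");
        }
    }
}
```

By contrast, "Use Training Set" evaluates on the same instances used for training, and "% Split" holds out a single portion of the data for testing.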
Classify
Results
Experimenter

• Allows users to create, run, modify and analyse experiments in a more convenient manner than when processing individually.
– Setup
– Run
– Analyse
Experimenter: Setup

• Simple/Advanced
• Results Destinations
– ARFF
– CSV
– JDBC Database
[Figure: Experimenter Setup tab with callouts for 10-fold cross validation, datasets, number of runs, and classifiers.]
Run Simple Experiment
Results
Advanced Example

Multiple Classifiers
Advanced Example
