Weka Exercise - Introduction To Algorithms

This document discusses running basic machine learning algorithms and data preprocessing techniques using the Weka machine learning software. It explores using the ZeroR, 1R, and SVM algorithms on iris and ionosphere datasets to get an initial understanding of the number of classes in each dataset. It also describes several common Weka filters that can be used for data preprocessing tasks like resampling to reduce dataset size, discretizing continuous attributes, reordering data, adding noise, reducing attributes, standardizing attribute ranges, filling in missing values, and removing attributes.

Uploaded by

Katlo Kay

Faculty of Technology

University of Sunderland

WEKA Machine Learning: Running Basic Algorithms

Aim: To show how to run several algorithms on datasets to get an idea of how many classes (types)
the data contains. We shall use 3 algorithms commonly used to get a first feel for the data. We also
look at basic data cleaning.

Data files needed: iris.arff and ionosphere.arff

Algorithms explored: ZeroR, OneR (1R) and SVM, to determine key attributes.

1. ZeroR:

Open the file in the Preprocess tab.

In the Classify tab, use the Choose button to select classifiers > rules > ZeroR, then press Start.
The confusion matrix has one row per class, so we can see that there are 2 categories:

=== Confusion Matrix ===

  a  b   <-- classified as
  6  2 |  a = A
  8  0 |  b = B
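
ZeroR ignores every attribute and always predicts the most frequent class in the training data, which makes it a useful baseline. The idea can be sketched in a few lines of Python. This is not Weka's implementation; the labels and the function names zero_r and confusion_matrix are made up for illustration.

```python
from collections import Counter

def zero_r(train_labels):
    """Return the most frequent class label (the whole ZeroR 'model')."""
    return Counter(train_labels).most_common(1)[0][0]

def confusion_matrix(actual, predicted, classes):
    """Rows are actual classes, columns are predicted classes."""
    index = {c: i for i, c in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix

# Hypothetical labels: 5 instances of class A, 3 of class B.
labels = ["A"] * 5 + ["B"] * 3
majority = zero_r(labels)
predictions = [majority] * len(labels)  # same prediction for every row
matrix = confusion_matrix(labels, predictions, ["A", "B"])
print(majority)
print(matrix)
```

Because the prediction never changes, all counts fall in the majority-class column; the number of rows still tells us how many classes the data has.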

2. OneR (1R)

Open iris.arff in the Preprocess tab.

In the Classify tab, use the Choose button to select classifiers > rules > OneR, then press Start.
In the confusion matrix we can see that there are 3 categories:

=== Confusion Matrix ===

  a  b  c   <-- classified as
 50  0  0 |  a = Iris-setosa
  0 44  6 |  b = Iris-versicolor
  0  6 44 |  c = Iris-virginica
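
OneR builds a one-attribute rule: for each attribute it maps every value to the class seen most often with that value, then keeps the attribute whose rule makes the fewest training errors. A minimal Python sketch of that idea follows, assuming nominal attributes only (Weka's OneR also splits numeric attributes into intervals, which is omitted here). The data and the function name one_r are hypothetical.

```python
from collections import Counter, defaultdict

def one_r(rows, labels):
    """Pick the single attribute whose value -> majority-class rule
    makes the fewest errors on the training data."""
    best = None
    for attr in range(len(rows[0])):
        # Count the class labels seen with each value of this attribute.
        counts = defaultdict(Counter)
        for row, label in zip(rows, labels):
            counts[row[attr]][label] += 1
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        errors = sum(rule[row[attr]] != label for row, label in zip(rows, labels))
        if best is None or errors < best[2]:
            best = (attr, rule, errors)
    return best

# Hypothetical nominal data: (petal size, colour) -> species.
rows = [("small", "blue"), ("small", "white"), ("large", "blue"),
        ("large", "white"), ("large", "blue")]
labels = ["setosa", "setosa", "virginica", "virginica", "virginica"]
attr, rule, errors = one_r(rows, labels)
print(attr, rule, errors)
```

Here attribute 0 (petal size) separates the two species without error, so it is chosen over colour.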

3. SVM (Support Vector Machine)

Open iris.arff in the Preprocess tab.

◦ Examine the data with the Edit button in Preprocess

You will see a table of the data. The columns are:

No. = the row number of each instance
Sepal and petal measurements, which are the 4 numeric columns of data
class = the type of flower, so we can train the system to categorise flower types
The Selected attribute panel shows, for the attribute currently selected: 0 missing values, 35
distinct values, 9 unique values (values that occur only once), the mean of the attribute and the
range of measurements (minimum and maximum).
Below the attributes area we can see a coloured histogram; each colour is one class, which
indicates how many types there may be.

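To run an SVM (support vector machine) in Weka, use the SMO classifier (sequential minimal optimization):

In the Classify tab with the Choose button select

◦ functions > SMO

▪ click in the command line next to the Choose button and change
· filterType – No normalization/standardization

· click in the kernel exponent and set it to 2 (this makes the SVM use a quadratic rather than a
linear kernel; in recent Weka versions the exponent is set inside the PolyKernel options)
◦ then press Start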
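
For intuition about the kernel exponent: a polynomial kernel of exponent p replaces the dot product x·y with (x·y)^p, so p = 1 gives a linear SVM and p = 2 a quadratic decision boundary. The sketch below assumes this simple form without lower-order terms; the function name poly_kernel and the vectors are illustrative and not part of Weka.

```python
def poly_kernel(x, y, exponent=1):
    """Polynomial kernel (x . y) ** exponent, without lower-order terms."""
    dot = sum(a * b for a, b in zip(x, y))
    return dot ** exponent

x = [1.0, 2.0]
y = [3.0, 0.5]
linear = poly_kernel(x, y, exponent=1)     # plain dot product
quadratic = poly_kernel(x, y, exponent=2)  # squared dot product
print(linear, quadratic)
```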

We see in the confusion matrix that there are 3 types of iris flower: a, b and c.

=== Confusion Matrix ===

a b c <-- classified as
50 0 0 | a = Iris-setosa
0 47 3 | b = Iris-versicolor
0 3 47 | c = Iris-virginica
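
A confusion matrix also gives the accuracy directly: the diagonal holds the correctly classified instances. As a small worked example in Python, using the two iris matrices reported in this exercise (the function name accuracy is just for illustration):

```python
def accuracy(matrix):
    """Fraction of instances on the diagonal (correctly classified)."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total

# Rows are actual classes, columns are predicted classes.
one_r_matrix = [[50, 0, 0], [0, 44, 6], [0, 6, 44]]
smo_matrix = [[50, 0, 0], [0, 47, 3], [0, 3, 47]]

print(len(smo_matrix))          # number of classes
print(accuracy(one_r_matrix))   # 138 correct out of 150
print(accuracy(smo_matrix))     # 144 correct out of 150
```

So OneR classifies 92% of the iris instances correctly and SMO 96%.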

Using Filters to Prepare/Clean Data

WEKA implements pre-processing of data by means of the editor (as seen above) and filters. We list
some of the main filters. Filters are selected in the Preprocess tab with the Choose button in the
Filter panel and run with Apply. Use the ionosphere dataset.

Filters can make very large datasets smaller so that they can be processed on less powerful systems.
They can also randomise the order of the data for better machine learning, or even add noise to the data.

• To reduce the dataset size use: Filter > Supervised > Instance > Resample and set sampleSizePercent to e.g. 50
• To merge data ranges e.g. income into low/medium/high: Filter > Supervised > Attribute >
Discretize
• To reorder the attributes (columns): Filter > Unsupervised > Attribute > Reorder; to shuffle the rows use Filter > Unsupervised > Instance > Randomize
• To add noise to improve some algorithms: Filter > Unsupervised > Attribute > AddNoise
• To automatically reduce attributes: Filter > Unsupervised > Attribute > PrincipalComponents
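
To make the low/medium/high idea concrete, here is a small Python sketch of discretization using equal-width ranges. This is a simplified stand-in: Weka's supervised Discretize chooses its cut points using the class labels, which the sketch does not attempt. The income figures and the function name equal_width_bins are made up.

```python
def equal_width_bins(values, labels=("low", "medium", "high")):
    """Map each numeric value to one of len(labels) equal-width ranges."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / len(labels)
    result = []
    for v in values:
        # The maximum value would fall just past the last bin, so clamp it.
        i = min(int((v - lo) / width), len(labels) - 1) if width else 0
        result.append(labels[i])
    return result

incomes = [12000, 18000, 35000, 41000, 58000, 72000]  # made-up figures
binned = equal_width_bins(incomes)
print(binned)
```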

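Filters also standardize the ranges of data.

• Standardise data to zero mean and unit variance: Filter > Unsupervised > Attribute > Standardize
• Normalise data to a fixed range (0 to 1 by default; scale 2.0 and translation -1.0 give -1 to +1): Filter > Unsupervised > Attribute > Normalize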
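
Standardising and normalising are easy to confuse: standardising centres a column on zero mean with unit standard deviation, while normalising squeezes it into a fixed range such as 0 to 1 or -1 to +1. A minimal Python sketch of both, assuming the sample standard deviation and using made-up data; the function names and the scale/translation parameters are illustrative rather than Weka's code:

```python
from statistics import mean, stdev

def standardize(values):
    """Rescale to zero mean and unit (sample) standard deviation."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

def normalize(values, scale=1.0, translation=0.0):
    """Rescale to [translation, translation + scale]; default is [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * scale + translation for v in values]

data = [2.0, 4.0, 6.0, 8.0]
print(standardize(data))
print(normalize(data))                               # range 0 to 1
print(normalize(data, scale=2.0, translation=-1.0))  # range -1 to +1
```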

Filters can also pick out a subset of features to process to make processing more efficient.

• Removing attributes: Filter > Unsupervised > Attribute > Remove and indicate the column(s) in attributeIndices, e.g. 1; set invertSelection to True to keep only the listed columns instead

Filters can fill in missing values:

• With the ionosphere dataset freshly loaded, use the Edit button to select some values in a column and
delete them
• You will see the number of missing values in the Selected attribute panel
• Now we will replace these values automatically with a filter: Filter > Unsupervised >
Attribute > ReplaceMissingValues
• Then go back to Edit and see what values have replaced the missing ones (numeric attributes are
filled with the column mean, nominal ones with the most frequent value)
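
The replacement for a numeric column can be sketched as follows: every missing entry is filled with the mean of the values that are present. This is only an illustration in Python with made-up numbers, using None to mark a deleted value; the function name replace_missing is hypothetical.

```python
from statistics import mean

def replace_missing(column):
    """Replace None entries in a numeric column with the mean of the rest."""
    present = [v for v in column if v is not None]
    fill = mean(present)
    return [fill if v is None else v for v in column]

column = [1.0, None, 3.0, None, 5.0]  # None marks a deleted value
filled = replace_missing(column)
print(filled)
```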
