0% found this document useful (0 votes)

20 views13 pages

Weka Experiment

The document provides an introduction to WEKA, which is an open source data mining software. It describes the four major applications in WEKA: Explorer, Experimenter, Knowledge Flow, and Simple CLI. The Explorer allows users to preprocess data, perform classification, clustering, association rule mining, attribute selection, and data visualization. It also provides details on loading and exploring data in the Explorer interface. The document then briefly outlines the functionality of the Experimenter application for performing experiments.

Uploaded by

safrinfathima746

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views13 pages

Weka Experiment

Uploaded by

safrinfathima746

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Exp No: 1 INTRODUCTION TO WEKA 1

Roll Number: Ur num here

LAUNCHING WEKA:

WEKA  Waikato Environment for Knowledge Analysis

The WEKA GUI Chooser provides a starting point for launching WEKA’S main GUI applications
and supporting tools. The GUI Chooser consists of four buttons—one for each of the four major Weka
applications—and four menus.
The buttons can be used to start the following applications:
1. Explorer: An environment for exploring data with WEKA.
2. Experimenter: An environment for performing experiments and conducting statistical
tests between learning schemes.
3. Knowledge Flow: It supports essentially the same functions as the explorer but with a
drag and drop interface. One advantage is that it supports incremental learning.
4. Simple CLI: Provides a simple command-line interface that allows direct execution of
WEKA commands for operating systems that do not provide their own command line interface.

The menu consists of four sections like Program, Tools, Visualization, and Help.
EXPLORER:
It is a user interface which contains a group of tabs just below the title bar. The tabs are as
follows:
1. Preprocess
2. Classify
3. Cluster
4. Associate
5. Select Attributes
6. Visualize
The bottom of the window contains status box, log and WEKA bird.
1. PREPROCESSING:

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 2
Roll Number: Ur num here

LOADING DATA
The first four buttons at the top of the preprocess section enable you to load
Data into WEKA:
1. Open file: It shows a dialog box allowing you to browse for the data file on the local file
system.
2. Open URL: Asks for a Uniform Resource Locator address for where the data is stored.
3. Open DB: Reads data from a database.
4. Generate: It is used to generate artificial data from a variety of Data Generators.

Using the Open file button we can read files in a variety of formats like WEKA’s ARFF
format, CSV format. Typically ARFF files have .arff extension and CSV files .csv extension.
THE CURRENT RELATION
The Current relation box contains the currently loaded data i.e. interpreted as a single
relational table in database terminology, which has three entries:
1. Relation: It provides the name of the relation in the file from which it was loaded.
Filters are used modify the name of a relation.
2. Instances: The number of instances (data points/records) in the data.
3. Attributes: The number of attributes (features) in the data.
ATTRIBUTES
It is located below the current relation box which contains four buttons, they are:
1) All is used to tick all boxes 2) None is used to clear all boxes 3) Invert is used make ticked
boxes unticked. 4) Pattern is used to select attributes by representing an expression. E.g. a.* is
used to select all the attributes that begins with a.
SELECTED ATTRIBUTE:
It is located beside the current relation box which contains the following:
1. Name: It specifies the name of the attribute i.e. same as in the attribute list.
2. Type: It specifies the type of attribute, most commonly Nominal or Numeric.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 3
Roll Number: Ur num here

3. Missing: It provides a numeric value of instances in the data for which an attribute is missing.
4. Distinct: It provides the number of different values that the data contains for an attribute.
5. Unique: it provides the number of instances in the data having a value for an attribute that no
other instances have.
FILTERS
By clicking the Choose button at the left of the Filter box, it is possible to select one of
the filters in WEKA. Once a filter has been selected, its name and options are shown in the field
next to the Choose button, by clicking on this box with the left mouse button it shows a
GenericObjectEditor dialog box which is used to configure the filter.

2. CLASSIFICATION

Classification has a text box which gives the name of currently selected classifier, and
its options. By clicking it with the left mouse button it shows a GenericObjectEditor dialog box,
which is same as for filters i.e. used to configure the current classifier options.
TEST OPTIONS
The result of applying the chosen classifier will be tested according to the options that
are set by clicking in the Test options box. There are four test modes:
1. Use training set.
2. Supplied test set.
3. Cross-validation.
4. Percentage split.

Once the classifier, test options and class have all been set, the learning process is
started by clicking on the Start button. We can stop the training process at any time by clicking
on the Stop button.
The Classifier output area to the right of the display is filled with text describing the
results of training and testing.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 4
Roll Number: Ur num here

After training several classifiers, the Result List will contain several entries using which
we can move over various results that have been generated. By pressing Delete we can
remove a selected entry from the results.

3. CLUSTERING

By clicking the text box beside the choose button in the Clusterer box, it shows a dialog
box used to choose a new clustering scheme.
The Cluster mode box is used to choose what to cluster and how to evaluate the
results. The first three options in it are same as in classification like Use training set,
Supplied test set and Percentage split. The fourth option is classes to clusters evaluation
An additional option in the Cluster mode box is the Store clusters for visualization
which finds whether or not it will be possible to visualize the clusters once training is
complete.
Ignore Attributes: when clustering, some attributes in the data should be ignored. It
shows a small window that allows you to select which attributes are ignored.
4. ASSOCIATING

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 5
Roll Number: Ur num here

It contains schemes for learning association rules, and the learners are chosen and
configured in the same way as the clusterer, filters, and classifiers in the other panels.
5. SELECTING ATTRIBUTES

Attribute selection involves searching through all possible combinations of attributes in the
data to find which subset of attributes works best for prediction. To do this, two objects must be set
up: an attribute evaluator and a search method. The evaluator determines what method is used to
assign a worth to each subset of attributes. The search method determines what style of search is
performed.
The Attribute Selection Mode box has two options:
1. Use full training set: The worth of the attribute subset is determined using the full set of
training data.
2. Cross-validation: The worth of the attribute subset is determined by a process of cross-
validation. The Fold and Seed fields set the number of folds to use and the random seed used when
shuffling the data.
6. VISUALIZING

WEKA’s visualization section allows you to visualize 2D plots of the current relation.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 6
Roll Number: Ur num here

EXPERIMENTER: The Weka Experiment Environment enables the user to create, run, modify, and
analyses experiments in a more convenient manner. It can also be run from the command line using
the Simple CLI.
New Experiment:
After clicking on new default parameters for an Experiment are defined.
We can choose the experiment in two different modes 1) Simple and 2) Advanced

Result Destination:
By default, an ARFF file is the destination for the results output. But we can also choose
CSV file as the destination for output file. The advantage of ARFF or CSV files is that they can
be created without any additional classes. The drawback is the lack of ability to resume the
interrupted experiment.
Experiment type:
The user can choose between the following three different types:
1. Cross-validation: it is a default type and it performs stratified cross-validation
with the given number of folds.
2. Train/Test Percentage Split: it splits a dataset according to the given percentage
into a train and a test file after the order of the data has been randomized and stratified.
3. Train/Test Percentage Split: As it is impossible to specify an explicit train/test
files pair, one can abuse this type to un-merge previously merged train and test file into the
two original files.
Additionally, one can choose between Classification and Regression, depending on the
datasets and classifiers one uses.
Data Sets:
One can add dataset files either with an absolute path or with a relative path.
Iteration control:
1. Number of repetitions: In order to get statistically meaningful results, the default number
of iterations is 10.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 7
Roll Number: Ur num here

2. Data sets first/Algorithms first: As soon as one has more than one dataset and algorithm, it
can be useful to switch from datasets being iterated over first to algorithms.
Algorithms: New algorithms can be added via the “Add New” button.
Opening this dialog for the first time, ZeroR is presented.

By clicking on the Choose button one can choose another classifier which is as shown in
the below diagram:

The “Filter...” button enables us to highlight classifiers that can handle certain
attributes and class types. With “Remove Filter” button one can clear the classifiers that are
highlighted earlier.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 8
Roll Number: Ur num here

With the Load options... and Save options... buttons one can load and save the setup of
a selected classifier from and to XML.
Running an Experiment:
To run the current experiment, click the Run tab at the top of the Experiment Environment
window. The current experiment performs 10 runs of 10-fold stratified cross-validation.

After clicking the Run tab, it shows a window with start button and stop button, by clicking on
start button we can run the experiment and by clicking on stop button we can run the experiment.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 9
Roll Number: Ur num here

If the experiment was defined correctly, the 3 messages shown above will be displayed in the
Log panel.
Advanced
Defining an experiment: When the Experimenter is started in Advanced mode, the Setup tab is
displayed. Now click New to initialize an experiment.

To define the dataset to be processed by a scheme, first select Use relative paths in the
Datasets panel of the Setup tab and then click on Add new... button.
Saving Results of the experiment
To identify a dataset to which the results are to be sent, click on the Instances- ResultListener
entry in the Destination panel, which opens a dialog box with a label named as “output file”.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 10
Roll Number: Ur num here

Now give the name of the output file and click on OK button. The dataset name is now
displayed in the Datasets panel of the Setup tab. This is as shown in the following figure:

Now we can run the experiment by clicking the Run tab at the top of the experiment
environment window. The current experiment performs 10 randomized train and test runs.
To change from random train and test experiments to cross-validation experiments, click on
the Result generator entry.
Using analysis tab in experiment environment window one can analyze the results of experiments
using experiment analyzer.

KNOWLEDGE FLOW:
The Knowledge Flow provides an alternative to the Explorer as a graphical front end to WEKA’s core
algorithms. It is represented as shown in the following figure. The Knowledge Flow presents a data-
flow inspired interface to WEKA.
The Knowledge Flow offers the following features:
1. Intuitive data flow style layout.
2. Process data in batches or incrementally.
3. Process multiple batches or streams in parallel (each separate flow executes in its own thread).
4. Chain filters together.
5. View models produced by classifiers for each fold in a cross validation.
6. visualize performance of incremental classifiers during processing

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 11
Roll Number: Ur num here

7. Plug-in facility for allowing easy addition of new components to the Knowledge Flow.

Components:

1. Data Sources: All WEKA loaders are available.

2. Data Sinks: All WEKA savers are available.
3. Filters: All WEKA’s filters are available.
4. Classifiers: All WEKA classifiers are available.
5. Clusterers: All WEKA clusterers are available.
6. Evaluation: It contains different kinds of techniques like TrainingSetMaker, TestSetMaker,
CrossValidationFoldMaker, TrainTestSplitMaker, ClassAssigner, ClassValuePicker,
ClassifierPerformanceEvaluator, IncrementalClassifierEvaluator,
ClustererPerformanceEvaluator, and PredictionAppender.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 12
Roll Number: Ur num here

7. Visualization: It contains different models like DataVisualizer, ScatterPlotMatrix,

AttributeSummarizer, ModelPerformanceChart, TextViewer, GraphViewerbased, and
StripChart.
Plug-in Facility:
The Knowledge Flow offers the ability to easily add new components via a plug-in mechanism.
SIMPLE CLI:
The Simple CLI provides full access to all Weka classes like classifiers, filters, clusterers, etc.,
but without the hassle of the CLASSPATH.
It offers a simple Weka shell with separated command line and output.
The simple command line interface is represented as shown in the following figure:

The following commands are available in the Simple CLI:

1. java <classname> [<args>]: - invokes a java class with the given arguments (if any)
2. break: - it stops the current thread in a friendly manner. e.g., a running classifier
3. kill: - stops the current thread in an unfriendly fashion
4. cls:- clears the output area
5. exit:- exits the Simple CLI
6. help [<command>]:- provides an information about the command available in the simple
CLI. Also it provides an overview of all commands available if the command is not specified
as an argument.

Software Lab-II(DWH) Dept. of CSE

Exp No: 1 INTRODUCTION TO WEKA 13
Roll Number: Ur num here

In order to invoke a Weka class, only the way is one has to prefix the class with”java”. This
command tells the Simple CLI to load a class and execute it with any given parameters.
For example: java weka.classifiers.trees.J48 -t c:/temp/iris.arff which results in the following
output.

Using simple CLI we can also perform command redirection using the operator “>”. For
example: java weka.classifiers.trees.J48 -t test.arff > j48.txt

Software Lab-II(DWH) Dept. of CSE

SaaS Implementation Best Practices - v2
No ratings yet
SaaS Implementation Best Practices - v2
24 pages
Laboratory Manual On: Data Mining
No ratings yet
Laboratory Manual On: Data Mining
41 pages
WEKA Practical Protocol
No ratings yet
WEKA Practical Protocol
40 pages
Lecture 12 - Weka Tutorial
No ratings yet
Lecture 12 - Weka Tutorial
84 pages
CS-703 (B) Data Warehousing and Data Mining Lab
No ratings yet
CS-703 (B) Data Warehousing and Data Mining Lab
50 pages
DWDM Lab Manual
No ratings yet
DWDM Lab Manual
55 pages
Data Mining Example (Using Weka)
50% (2)
Data Mining Example (Using Weka)
59 pages
Dinesh DM
No ratings yet
Dinesh DM
34 pages
Dm&pa Lab Manual
No ratings yet
Dm&pa Lab Manual
68 pages
Data Warehousing Lab Excercise
No ratings yet
Data Warehousing Lab Excercise
45 pages
WEKA Lab Record
No ratings yet
WEKA Lab Record
69 pages
Windows Server 2003 Domains Active Directory
No ratings yet
Windows Server 2003 Domains Active Directory
392 pages
Lab Manual Format
No ratings yet
Lab Manual Format
37 pages
Weka Lab
No ratings yet
Weka Lab
11 pages
Primitive Data Types & Wrapper Classes: Interface
No ratings yet
Primitive Data Types & Wrapper Classes: Interface
6 pages
Lab Manual - DM
No ratings yet
Lab Manual - DM
56 pages
Design and Implementation of A Computerised Stadium Management Information System
100% (8)
Design and Implementation of A Computerised Stadium Management Information System
32 pages
Data Warehousing and Data Mining Lab Manual
0% (1)
Data Warehousing and Data Mining Lab Manual
30 pages
GPON OLT (New 8PON Port, 16PON Port) User Manual-Command Line Operation - V1.1 20180723
No ratings yet
GPON OLT (New 8PON Port, 16PON Port) User Manual-Command Line Operation - V1.1 20180723
336 pages
Modicon LMC078: Motion Controller Programming Guide
No ratings yet
Modicon LMC078: Motion Controller Programming Guide
276 pages
32013105-BDA LabManual
No ratings yet
32013105-BDA LabManual
122 pages
Data Warehousing and Data Mining Lab Manual
100% (1)
Data Warehousing and Data Mining Lab Manual
30 pages
2023 Full Planner Print
No ratings yet
2023 Full Planner Print
139 pages
DM Lab Material
No ratings yet
DM Lab Material
88 pages
Data Warehousing
No ratings yet
Data Warehousing
54 pages
Mooc On Weka
No ratings yet
Mooc On Weka
59 pages
DWDM File-Final Ver3.pdf 20241230 172003 0000
No ratings yet
DWDM File-Final Ver3.pdf 20241230 172003 0000
54 pages
DM Lab
No ratings yet
DM Lab
101 pages
Rovertown 1
No ratings yet
Rovertown 1
47 pages
18 Ajit Gupta Android Practical
No ratings yet
18 Ajit Gupta Android Practical
122 pages
Satp Installation Guide 3.2
No ratings yet
Satp Installation Guide 3.2
81 pages
Rintro Wekacomplete
No ratings yet
Rintro Wekacomplete
135 pages
Yealink T55A Teams Phone Edition User Guide V15.85
No ratings yet
Yealink T55A Teams Phone Edition User Guide V15.85
51 pages
Data Mining Complete Lab Manual - DRSNR
No ratings yet
Data Mining Complete Lab Manual - DRSNR
27 pages
Weka Lab Manual
No ratings yet
Weka Lab Manual
49 pages
Aiml Manual
No ratings yet
Aiml Manual
27 pages
DMW Lab Print
No ratings yet
DMW Lab Print
21 pages
Data Mining Lab Manual
No ratings yet
Data Mining Lab Manual
50 pages
Data Mining Unit 5
No ratings yet
Data Mining Unit 5
12 pages
03 01 PatMax Logic
No ratings yet
03 01 PatMax Logic
15 pages
Controlcasepciv4 241115112355 3cfe7e3f
No ratings yet
Controlcasepciv4 241115112355 3cfe7e3f
27 pages
Experiment WEKA
No ratings yet
Experiment WEKA
16 pages
DHW Lab (Ex1 To 3)
No ratings yet
DHW Lab (Ex1 To 3)
18 pages
Deepak Dmbi File
No ratings yet
Deepak Dmbi File
40 pages
Expt 1 Docx
No ratings yet
Expt 1 Docx
15 pages
Wekappt
No ratings yet
Wekappt
58 pages
Weka Installation Steps Final
No ratings yet
Weka Installation Steps Final
7 pages
Datawarehouse Pract 2
No ratings yet
Datawarehouse Pract 2
7 pages
Semtech Broadcast SelectorGuide 2021 Web
No ratings yet
Semtech Broadcast SelectorGuide 2021 Web
12 pages
Lab 04
No ratings yet
Lab 04
7 pages
WEKA Experimenter Tutorial For Version 3-5-8: David Scuse Peter Reutemann July 14, 2008
No ratings yet
WEKA Experimenter Tutorial For Version 3-5-8: David Scuse Peter Reutemann July 14, 2008
40 pages
DM Lab Task-1 Expr's-1
No ratings yet
DM Lab Task-1 Expr's-1
58 pages
Cheryl Simons Resume 2013-4
No ratings yet
Cheryl Simons Resume 2013-4
3 pages
Data Warehousing and Data Mining Lab
No ratings yet
Data Warehousing and Data Mining Lab
53 pages
HP Color LaserJet CP5220 Ersatzteile PDF
No ratings yet
HP Color LaserJet CP5220 Ersatzteile PDF
51 pages
Introduction To Weka: Xingquan (Hill) Zhu
No ratings yet
Introduction To Weka: Xingquan (Hill) Zhu
63 pages
Lab 02
No ratings yet
Lab 02
4 pages
DWM - Exp No 5
No ratings yet
DWM - Exp No 5
7 pages
Handout 1
No ratings yet
Handout 1
5 pages
Machine Learning May 2024
No ratings yet
Machine Learning May 2024
8 pages
Weka Data Miningvsem
No ratings yet
Weka Data Miningvsem
7 pages
Wire Color Code Charts
No ratings yet
Wire Color Code Charts
4 pages
Comparing Open-Source Speech Recognition Toolkits
No ratings yet
Comparing Open-Source Speech Recognition Toolkits
12 pages
DMBI Exp1: Introduction To WEKA Tool
No ratings yet
DMBI Exp1: Introduction To WEKA Tool
6 pages
Healthcare ERP Project Success: It's All About Avoiding Missteps
No ratings yet
Healthcare ERP Project Success: It's All About Avoiding Missteps
5 pages
Weka Tutorial
No ratings yet
Weka Tutorial
45 pages
Presentation (Vehicle Insurance Policy)
No ratings yet
Presentation (Vehicle Insurance Policy)
10 pages
Weka Tutorial
No ratings yet
Weka Tutorial
32 pages
CA ERwin Tutorial
No ratings yet
CA ERwin Tutorial
12 pages
Weka (20030421-Version1 by Kdelab)
No ratings yet
Weka (20030421-Version1 by Kdelab)
51 pages
QHY163M Review en
No ratings yet
QHY163M Review en
31 pages
Code:: Bahria University, Islamabad Campus Short Assignment (Quiz 01) (Fall 2020 Semester)
No ratings yet
Code:: Bahria University, Islamabad Campus Short Assignment (Quiz 01) (Fall 2020 Semester)
4 pages
WEKA Explorer Tutorial
No ratings yet
WEKA Explorer Tutorial
45 pages
A7ph 206 1
No ratings yet
A7ph 206 1
7 pages
ExplorerGuide A Version 3-5-8
No ratings yet
ExplorerGuide A Version 3-5-8
22 pages
Product Detail - 700d - English - 3
No ratings yet
Product Detail - 700d - English - 3
2 pages
DWDM WEEK1&2
No ratings yet
DWDM WEEK1&2
13 pages
Weka Overview Slides
No ratings yet
Weka Overview Slides
31 pages
SUN2000-115kTL-M2 Datasheet
No ratings yet
SUN2000-115kTL-M2 Datasheet
2 pages
Weka Software Manuala
No ratings yet
Weka Software Manuala
20 pages
WEKA Explorer User Guide For Version 3-4: Richard Kirkby Eibe Frank July 15, 2008
No ratings yet
WEKA Explorer User Guide For Version 3-4: Richard Kirkby Eibe Frank July 15, 2008
13 pages
Data Base Management Key Points
No ratings yet
Data Base Management Key Points
8 pages
Fine Art Colour Photography: Lesson 1 Course Notes
No ratings yet
Fine Art Colour Photography: Lesson 1 Course Notes
19 pages
Examples On Sampling and Aliasing Phenomena: Example 1
No ratings yet
Examples On Sampling and Aliasing Phenomena: Example 1
5 pages
AI32 Guide To Weka PDF
No ratings yet
AI32 Guide To Weka PDF
6 pages
WEKA Intro
No ratings yet
WEKA Intro
17 pages
Crystal Reports Introduction: Versions 2008-2016
From Everand
Crystal Reports Introduction: Versions 2008-2016
Seth Bonder
No ratings yet
Advance Excel 2016: Training guide
From Everand
Advance Excel 2016: Training guide
Ritu Arora
No ratings yet
(Part 2) Java 4 Selenium WebDriver: Come Learn How To Program For Automation Testing
From Everand
(Part 2) Java 4 Selenium WebDriver: Come Learn How To Program For Automation Testing
Rex Jones
No ratings yet
Selenium Testing Tools Interview Questions You'll Most Likely Be Asked: Second Edition
From Everand
Selenium Testing Tools Interview Questions You'll Most Likely Be Asked: Second Edition
Vibrant Publishers
No ratings yet

Weka Experiment

Uploaded by

Weka Experiment

Uploaded by

Exp No: 1 INTRODUCTION TO WEKA 1

Roll Number: Ur num here

WEKA  Waikato Environment for Knowledge Analysis

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

1. Data Sources: All WEKA loaders are available.

Software Lab-II(DWH) Dept. of CSE

7. Visualization: It contains different models like DataVisualizer, ScatterPlotMatrix,

The following commands are available in the Simple CLI:

Software Lab-II(DWH) Dept. of CSE

Software Lab-II(DWH) Dept. of CSE

You might also like