Learning To Use We Ka
Learning To Use We Ka
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be
applied directly to a dataset (using GUI) or called from your own Java code (using Weka Java library). Tools
(or functions) in Weka include:
The Weka GUI Chooser (class weka.gui.GUIChooser) provides a starting point for launching Wekas
main GUI applications and supporting tools. If one prefers a MDI (multiple document interface)
appearance, then this is provided by an alternative launcher called Main (class weka.gui.Main).
The GUI Chooser consists of four buttons one for each of the four major Weka applications and four
menus. The buttons can be used to start the following applications:
Explorer: An environment for exploring data with WEKA (the rest of this documentation deals
with this application in more detail).
Experimenter: An environment for performing experiments and conducting statistical tests
between learning schemes.
Knowledge Flow: This environment supports essentially the same functions as the Explorer but
with a drag-and-drop interface. One advantage is that it supports incremental learning.
Simple CLI: Provides a simple command-line interface that allows direct execution of WEKA
commands for operating systems that do not provide their own command line interface.
HTS_LearningWeka_KnowYourData
9/17/2015 11:29:10 AM
Page 1 of 5
(c)
HTS_LearningWeka_KnowYourData
9/17/2015 11:29:10 AM
Page 2 of 5
(d)
Weather data: (a) spreadsheet, (b) CSV format, and (c) ARFF numeric (d) ARFF nominal.
Getting to Know your Data
WEKA provides 23 sample data (.ARFF) in directory C:\Program Files (x86)\Weka-3-6\data\. Please
explore these data using Arff Viewer in Weka Tools.
HTS_LearningWeka_KnowYourData
9/17/2015 11:29:10 AM
Page 3 of 5
Arff Viewer shows all data credit-g in relational format. You can see any type of attribute in column
header. You also can view sample data in notepad or WordPad tools. Some data have description
before relation name. Furthermore, Weka also have feature to visualize data from menu
Visualization. You can choose any visualization format to show the dispersion of data. For example:
You can see figure blow that visualize sample data credit-g. You can change the ordinate and axis of
graph according to the data attribute.
HTS_LearningWeka_KnowYourData
9/17/2015 11:29:10 AM
Page 4 of 5
Task 1: Explore 23 sample data (.ARFF) using Arff Viewer in Weka Tools. Read the description (attribute,
number of tuple, etc) of data.
Task 2: Open sample data in Weka and try to visualize it, then see the dispersion of data. Please, try any
parameter of visualization according to the data attribute.
Task 3: Explore tab Preprocess, Classify, Cluster, Associate, Select Attributes and Visualize from Weka
Explorer. Remember you must choose filter method and data that you want to mining before do data
mining process.
Task 4: Please read WekaManual.pdf to explore and understand about Weka. This file can be access in
directory: C:\Program Files (x86)\Weka-3-6\
HTS_LearningWeka_KnowYourData
9/17/2015 11:29:10 AM
Page 5 of 5