EX-01-Weka and Rapidminer

The document outlines a study aimed at exploring the features of Weka, Rapid Miner tools, and UCI repository datasets. It details fundamental terms related to data attributes and instances, along with step-by-step procedures for using Weka and Rapid Miner to preprocess datasets. The conclusion emphasizes the exploration of various features across these tools and datasets.

Uploaded by

keerthivasank.22cse

EX. No: 01 STUDY OF WEKA, RAPIDMINER TOOLS AND UCI REPOSITORY DATASETS

AIM:
To explore the various features of the Weka and RapidMiner tools and the UCI Repository
datasets.

PROCEDURE:
FUNDAMENTAL TERMS:
Feature/Attribute: A single column of data is called a feature. It is a component of an
observation and is also called an attribute of a data instance. Some features may be inputs to a
model (the predictors) and others may be outputs or the features to be predicted.
Attribute values: Attribute values are numbers or symbols assigned to an attribute.
Target attribute: Target attribute is a special attribute which corresponds to the label of each
instance.
Instance: Each row in the dataset is called an instance.
Dataset: A collection of instances is called a dataset.
Training Dataset: A dataset that is fed into the machine learning algorithm to train the model.
Testing Dataset: A dataset that is used to validate the accuracy of the model but is not used to
train the model.
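
The terms above can be illustrated with a small sketch. The rows, the attribute names and the 40% test fraction are hypothetical values chosen for illustration only; they are not taken from the experiment.

```python
import random

# Hypothetical toy dataset: each dict is one instance (row),
# each key is an attribute (column). Values are illustrative only.
dataset = [
    {"outlook": "sunny", "temperature": 85, "play": "no"},
    {"outlook": "overcast", "temperature": 83, "play": "yes"},
    {"outlook": "rainy", "temperature": 70, "play": "yes"},
    {"outlook": "rainy", "temperature": 65, "play": "no"},
    {"outlook": "sunny", "temperature": 69, "play": "yes"},
]

TARGET = "play"  # target attribute: the label of each instance


def split_features_and_label(instance, target=TARGET):
    """Separate the predictor attributes from the target attribute."""
    features = {k: v for k, v in instance.items() if k != target}
    return features, instance[target]


def train_test_split(instances, test_fraction=0.4, seed=0):
    """Shuffle a copy of the instances and cut it into training and testing sets."""
    shuffled = list(instances)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


train, test = train_test_split(dataset)
features, label = split_features_and_label(dataset[0])
print(len(train), "training instances,", len(test), "testing instances")
print(features, "->", label)
```

The testing instances are held back from training so that the accuracy measured on them is not inflated by data the model has already seen.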
WEKA:
1. Download and install Weka.
2. In the Weka GUI Chooser window, select the Explorer button from the five available buttons.
3. Weka supports two common file formats:
ARFF - Attribute-Relation File Format
CSV - Comma-Separated Values
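
As a rough illustration of the ARFF layout, the sketch below parses a tiny hand-written ARFF text. It is not Weka's loader: it handles only nominal and numeric attributes, unquoted values and "?" for missing values. The relation name toy_weather and the rows are hypothetical.

```python
ARFF_TEXT = """\
% A comment line starts with a percent sign.
@relation toy_weather
@attribute outlook {sunny, overcast, rainy}
@attribute temperature numeric
@attribute play {yes, no}
@data
sunny,85,no
overcast,83,yes
rainy,?,yes
"""


def parse_arff(text):
    """Parse a small subset of ARFF: header declarations plus comma-separated rows."""
    relation, attributes, rows = None, [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blank lines and comments
            continue
        lower = line.lower()
        if in_data:
            values = [v.strip() for v in line.split(",")]
            row = {}
            for (name, kind), value in zip(attributes, values):
                if value == "?":              # '?' marks a missing value
                    row[name] = None
                elif kind == "numeric":
                    row[name] = float(value)
                else:
                    row[name] = value
            rows.append(row)
        elif lower.startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif lower.startswith("@attribute"):
            _, name, spec = line.split(None, 2)
            numeric = spec.lower() in ("numeric", "real", "integer")
            attributes.append((name, "numeric" if numeric else "nominal"))
        elif lower.startswith("@data"):
            in_data = True
    return relation, attributes, rows


relation, attributes, rows = parse_arff(ARFF_TEXT)
print(relation, attributes)
print(rows)
```

The header (@relation, @attribute) declares the attribute names and types, and everything after @data is one instance per line. A CSV file carries the same rows but has no type declarations.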
EXPLORER:
The Explorer window contains the Preprocess, Classify, Cluster, Associate, Select attributes and
Visualize tabs. Select Preprocess.
OPEN FILE:
Opens a dataset file stored on the machine.
OPEN URL:
Loads a dataset from a website address.
OPEN DB:
Loads data from a database.
CHOOSE:
Selects the filter to apply.
EDIT:
Opens the loaded dataset so that it can be viewed or edited before and after a filter is applied.
FILTERS:
Filters transform or tune the data.
1. REMOVE (ATTRIBUTE):

A filter that removes a range of attributes from the dataset.

1. Click the Open file button.
2. Browse to the Weka installation folder and open the data folder.
3. Choose weather.numeric.arff.
4. In the Filter panel, click the Choose button.
5. Choose unsupervised and then attribute.
6. Select the Remove filter.
7. Specify the attribute index in the filter editor window.
8. Click the Apply button.
9. Click the Edit button to see the data after the attribute has been removed.
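
The effect of removing an attribute by index can be sketched outside Weka as follows. This is only an illustration of the idea, not Weka's implementation; the function name remove_attributes and the three sample rows are hypothetical, and indices are taken to be 1-based as in the filter editor.

```python
def remove_attributes(header, rows, indices_1_based):
    """Drop the columns at the given 1-based positions from the header and every row."""
    drop = {i - 1 for i in indices_1_based}
    keep = [i for i in range(len(header)) if i not in drop]
    new_header = [header[i] for i in keep]
    new_rows = [[row[i] for i in keep] for row in rows]
    return new_header, new_rows


# Illustrative rows in the style of the weather data (values are assumptions).
header = ["outlook", "temperature", "humidity", "windy", "play"]
rows = [
    ["sunny", 85, 85, "FALSE", "no"],
    ["overcast", 83, 86, "FALSE", "yes"],
    ["rainy", 70, 96, "FALSE", "yes"],
]

new_header, new_rows = remove_attributes(header, rows, [3])  # drop the third attribute
print(new_header)
print(new_rows)
```

The number of instances is unchanged; only the selected column disappears from every row.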
2. REMOVE WITH VALUES:
Filters instances according to the value of an attribute.
1. Click the Open file button.
2. Choose weather.numeric.arff.
3. Choose unsupervised and then instance.
4. Select the “remove with values” filter.
5. Set the attribute index to 2 and the split point to 60.
6. In the output dataset, the second column contains only values that are above the
split point.
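
A minimal sketch of this kind of filtering on a numeric attribute is given below. It assumes that instances whose value is below the split point are dropped and all others are kept; the function name and the sample values are hypothetical.

```python
def remove_below_split(rows, attribute_index_1_based, split_point):
    """Keep only the rows whose value in the chosen column is not below the split point."""
    col = attribute_index_1_based - 1
    return [row for row in rows if row[col] >= split_point]


# Hypothetical rows: (outlook, temperature). Values chosen to show the effect.
rows = [
    ["sunny", 85],
    ["rainy", 55],
    ["overcast", 60],
    ["rainy", 58],
    ["sunny", 72],
]

kept = remove_below_split(rows, attribute_index_1_based=2, split_point=60)
print(kept)
```

Unlike the attribute filter, this filter changes the number of instances and leaves the set of columns untouched.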

3. REPLACE MISSING VALUES:

Replaces all missing values for nominal and numeric attributes in a dataset with the
modes and means from the training data.
1. Choose unsupervised and then attribute.
2. Select the “replace missing values” filter.
3. Click “edit” and delete any one of the values.
4. After the filter is applied, the deleted value and any other missing values are replaced by
the mean (numeric attributes) or the mode (nominal attributes).
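
Mean and mode replacement can be sketched as follows. This is an illustrative re-implementation of the idea, not the filter's code; missing values are represented by None, the caller states which columns are numeric, and the rows are hypothetical.

```python
from collections import Counter
from statistics import mean


def replace_missing(rows, numeric_columns):
    """Fill None with the column mean (numeric) or the column mode (nominal).

    Assumes every column has at least one value that is not missing.
    """
    n_cols = len(rows[0])
    fill = []
    for c in range(n_cols):
        present = [row[c] for row in rows if row[c] is not None]
        if c in numeric_columns:
            fill.append(mean(present))                           # numeric: mean
        else:
            fill.append(Counter(present).most_common(1)[0][0])   # nominal: mode
    return [[fill[c] if v is None else v for c, v in enumerate(row)] for row in rows]


# Hypothetical rows: (outlook, humidity); None marks a deleted cell.
rows = [
    ["sunny", 80],
    ["sunny", None],
    [None, 70],
    ["rainy", 90],
]

filled = replace_missing(rows, numeric_columns={1})
print(filled)
```

The fill values are computed once from the values that are present, so every missing cell in a column receives the same replacement.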

4. REMOVE PERCENTAGE:
A filter that removes a given percentage of a dataset.
1. Choose unsupervised and then instance.
2. Select the “remove percentage” filter.
3. Set the percentage to “50.0”.
4. After the filter is applied, 50% of the instances are removed from the dataset.
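
The arithmetic behind this filter can be sketched as below. The sketch assumes that the removed instances are taken from the start of the dataset and that the cut-off is rounded to a whole number of instances; both are assumptions made for illustration, and the 14 placeholder instances are hypothetical.

```python
def remove_percentage(rows, percentage):
    """Drop the first `percentage` percent of the rows and return the remainder."""
    if not 0 <= percentage <= 100:
        raise ValueError("percentage must be between 0 and 100")
    cut = round(len(rows) * percentage / 100)
    return rows[cut:]


rows = list(range(1, 15))            # 14 placeholder instances, numbered 1..14
remaining = remove_percentage(rows, 50.0)
print(len(remaining), remaining)
```

With 14 instances and a percentage of 50.0, the cut-off is 7, so 7 instances remain.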

5. REMOVE FREQUENT VALUES:

Determines which values of an attribute are retained and filters the instances accordingly.
1. Choose unsupervised and then instance.
2. Select the “remove frequent values” filter.
3. Specify the attribute index as 2.
4. When Apply is clicked, the instances with the less frequent values are removed.
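
The idea of retaining only the most frequent values of an attribute can be sketched as follows. The function name, the parameter n_values (how many distinct values to retain) and the rows are hypothetical; the sketch filters on a nominal column at index 1 purely to keep the toy data small.

```python
from collections import Counter


def keep_most_frequent(rows, attribute_index_1_based, n_values=2):
    """Keep only the rows whose value in the chosen column is among the n most frequent."""
    col = attribute_index_1_based - 1
    counts = Counter(row[col] for row in rows)
    retained = {value for value, _ in counts.most_common(n_values)}
    return [row for row in rows if row[col] in retained]


# Hypothetical rows: (outlook, play). "sunny" occurs 3 times, "rainy" 2, "overcast" 1.
rows = [
    ["sunny", "no"],
    ["rainy", "yes"],
    ["sunny", "yes"],
    ["overcast", "yes"],
    ["rainy", "no"],
    ["sunny", "no"],
]

kept = keep_most_frequent(rows, attribute_index_1_based=1, n_values=2)
print(kept)
```

Here the single "overcast" instance is dropped because its value is the least frequent of the three.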
OUTPUT:
Dataset: weather.numeric.arff

Applying Remove Filter:

Applying Remove With Values Filter:

Applying Replace Missing Values Filter:

Applying Remove Percentage Filter:

Applying Remove Frequent Values Filter:

RAPIDMINER:

Design View
Preprocessing: Replace missing values

1. Load the Labor-Negotiations data set from the Samples folder.

2. Drag and drop the Replace Missing Values operator into the process. It applies the replacement to all
attributes in the dataset that have at least one missing value.
3. Click the play button and view the output.

Dataset with missing values:

Dataset after filling missing values:

UCI Repository:

Sample dataset:
Conclusion: The various features of the Weka tool, the RapidMiner tool and the UCI Repository datasets
have been explored.
