0% found this document useful (0 votes)

113 views14 pages

Iot Domain Analyst-Ece3502: Data Analytics Using Weka For Water Quality Related Data

1) The document describes an experiment using the Weka machine learning software to analyze a water quality dataset. 2) Weka was used to apply various classification techniques like J48 decision trees, random forests, linear regression, and Gaussian processes to predict water quality. 3) The random tree classification achieved the highest accuracy of 91.61% while random forest was the lowest at 57.27% based on 10-fold cross validation.

Uploaded by

sai manikanta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

113 views14 pages

Iot Domain Analyst-Ece3502: Data Analytics Using Weka For Water Quality Related Data

Uploaded by

sai manikanta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 14

Name: M.

SAIMANIKANTA Reg no: 18BEC1314

SLOT: L37+L38 FACLITY: - DR VELMATHI G

EXPERIMENT NUMBER: - 5 DATE: - 04/03/2021

IOT DOMAIN ANALYST- ECE3502

AIM: Data Analytics using Weka for water quality related data.

THEORY:
WEKA

The workbench for machine learning

Weka is tried and tested open source machine learning software that can be
accessed through a graphical user interface, standard terminal applications, or a
Java API. It is widely used for teaching, research, and industrial applications,
contains a plethora of built-in tools for standard machine learning tasks, and
additionally gives transparent access to well-known toolboxes such as scikit-
learn, R, and Deeplearning4j.
WEKA: Weka (Waikato Environment for Knowledge Analysis) is a popular suite of
machine learning software written in Java, developed at the University of
Waikato, New Zealand. Weka is free software available under the GNU General
Public License. The Weka workbench contains a collection of visualization tools
and algorithms for data analysis and predictive modeling, together with graphical
user interfaces for easy access to this functionality
Weka is a collection of machine learning algorithms for solving real-world data
mining problems. It is written in Java and runs on almost any platform. The
algorithms can either be applied directly to a dataset or called from your own Java
code
The original non-Java version of Weka was a TCL/TK front-end to (mostly third-
party) modeling algorithms implemented in other programming languages, plus
data preprocessing utilities in C, and a Makefile-based system for running
machine learning experiments. This original version was primarily designed as a
tool for analyzing data from agricultural domains, but the more recent fully Java-
based version (Weka 3), for which development started in 1997, is now used in
many different application areas, in particular for educational purposes and
research.
Advantages of Weka include:
 Free availability under the GNU General Public License
 Portability, since it is fully implemented in the Java programming language
and thus runs on almost any modern computing platform
 A comprehensive collection of data preprocessing and modeling techniques
 Ease of use due to its graphical user interfaces
DESIGN AND PROCEDURE:
1) Download and install weka software in laptop and open it.

2) Open a new explorer in weka

3) Now download dataset from interent.

4) Here we take a csv file

In that we make the changes like

We deleted the unwanted rows form the data set

We can exchange the rows form the dataset etc.

5) Now open the data set csv file through notepad.

And make them in the correct format for weka.

(This is the basic format)

% 1. Title: Iris Plants Database

@RELATION iris
% 2. Sources:

@ATTRIBUTE sepallength NUMERIC

@ATTRIBUTE sepalwidth NUMERIC
@ATTRIBUTE petallength NUMERIC
@ATTRIBUTE petalwidth NUMERIC
@ATTRIBUTE class {Iris-setosa,Iris-versicolor,Iris-virginica}

The Data of the ARFF file looks like the following:

@DATA
5.1,3.5,1.4,0.2,Iris-setosa
4.9,3.0,1.4,0.2,Iris-setosa
4.7,3.2,1.3,0.2,Iris-setosa
4.6,3.1,1.5,0.2,Iris-setosa
5.0,3.6,1.4,0.2,Iris-setosa
5.4,3.9,1.7,0.4,Iris-setosa
4.6,3.4,1.4,0.3,Iris-setosa
5.0,3.4,1.5,0.2,Iris-setosa
4.4,2.9,1.4,0.2,Iris-setosa
4.9,3.1,1.5,0.1,Iris-setosa

After that save file the .arff format.

6) In the following graph is showing the station code of water quality in that the
range is from 11-3330 and the mean is 2052 and stddev is 755
7) The following graph is showing the temperature of the water at different areas
of water dataset in that the range is from 0-33.8 and the mean is 25 and stddev is
4.2
8) In the following graph is showing the ph of water, in water dataset in that the
range is from 6.3-17.7 and the mean is 7.7 and stddev is 0.68
9) In the following graph is showing the nitratre_n of water, in water dataset in
that the range is from 0-45.4 and the mean is 1.3 and stddev is 2.8 and the mode
of nitrate_n is 0
OUTPUT:-
1) The following is showing the trees.j48 logic with 10 cross validation
classification used to study the data set for machine learning and to predict the
quality of water. The accuracy of this classification is 64.34% which is moderate
good.
2) The following is showing the trees.randomtree logic with 10 cross validation
classification used to study the data set for machine learning and to predict the
quality of water. The accuracy of this classification is 91.61% which is good.
3) The following is showing the random forest tree with 10 cross validation
classification used to study the data set for machine learning and to predict the
quality of water. The accuracy of this classification is 57.27% which is not good.
4) The following is showing the function linear regression with 10 cross validation
classification used to study the data set for machine learning and to predict the
quality of water. The accuracy of this classification is 79.48% which is moderate
good.
5) The following is showing the function Gaussian process with 10 cross validation
classification used to study the data set for machine learning and to predict the
quality of water. The accuracy of this classification is 85.46% which is good.

Result:
The following dataset of water analysis is analyzed and different classification
techniques are studied for machine learning process with the help of Weka.

TCPB Workflow English
No ratings yet
TCPB Workflow English
168 pages
Model Driven Engineering (MDE) : ITC-708 by Dr. Mir Sajjad Hussain Talpur Dated: 08-2-2021
50% (2)
Model Driven Engineering (MDE) : ITC-708 by Dr. Mir Sajjad Hussain Talpur Dated: 08-2-2021
17 pages
Computer Controlled Devices For Agri-Input Management
No ratings yet
Computer Controlled Devices For Agri-Input Management
9 pages
Simp Rewards: Downloaded From
82% (11)
Simp Rewards: Downloaded From
21 pages
DWDM Manual-1
No ratings yet
DWDM Manual-1
96 pages
CS-703 (B) Data Warehousing and Data Mining Lab
No ratings yet
CS-703 (B) Data Warehousing and Data Mining Lab
50 pages
Software Requirements Specification For Library Management System
No ratings yet
Software Requirements Specification For Library Management System
9 pages
Module 7
No ratings yet
Module 7
72 pages
Dinesh DM
No ratings yet
Dinesh DM
34 pages
JFo Section 8
100% (1)
JFo Section 8
3 pages
Data Warehousing Lab Excercise
No ratings yet
Data Warehousing Lab Excercise
45 pages
Labview Programming Reference Manual 7-30-2024-9001-11630
No ratings yet
Labview Programming Reference Manual 7-30-2024-9001-11630
2,630 pages
Practical DWDM
No ratings yet
Practical DWDM
32 pages
Empowerment Technology: Guided Learning Activity Kit
100% (3)
Empowerment Technology: Guided Learning Activity Kit
16 pages
CFFD Documentation
No ratings yet
CFFD Documentation
91 pages
Rintro Wekacomplete
No ratings yet
Rintro Wekacomplete
135 pages
T12 Se
No ratings yet
T12 Se
11 pages
DM Lab Material
No ratings yet
DM Lab Material
88 pages
Module 4 PPT - Part 1
No ratings yet
Module 4 PPT - Part 1
90 pages
Module 1
No ratings yet
Module 1
97 pages
Module 2 PPT - Part 1
No ratings yet
Module 2 PPT - Part 1
84 pages
AI-43 Data Mining
No ratings yet
AI-43 Data Mining
96 pages
Data Warehousing Lab Record Final
No ratings yet
Data Warehousing Lab Record Final
45 pages
DMDW LAB NEW - Merged
No ratings yet
DMDW LAB NEW - Merged
53 pages
Experiment 1 Aim:: Introduction To ML Lab With Tools (Hands On WEKA On Data Set (Iris - Arff) ) - (A) Start Weka
No ratings yet
Experiment 1 Aim:: Introduction To ML Lab With Tools (Hands On WEKA On Data Set (Iris - Arff) ) - (A) Start Weka
55 pages
DMDV 210
No ratings yet
DMDV 210
61 pages
Lab Updated - Merged
No ratings yet
Lab Updated - Merged
49 pages
DA LabFile
No ratings yet
DA LabFile
63 pages
DWDM File-Final Ver3.pdf 20241230 172003 0000
No ratings yet
DWDM File-Final Ver3.pdf 20241230 172003 0000
54 pages
DWDM File
No ratings yet
DWDM File
26 pages
DMDV 210
No ratings yet
DMDV 210
63 pages
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
No ratings yet
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
31 pages
Data Warehouse Final Record
No ratings yet
Data Warehouse Final Record
55 pages
DWM1
No ratings yet
DWM1
19 pages
DWH Manual Merged
No ratings yet
DWH Manual Merged
47 pages
Vijay DMPM
No ratings yet
Vijay DMPM
23 pages
VME Fundementals
No ratings yet
VME Fundementals
48 pages
Data Warehouse Lab Manual
No ratings yet
Data Warehouse Lab Manual
60 pages
WEKA
No ratings yet
WEKA
50 pages
Weka Tutorial
No ratings yet
Weka Tutorial
8 pages
JSPM'S Bhivarabai Sawant Institute of Technology & Research: Mini Project Report On
No ratings yet
JSPM'S Bhivarabai Sawant Institute of Technology & Research: Mini Project Report On
33 pages
Unit-4 Os
No ratings yet
Unit-4 Os
49 pages
9348 11568 1 PB Published Paper
No ratings yet
9348 11568 1 PB Published Paper
12 pages
Data Warehousing - To Write
No ratings yet
Data Warehousing - To Write
23 pages
Unit 5: Data Normalization
No ratings yet
Unit 5: Data Normalization
27 pages
Komal DWDM 1to5
No ratings yet
Komal DWDM 1to5
61 pages
Iot Domain Analyst-Ece3502: Data Analytics Using Weka For Weather Land Related Data
No ratings yet
Iot Domain Analyst-Ece3502: Data Analytics Using Weka For Weather Land Related Data
21 pages
Data Warehousing Lab Manual
No ratings yet
Data Warehousing Lab Manual
36 pages
Data Warehousing
No ratings yet
Data Warehousing
54 pages
ML Assignment 2
No ratings yet
ML Assignment 2
25 pages
DMW LabFile 0901CS243D11 Swastik
No ratings yet
DMW LabFile 0901CS243D11 Swastik
25 pages
Iot Domain Analyst-Ece3502
No ratings yet
Iot Domain Analyst-Ece3502
15 pages
Flood Prediction Analysis
No ratings yet
Flood Prediction Analysis
42 pages
Data Warehousing Record
No ratings yet
Data Warehousing Record
26 pages
Module 3 PPT - Part1
No ratings yet
Module 3 PPT - Part1
16 pages
Datawarehousing Lab Manual
No ratings yet
Datawarehousing Lab Manual
22 pages
Data Minig Lab File
No ratings yet
Data Minig Lab File
25 pages
DWDM Print
No ratings yet
DWDM Print
20 pages
OS Journal
No ratings yet
OS Journal
28 pages
Data Werehousing Lab Manual
No ratings yet
Data Werehousing Lab Manual
63 pages
Maximising Operational Uptime: A Strategic Approach To Mitigate Unplanned Machine Downtime and Boost Productivity Using Machine Learning Techniques
No ratings yet
Maximising Operational Uptime: A Strategic Approach To Mitigate Unplanned Machine Downtime and Boost Productivity Using Machine Learning Techniques
13 pages
Data Warehousing Record
No ratings yet
Data Warehousing Record
30 pages
Software Requirements Specification: Notepad
No ratings yet
Software Requirements Specification: Notepad
10 pages
Stack ADT Java
No ratings yet
Stack ADT Java
10 pages
Data Mining Unit 5
No ratings yet
Data Mining Unit 5
12 pages
Checkfinal 123
No ratings yet
Checkfinal 123
18 pages
VideoLogic Multiple Region Headers Example Uses
No ratings yet
VideoLogic Multiple Region Headers Example Uses
9 pages
Mobile Computing Lab Manual For IT
No ratings yet
Mobile Computing Lab Manual For IT
11 pages
Iot Domain Analyst-Ece3502: Data Analytics Using Knime Software
No ratings yet
Iot Domain Analyst-Ece3502: Data Analytics Using Knime Software
7 pages
X-Ways Forensics White Paper
No ratings yet
X-Ways Forensics White Paper
7 pages
Case Study MIS of Deloitte
No ratings yet
Case Study MIS of Deloitte
7 pages
Week 1
No ratings yet
Week 1
4 pages
5 Control Panel Projects Instr Exercise
No ratings yet
5 Control Panel Projects Instr Exercise
6 pages
Accounting Information System For Decision Making
No ratings yet
Accounting Information System For Decision Making
3 pages
REad Only ADME
No ratings yet
REad Only ADME
5 pages
Gravity Workout
No ratings yet
Gravity Workout
4 pages
Statement of Purpose Northumbria University
No ratings yet
Statement of Purpose Northumbria University
2 pages
Mid Consensys
No ratings yet
Mid Consensys
4 pages
Kumpletong Sahog NG Adobo
No ratings yet
Kumpletong Sahog NG Adobo
4 pages
C/C++ Programming Interview Questions and Answers: by Satish Shetty, July 14th, 2004
No ratings yet
C/C++ Programming Interview Questions and Answers: by Satish Shetty, July 14th, 2004
16 pages
How To Withdraw Bitcoins
No ratings yet
How To Withdraw Bitcoins
1 page
Carpet Size: Area Length×Width
No ratings yet
Carpet Size: Area Length×Width
3 pages
Indian Institute of Management Bangalore: PGP 4 Term 2019-20
No ratings yet
Indian Institute of Management Bangalore: PGP 4 Term 2019-20
3 pages
ML Lab External QP
No ratings yet
ML Lab External QP
2 pages
JavaScript Algorithms Step by Step: A Practical Guide with Examples
From Everand
JavaScript Algorithms Step by Step: A Practical Guide with Examples
William E. Clark
No ratings yet
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
SystemTap Essentials: Definitive Reference for Developers and Engineers
From Everand
SystemTap Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Lexicon of Programming Terminology: Lexicon of Tech and Business, #17
From Everand
Lexicon of Programming Terminology: Lexicon of Tech and Business, #17
Mustafa Al-Dori
5/5 (1)
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
Machine Learning with Python: A Comprehensive Guide with a Practical Example
From Everand
Machine Learning with Python: A Comprehensive Guide with a Practical Example
MARTIN NEEL
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Oracle Database Administration Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
From Everand
Oracle Database Administration Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Vibrant Publishers
5/5 (1)
Dataflow and Reactive Programming Systems
From Everand
Dataflow and Reactive Programming Systems
Matt Carkci
No ratings yet
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
Java / J2EE Interview Questions You'll Most Likely Be Asked
From Everand
Java / J2EE Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Iot Domain Analyst-Ece3502: Data Analytics Using Weka For Water Quality Related Data

Uploaded by

Iot Domain Analyst-Ece3502: Data Analytics Using Weka For Water Quality Related Data

Uploaded by

Name: M.

SAIMANIKANTA Reg no: 18BEC1314

EXPERIMENT NUMBER: - 5 DATE: - 04/03/2021

IOT DOMAIN ANALYST- ECE3502

The workbench for machine learning

2) Open a new explorer in weka

4) Here we take a csv file

In that we make the changes like

We deleted the unwanted rows form the data set

We can exchange the rows form the dataset etc.

5) Now open the data set csv file through notepad.

And make them in the correct format for weka.

(This is the basic format)

% 1. Title: Iris Plants Database

@ATTRIBUTE sepallength NUMERIC

The Data of the ARFF file looks like the following:

After that save file the .arff format.

You might also like