Department of Computer Engineering: Experiment No.3

This document describes an experiment using the WEKA data mining tool to perform data preprocessing, classification, clustering, and association mining on datasets. It provides an overview of the WEKA workflow, which includes preprocessing raw data, selecting a machine learning task (classify, cluster, associate), choosing an algorithm, running it on the dataset, and visualizing/evaluating the outputs. The student is to explore WEKA's preprocessing tools using default datasets on weather conditions and diabetes diagnosis.


Department of Computer Engineering

Experiment No.3

Semester T.E. Semester V - Computer Engineering


Subject Data warehousing & Mining Lab
Subject Professor In-charge Prof. Prita Patil
Assisting Teachers Prof. Prita Patil
Laboratory MS-Team

Student Name Pratik Haldankar


Roll Number 20102A2006
Grade and Subject Teacher’s Signature

Experiment Number 3

Experiment Title Perform data pre-processing tasks and demonstrate classification, clustering, and association algorithms on data sets using a data mining tool (WEKA, R tool, XLMiner, etc.)

Resources / Apparatus Required
Hardware: Computer system
Software: WEKA

Theory
WEKA is open-source software that provides tools for data
preprocessing, implementations of several machine learning
algorithms, and visualization tools, so that you can develop machine
learning techniques and apply them to real-world data mining
problems. What WEKA offers is summarized in the following
diagram:

Following the flow of the diagram from its starting point, you can
see that there are several stages in dealing with Big Data to
make it suitable for machine learning:

First, you will start with the raw data collected from the field. This
data may contain several null values and irrelevant fields. You use
the data preprocessing tools provided in WEKA to cleanse the data.
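
The cleansing step can be sketched outside WEKA in a few lines of Python. This is only an illustration under stated assumptions, not WEKA's implementation: the rows, the column names, and the `clean` helper are made up for this example. It drops a field judged irrelevant and fills each missing value with the column mean (numeric columns) or the most frequent value (nominal columns), which is similar in spirit to what WEKA's Remove and ReplaceMissingValues filters do.

```python
from collections import Counter
from statistics import mean

# Hypothetical raw records; None marks a missing (null) value.
rows = [
    {"id": 1, "outlook": "sunny", "temperature": 30.0, "play": "no"},
    {"id": 2, "outlook": None, "temperature": 22.0, "play": "yes"},
    {"id": 3, "outlook": "rainy", "temperature": None, "play": "yes"},
    {"id": 4, "outlook": "rainy", "temperature": 20.0, "play": "no"},
]

def clean(records, drop=()):
    """Drop the named fields, then fill missing values column by column."""
    kept = [{k: v for k, v in r.items() if k not in drop} for r in records]
    for col in kept[0]:
        present = [r[col] for r in kept if r[col] is not None]
        if all(isinstance(v, (int, float)) for v in present):
            fill = mean(present)  # numeric column: use the mean
        else:
            fill = Counter(present).most_common(1)[0][0]  # nominal column: use the mode
        for r in kept:
            if r[col] is None:
                r[col] = fill
    return kept

# "id" is treated as an irrelevant field for this example.
cleaned = clean(rows, drop=("id",))
for r in cleaned:
    print(r)
```

In the WEKA Explorer the same kind of cleanup is done from the Preprocess tab by choosing and applying filters rather than by writing code.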

Then, you would save the preprocessed data in your local storage
for applying ML algorithms.

Next, depending on the kind of ML model that you are trying to
develop, you would select one of the options such as Classify,
Cluster, or Associate. The attribute selection option (the Select
attributes tab) allows the automatic selection of features to create
a reduced dataset.
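
One common way to select features automatically is to rank attributes by information gain with respect to the class. The sketch below shows that idea in plain Python. The rows and attribute names are made up, and the `entropy` and `info_gain` helpers are written for this example. It is roughly the idea behind an information-gain ranking such as WEKA's InfoGainAttributeEval evaluator, but it is not WEKA's code, and WEKA offers other evaluators and search methods as well.

```python
from collections import Counter
from math import log2

# Made-up rows in the style of a weather dataset; the last column is the class.
ATTRS = ["outlook", "windy", "play"]
DATA = [
    ("sunny", "no", "no"),
    ("sunny", "yes", "no"),
    ("overcast", "no", "yes"),
    ("overcast", "yes", "yes"),
    ("rainy", "no", "yes"),
    ("rainy", "yes", "no"),
]

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

def info_gain(data, col):
    """Class entropy minus the weighted entropy after splitting on column `col`."""
    labels = [row[-1] for row in data]
    remainder = 0.0
    for value in {row[col] for row in data}:
        subset = [row[-1] for row in data if row[col] == value]
        remainder += len(subset) / len(data) * entropy(subset)
    return entropy(labels) - remainder

# Score every non-class attribute and rank from most to least informative.
gains = {ATTRS[i]: info_gain(DATA, i) for i in range(len(ATTRS) - 1)}
ranked = sorted(gains, key=gains.get, reverse=True)
print(ranked)
```

A reduced dataset would then keep only the top-ranked attributes plus the class.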

Note that under each category, WEKA provides the implementation
of several algorithms. You would select an algorithm of your choice,
set the desired parameters and run it on the dataset.

Then, WEKA gives you the statistical output of the model
processing. It also provides a visualization tool to inspect the data.

Several models can be applied to the same dataset. You can
then compare the outputs of the different models and select the one
that best meets your purpose.
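
Comparing models on the same dataset can be sketched as follows. The data is made up, and `zero_r` and `one_r` are simplified stand-ins written for this example. They are named after WEKA's ZeroR and OneR rule learners but are not WEKA's implementations (in particular, the attribute used by `one_r` is fixed by hand here). Accuracy is measured on the training rows only to keep the sketch short; in practice a separate test set or cross-validation gives a fairer comparison.

```python
from collections import Counter, defaultdict

# Made-up rows: (outlook, windy, play); the last column is the class.
DATA = [
    ("sunny", "no", "no"),
    ("sunny", "yes", "no"),
    ("overcast", "no", "yes"),
    ("overcast", "yes", "yes"),
    ("rainy", "no", "yes"),
    ("rainy", "yes", "no"),
]

def zero_r(data):
    """Baseline: always predict the majority class."""
    majority = Counter(row[-1] for row in data).most_common(1)[0][0]
    return lambda row: majority

def one_r(data, col):
    """Single-attribute rule: predict the majority class for each value of column `col`."""
    by_value = defaultdict(Counter)
    for row in data:
        by_value[row[col]][row[-1]] += 1
    rule = {value: counts.most_common(1)[0][0] for value, counts in by_value.items()}
    return lambda row: rule[row[col]]

def accuracy(model, data):
    """Fraction of rows whose predicted class matches the actual class."""
    return sum(model(row) == row[-1] for row in data) / len(data)

# Run both models on the same dataset and keep the better-scoring one.
models = {"ZeroR-style": zero_r(DATA), "OneR-style on outlook": one_r(DATA, 0)}
scores = {name: accuracy(model, DATA) for name, model in models.items()}
best = max(scores, key=scores.get)
print(scores, best)
```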

Thus, the use of WEKA results in a quicker development of machine
learning models on the whole.

Case Study Description Case Study 1: Weather Conditions

Case Study 2: Diabetes Condition



Conclusion
Explored the preprocessing features of the WEKA tool using the default
datasets available in WEKA.
