Data Mining Assignment: Problem Definition

The document discusses using the WEKA data mining tool to analyze a diabetes dataset containing medical information for Pima Indian females. It loads the diabetes dataset into WEKA, visualizes the data with histograms, and generates a decision tree classifier using the J48 algorithm to predict diabetes. Different tree algorithms are explored to find the most accurate structure for classification.

Uploaded by

Neeraj Khanna
Copyright
© Attribution Non-Commercial (BY-NC)

DATA MINING ASSIGNMENT

Problem Definition :- Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage. ADAP is an adaptive learning routine that generates and executes digital analogs of perceptron-like devices.

Tool :- WEKA 3.0 (WEKA uses the .arff file format for its Explorer toolkit, and we intend to use the Explorer.)
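To make the .arff layout concrete, here is a minimal sketch in Python of how such a file is organised: a header of @relation and @attribute lines, followed by comma-separated rows after @data. The sample text, the attribute names and the parse_arff helper are assumptions for illustration only; the rows are not copied from the diabetes data set, and the sketch ignores quoted names, sparse rows and other parts of the format.

```python
# Tiny inline sample in ARFF layout. Names and values are illustrative
# placeholders, not rows taken from the real diabetes file.
SAMPLE_ARFF = """\
% Comment lines start with a percent sign.
@relation sample
@attribute plas numeric
@attribute age numeric
@attribute class {tested_negative,tested_positive}
@data
148,50,tested_positive
85,31,tested_negative
"""

NUMERIC_TYPES = {"numeric", "real", "integer"}


def parse_arff(text):
    """Parse a simple dense ARFF text into (relation, attributes, rows).

    Hypothetical helper: handles only unquoted names and dense data rows.
    """
    relation, attributes, rows = None, [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and comments
        if in_data:
            values = [v.strip() for v in line.split(",")]
            row = []
            for (name, is_numeric), value in zip(attributes, values):
                if value == "?":
                    row.append(None)  # "?" marks a missing value
                elif is_numeric:
                    row.append(float(value))
                else:
                    row.append(value)  # nominal value kept as text
            rows.append(row)
        elif line.lower().startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif line.lower().startswith("@attribute"):
            _, name, kind = line.split(None, 2)
            attributes.append((name, kind.lower() in NUMERIC_TYPES))
        elif line.lower().startswith("@data"):
            in_data = True  # everything after this line is a data row
    return relation, attributes, rows


relation, attributes, rows = parse_arff(SAMPLE_ARFF)
print(relation, [name for name, _ in attributes], rows)
```

WEKA reads the file itself, so a parser like this is not needed for the assignment; it only shows what the Explorer expects to find in the file.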

DATA SET :- http://storm.cis.fordham.edu/~gweiss/data-mining/weka-data/diabetes.arff


After feeding the above data set into the WEKA tool, we get the following GUI, and we select our file, named sample.arff.

After loading the file, we get the following screen to proceed with our work.

When we visualize all the attributes of the data set, we get the following histograms.
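As a rough idea of what one such histogram computes, the sketch below counts the values of a single numeric attribute in equal-width bins. The histogram helper, the bin count and the age values are assumptions made up for illustration; WEKA chooses its own binning and draws the result graphically.

```python
def histogram(values, bins=4):
    """Count values in `bins` equal-width intervals spanning min..max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # avoid zero width if all values match
    counts = [0] * bins
    for v in values:
        # The maximum value would land one past the end, so clamp it.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts


# Made-up ages (all at least 21, as in the problem definition).
ages = [21, 22, 25, 29, 31, 33, 41, 50, 52, 60]
edges, counts = histogram(ages, bins=4)

# Simple text rendering: one bar per bin.
for left, right, count in zip(edges, edges[1:], counts):
    print(f"{left:5.1f} to {right:5.1f} | {'#' * count}")
```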

Now, when we run Classify on the data set with J48, we get the following decision tree.
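J48 is WEKA's implementation of the C4.5 decision tree algorithm, which picks splits using an entropy-based measure. The sketch below is not J48 itself: it only shows the core idea of choosing a threshold on one numeric attribute by information gain, and it leaves out gain ratio, pruning and recursion. The function names and the glucose values and labels are assumptions made up for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def best_threshold(values, labels):
    """Return (gain, threshold) for the best binary split value <= threshold."""
    pairs = sorted(zip(values, labels))
    base = entropy(labels)
    best_gain, best_split = 0.0, None
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold fits between two equal values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [label for _, label in pairs[:i]]
        right = [label for _, label in pairs[i:]]
        # Weighted entropy that remains after the split.
        remainder = (len(left) * entropy(left)
                     + len(right) * entropy(right)) / len(pairs)
        gain = base - remainder
        if gain > best_gain:
            best_gain, best_split = gain, threshold
    return best_gain, best_split


# Made-up plasma glucose readings with made-up class labels.
glucose = [85, 89, 95, 110, 137, 148, 166, 183]
labels = ["neg", "neg", "neg", "neg", "pos", "pos", "pos", "pos"]
gain, threshold = best_threshold(glucose, labels)
print(f"split at glucose <= {threshold} (information gain {gain:.3f})")
```

A full tree learner would repeat this search over every attribute, split the rows on the winner, and recurse on each side.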

We can use different tree-forming algorithms to get the desired tree structures.
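One simple way to choose among candidate trees is to score each on the same labelled rows and keep the most accurate. The sketch below does this for two one-level trees (decision stumps). The rows, the attribute names, the thresholds and the accuracy helper are all assumptions made up for illustration; in WEKA the Classify panel reports accuracy for whichever algorithm is selected.

```python
def accuracy(predict, rows):
    """Fraction of rows whose predicted label matches the true label."""
    correct = sum(1 for features, label in rows if predict(features) == label)
    return correct / len(rows)


# Made-up evaluation rows: (attribute values, true label).
rows = [
    ({"plas": 85, "age": 31}, "neg"),
    ({"plas": 148, "age": 50}, "pos"),
    ({"plas": 183, "age": 29}, "pos"),
    ({"plas": 89, "age": 21}, "neg"),
    ({"plas": 116, "age": 30}, "neg"),
    ({"plas": 137, "age": 33}, "pos"),
]

# Two hypothetical one-level trees, each testing a single attribute.
candidates = {
    "stump on plas": lambda f: "pos" if f["plas"] > 125 else "neg",
    "stump on age": lambda f: "pos" if f["age"] > 31 else "neg",
}

scores = {name: accuracy(rule, rows) for name, rule in candidates.items()}
best = max(scores, key=scores.get)
for name, score in scores.items():
    print(f"{name}: {score:.2f}")
print("most accurate:", best)
```

Scoring on the same rows used to build a tree tends to overstate accuracy, so in practice the comparison is better made on held-out data or with cross-validation.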

The above assignment is submitted by:

Rajdeep Sabui, M110377CA
Neeraj Khanna, M110361CA
Vaibhav Prasad, M110358CA
