R Programming On Abalone Dataset

1) The document discusses using the abalone dataset to predict the age of abalone from characteristics like height, diameter, length, and weight measurements using R. 2) The abalone dataset contains 4,177 observations with 9 variables including sex, length, diameter, height, weight, and age. 3) Various analytical methods are applied to the data including importing into R, dividing into training and test sets, calculating mean age, and creating graphs to analyze relationships between variables like shell weight and age.

Uploaded by

anchal tanwar

R Programming on Abalone Dataset
Contents

1 About Abalone
2 Dataset
3 Secondary Data
4 About software R
5 Analytical Methods
6 Outcome

Introduction

Abalone

Abalones are endangered marine snails found in cold coastal waters worldwide, distributed mainly off the coasts of New Zealand, South Africa, Australia, Western North America, and Japan. They are considered a delicacy and a highly nutritious food, and are extensively consumed in certain parts of Latin America, France, New Zealand, Southeast Asia, China, Vietnam, Japan, and Korea.

They are also commercially farmed as a source of mother-of-pearl. The shells of abalone are used for decorative purposes owing to their iridescence. This makes abalone a highly sought-after and economically significant commodity.

Abalone Dataset

The original stated problem is estimating the age of an abalone, which can be determined from the number of rings in its shell. However, counting the rings in an abalone shell is an expensive method. One possible solution is therefore to predict the number of rings from characteristics such as height, diameter, length and weight measurements.

Dataset
•Total number of observations in dataset: 4177
•Total number of variables in dataset: 9
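The prediction task described above can be sketched as a small regression. This is only an illustration under stated assumptions: the deck does not name a model, so the linear model (`lm`) is a stand-in, the column names `Length`, `ShellWeight` and `Rings` are assumed, and the eight rows are invented so that the example runs without the real file.

```r
# Toy stand-in for the abalone data (values invented for illustration only)
abalone <- data.frame(
  Length      = c(0.30, 0.35, 0.40, 0.45, 0.50, 0.55, 0.60, 0.65),
  ShellWeight = c(0.06, 0.08, 0.13, 0.15, 0.22, 0.24, 0.33, 0.36),
  Rings       = c(6, 7, 8, 9, 10, 10, 12, 13)
)

# Assumption: a plain linear model; the deck does not say which model to use
fit <- lm(Rings ~ Length + ShellWeight, data = abalone)

# Predict the ring count of a hypothetical new abalone from its measurements
new_abalone     <- data.frame(Length = 0.48, ShellWeight = 0.20)
predicted_rings <- predict(fit, newdata = new_abalone)
print(predicted_rings)
```

With the real data, the same call would be fitted on all 4177 observations and could include the remaining size and weight columns as predictors.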

Metadata and Attribute information:
Name | Data Type | Measurement | Description
Sex | Categorical (factor) | M, F, and I (Infant) | Male, Female or Indeterminate (Infant). Abalone sex is not determined at birth but only once they mature a little
Length | Continuous | mm | Longest shell measurement
Diameter | Continuous | mm | Perpendicular to length
Height | Continuous | mm | Height of abalone with meat in shell
Whole weight | Continuous | grams | Weight of the whole abalone
Shucked weight | Continuous | grams | Weight of just the meat
Viscera weight | Continuous | grams | Gut weight (after bleeding)
Shell weight | Continuous | grams | Weight of shell after being dried
Age | Integer | | In years

Secondary Data

Secondary data is data gathered by a party other than the user. Common sources of secondary data for social science include statements, data collected by government agencies, organisational documents, and data that was originally collected for other research objectives.

About R

R is a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, data mining surveys, and studies of scholarly literature databases show substantial increases in its popularity.

R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering) and graphical techniques, and is highly extensible. The S language is often the vehicle of choice for research in statistical methodology, and R provides an Open Source route to participation in that activity.
Analytical Methods Used
1. Import the Abalone data in R.
2. What are the dimensions of the dataset?
3. Create a data frame from the imported dataset containing only the length, diameter, height, weight and age columns.
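Steps 1 to 3 can be sketched as follows. Several things here are assumptions rather than the deck's own code: the helper name `read_abalone`, the column names (the raw UCI file has no header row), the use of `WholeWeight` for "weight" and of `Rings` as the age column, and the three invented rows written to a temporary file so that the sketch runs without the real download.

```r
# Assumed column names; the raw abalone file ships without a header row
abalone_cols <- c("Sex", "Length", "Diameter", "Height", "WholeWeight",
                  "ShuckedWeight", "VisceraWeight", "ShellWeight", "Rings")

# Hypothetical helper: read a header-less, comma-separated abalone file
read_abalone <- function(path) {
  read.csv(path, header = FALSE, col.names = abalone_cols,
           stringsAsFactors = TRUE)
}

# Three invented rows in the same layout, so the example runs offline
demo_path <- tempfile(fileext = ".data")
writeLines(c("M,0.50,0.40,0.12,0.60,0.25,0.12,0.18,11",
             "F,0.55,0.43,0.14,0.75,0.30,0.15,0.22,12",
             "I,0.30,0.22,0.07,0.15,0.06,0.03,0.05,6"), demo_path)

# Step 1: import the data
abalone <- read_abalone(demo_path)

# Step 2: dimensions as (rows, columns)
print(dim(abalone))

# Step 3: keep only the length, diameter, height, weight and age columns
abalone_small <- abalone[, c("Length", "Diameter", "Height",
                             "WholeWeight", "Rings")]
str(abalone_small)
```

Pointing `read_abalone()` at the full data file instead of `demo_path` should report the 4177 rows and 9 columns quoted in the deck.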

4. Divide the dataset into 2 parts using the sample function (Training part 80%, Testing 20%).
5. Calculate the mean age of the abalone and print it.
6. Create a bar chart for Shell weight vs. Age.
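A minimal sketch of steps 4 to 6, assuming a data frame with columns named `ShellWeight` and `Rings` (the ring count stands in for age here). The ten rows are invented so that the example is self-contained, and the fixed seed is only there to make the split repeatable.

```r
set.seed(42)  # fixed seed so the random split is reproducible

# Toy stand-in for the abalone data frame (values invented for illustration)
abalone <- data.frame(
  ShellWeight = c(0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.50),
  Rings       = c(5, 6, 7, 8, 9, 9, 10, 11, 12, 13)
)

# Step 4: sample() draws 80% of the row indices for training;
# the rows that were not drawn form the 20% test set
n         <- nrow(abalone)
train_idx <- sample(n, size = floor(0.8 * n))
train     <- abalone[train_idx, ]
test      <- abalone[-train_idx, ]

# Step 5: mean age (ring count used as the age column)
mean_age <- mean(abalone$Rings)
print(mean_age)

# Step 6: bar chart of the mean shell weight at each age value
weight_by_age <- tapply(abalone$ShellWeight, abalone$Rings, mean)
barplot(weight_by_age,
        xlab = "Age (rings)", ylab = "Mean shell weight (g)",
        main = "Shell weight vs. age")
```

Averaging shell weight per age value before plotting keeps the chart to one bar per age; plotting every observation as its own bar would be unreadable on the full dataset.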

7. What is the average age of male and female abalone? Show this in a graph as well.
8. Which attribute shows strong dependence for the prediction of abalone age? How can you tell?

Rings is the attribute that shows strong dependence for the prediction of abalone age.
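Steps 7 and 8 can be approached as sketched here. The column names and the six invented rows are assumptions, and ranking predictors by their Pearson correlation with `Rings` is one common way to judge dependence, not necessarily the method the authors used.

```r
# Toy stand-in for the abalone data (values invented for illustration)
abalone <- data.frame(
  Sex         = factor(c("M", "M", "F", "F", "I", "I")),
  Length      = c(0.55, 0.60, 0.52, 0.62, 0.30, 0.35),
  ShellWeight = c(0.25, 0.33, 0.22, 0.36, 0.06, 0.08),
  Rings       = c(10, 12, 9, 13, 5, 7)
)

# Step 7: mean age (ring count) per sex, then a bar chart of the result
age_by_sex <- aggregate(Rings ~ Sex, data = abalone, FUN = mean)
print(age_by_sex)
barplot(age_by_sex$Rings, names.arg = as.character(age_by_sex$Sex),
        xlab = "Sex", ylab = "Mean age (rings)",
        main = "Average age by sex")

# Step 8: correlation of every numeric predictor with the target Rings
num_cols        <- sapply(abalone, is.numeric)
cors            <- cor(abalone[, num_cols])[, "Rings"]
predictor_cors  <- sort(cors[names(cors) != "Rings"], decreasing = TRUE)
print(predictor_cors)
```

The predictor at the top of `predictor_cors` is the one most strongly (linearly) associated with the ring count in the data supplied.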

Outcome

•Total number of observations in dataset = 4177
•Total number of variables in dataset = 9
•Mean age of abalone = 9.933684 (apparently computed on the ring count)
•Average age of male abalone = 0.3658128 (as reported; this value equals 1528/4177, the share of male records, so it appears to be a proportion rather than an age)
•Average age of female abalone = 0.312904 (as reported; this value equals 1307/4177, the share of female records)

By observing the correlation between the target attribute Rings and the independent variables, we conclude that it is possible to build a model that predicts the target value as a function of the independent attributes. The weight of abalones varies in proportion to their size. There are no significant differences in size, weight or number of rings between male and female abalones. The infant group shows lower mean values of size, weight and number of rings. Weight and height vary with age until adulthood; in adult life, size and weight stop varying, and after 16.5 years (15 rings) these measurements are no longer correlated.
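The "16.5 years (15 rings)" figure follows the usual convention for this dataset that age in years is the ring count plus 1.5. A minimal sketch of that conversion, with `rings_to_age` as a hypothetical helper name:

```r
# Hypothetical helper: convert a ring count to age in years (rings + 1.5)
rings_to_age <- function(rings) {
  rings + 1.5
}

age_at_15_rings <- rings_to_age(15)        # 16.5, the figure quoted above
age_at_mean     <- rings_to_age(9.933684)  # the reported mean, read as a ring count
print(age_at_15_rings)
print(age_at_mean)
```

If the reported mean of 9.933684 is a ring count, the same conversion puts the mean age at roughly 11.4 years.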

Thank You
Group 7:
Kapil Dashora_20BSP1007
Anchal Tanwar_20BSP0243
Gopal Mantri_20BSP0793
Shweta Saraf_20BSP2389
Srishti Narang_20BSP2506
Atul Handuja_20BSP0463
