5.basic Statistics

This document discusses basic statistics concepts that are important in agriculture. It explains that variables can be either dependent or independent, and that discrete and continuous data can take different forms. It also covers topics like sampling techniques, frequency tables and graphs, measures of central tendency and dispersion, correlation and regression analysis, and the importance of statistical significance testing in research. Key points include the need to understand variables, types of data, how to collect unbiased samples, ways to visualize and describe data distributions, and establishing relationships between variables.

Uploaded by

Zamir Zainal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views43 pages

5.basic Statistics

Uploaded by

Zamir Zainal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 43

BASIC STATISTICS

Importance of Mathematics in Agriculture

• An understanding of statistics and the

fundamental of algebra is needed.
• Correlation, regression and predictive
modeling are needed to analyze and interpret
spatial data.
Independent and dependent variable
• Variable refers to factors or events that can
have different values or, in other words, they
vary.
• Ex; soil PH range from 5 to 8. This makes soil
pH variable
• A variable, if its actual value is unknown, is
represented by a letter such as X.
• Dependent variable is an event or factor that is
effected by, or is dependent on, other factors.
• An independent variable is a factor that does not
depend on other variable for its value.
• Crop yield is dependent upon many variables,
such as soil PH and soil moisture.
• Soil PH and soil moisture is independent variable
because yield does not change pH, although PH
may change yield.
Discrete vs. Continuous Data
• Discrete ; Variables that can only take on a finite
number of values are called "discrete variables.
People and machinery can be grouped as discrete
variable, or temperature rounded to the nearest
degree. 45.6◦C. can be rounded to 45 ○C.
Continues variable are those things that can have
an infinitive number of values between two whole
numbers.
Discrete and continuous data
Types of data
1. Nominal data
2. Ordinal data
3. Interval data
4. Ratio
1. Nominal Data
-Nominal data are those data that don’t have
numerical value.
- They may be colors, shapes, brands, or even
the number don’t have value and might as
well be words or letters.
- Example nominal value and is used as an
identifier without ranking or size.
- A person’s social security number also.
2. Ordinal Data
• Ordinal data are those data that
accommodate infinite sequences/ranks and to
classify sets with certain kinds
of order structures on them.

Example : Rank 1th…..100th…..

3. Interval data
• Interval data are those data not only provide a
ranked order, but also a specific scale of
measurement.
• Temperature is an example.
• Fahrenheit scale is interval data because it
provide an order as well as a consistent scale.
4. Ratio data
• Ratio are real values that can be compared to
each other and are not limited to scale.
• Ex: level of phosphorous in the soil is 44 ppm
or twice as other soil that has 22 ppm.
• 44 ⁰C is not twice 22 ⁰C.
• Sport team has rating 100th is not twice rating
of the team has rating 50th.
SAMPLING
• Data cannot be collected on every event or
object within a study area.
• The next best thing is to select or sample
some of the events or object to represent all
of them.
• This is called sampling.
• Statistical sampling requires
1. A large number of samples to be
representative of the population
2. Random sampling where each area has an
equal chance of being selected so the sample
is unbiased
3. Sample from the population or a
homogenous area to which the results will be
applied.
• Sampling techniques:
1. Grid tessalation
- It is typically used to identify a systematic
pattern for determining regular or irregular
sampling points, with a sample taken from each
grid cell.
2. Grid pointing sampling
The process is the same with grid tessalation,
but instead of assuming that entire cell has the
same nutrient value, the nutrient value is
applied is applied to the point at which the
sample was taken.
Number of samples
• The size of the grid cell can range from 1 to 10
acres depending on the variability and size of
the field.
• The greater the variability, the more samples
need to be taken and the smaller the grid cell
needs to be.
Unbiased samples
• Random sampling assures an unbiased sample.
• 3 method for assuring unbiased samples
1. Centre method
Takes a sample in the center of the grid cell.
2. Offset method
Creates a diamond pattern by taking the sample a certain distance
offset from center.
3. Technique of collection sample.
Standard procedures calls for taking at least 10 samples from various
location within a radius of 10 feet of the sampling point to create one
composite sample
Frequency tables and Graphs
• Frequency is the number of times a value
occurs.
• Frequency tables and graphs can help a
people visualize the data.
Statistical techniques
• Frequency graph provide a method for
visualizing data.
• Statistics are used to describe numerically
what the frequency graph or curve look like.
• We can determine the center of the curve
(central tendency) and the width of it
(dispersion).
• This is refereed to as descriptive statitics.
• Inferential statistics are used when estimating
data or making an inference about differences
between data sets.
• The frequency graph can be used to visualize
this comparisons, statistics can be used to
quantify the values.
Central tendency
• Mean, median and mode provide basic
information regarding central tendencies.
• Mean is the average of all the numbers.
• Median" is the "middle" value in the list of
numbers.
• Mode is the number that is repeated more
often than any other.
• Range is just the difference between the
largest and smallest values.
• Find the mean, median, mode, and range for the following list of values:
13, 18, 13, 14, 13, 16, 14, 21, 13

• The mean is the usual average, so:

(13 + 18 + 13 + 14 + 13 + 16 + 14 + 21 + 13) ÷ 9 = 15

• The median is the middle value, so I'll have to rewrite the list in order:
13, 13, 13, 13, 14, 14, 16, 18, 21
There are nine numbers in the list, so the middle one will be the (9 + 1) ÷ 2 = 10 ÷ 2 = 5th
number:
13, 13, 13, 13, 14, 14, 16, 18, 21, So the median is 14.

• The mode is the number that is repeated more often than any other, so 13 is the mode.

• The largest value in the list is 21, and the smallest is 13, so the range is 21 – 13 = 8.

Answer:

• mean: 15
median: 14
mode: 13
range: 8
Dispersion
• Beside a center of data, we also want to know how
widely it is dispersed or what range of data values is.
• Standard deviation (SD) shows how much variation
or dispersion exists from the average (mean), or
expected value.
• A low standard deviation indicates that the data points
tend to be very close to the mean; high standard
deviation indicates that the data points are spread out
over a large range of values.
Formula :

Mean : Mean = Sum of X values / N(Number of values)

Standard Deviation :

Population Standard Deviation :

Variance : Variance = s2
Correlation and regression
• Another purpose of statistical analysis is
measuring relationships, or trying to establish
if the value of one variable is related of
connected to a second variable.
• Correlation coefficient is a one way of
measuring the relationship between paired
variable.
• It tries to show that one variable is correlated
to the second one.
Simple linear regresion
• Example:
• Time studying of student with the final score.
• To find the answer, the amount of time that each
students spent studying would be paired with the
student’s score.
• Higher values of time----------higher test score (
meaning strong positive correlation)
• Higher values of times----lower test score ( stronng
negative correlation).
• If higher values of time ----some to higher score, and
some low score test ( no correlation)
• One problem with correlation coefficient is that it
only deals with two factors and does not take into
account the influence that other factors may
have.
• A multivariate regressions uses data sets that
have three or more independent attribute values.
• The result of multivariate regression is a formula
that being used to predict the dependent
variable based on one or more independent
variables.
Formula of multivariate regression:
𝑌 = 𝐴 + 𝐵𝑋1 + 𝐶𝑋2 + 𝐷𝑋3
Y = the dependent variable that we are trying to
estimate or predict
A = the intercept, which could be thought of as the
scale’s starting point. It represent the lowest value
from which the prediction of the dependent
variable will start from.
X(1...)= represents all of the independent variables
that we are using to describe, estimate, or predict
the dependent variable Y.
• Formula will results in a linear
relationships.
• Most relationships are not perfectly
linear.
• If we are create a plot of all points used
in the formula, they will not line up.
• The difference between the line and each
point is the error.
Test for significance
• Significance is a very important concept because
what may seen like difference to us, may not be a
statistically significant difference.
• An example:
• Two set of yield data values; one yield data from
a no-till field (156 bushels) and conventional
tilled field (166 bushels).
• It is difference? We need to find out if it statically
different.
• A test can be used if there is a statistical
difference. E.g using T-test.
Research
• Research makes significant use of statistics.
• The ability to prove and disprove a hypothesis is
based on the objectivity provided by statistics.
• The objectivity is based on valid data collection,
the use of statistics, and the replication and
control of independent variables.
• Data should be take in unbiased manner and
sample should be take accurately.
• A research projects done once, without
replication, has little validity.
THANK YOU

It0089 Finalreviewer
100% (1)
It0089 Finalreviewer
143 pages
Statistics
No ratings yet
Statistics
65 pages
Wa0014
No ratings yet
Wa0014
63 pages
Statistics 1A Lecture Notes Article
No ratings yet
Statistics 1A Lecture Notes Article
123 pages
Business Statistics
No ratings yet
Business Statistics
73 pages
MMW 0607
No ratings yet
MMW 0607
29 pages
Handout-A-Preliminaries (Advance Statistics)
No ratings yet
Handout-A-Preliminaries (Advance Statistics)
29 pages
Handout Electro1
No ratings yet
Handout Electro1
41 pages
Week 5A - Statistics Handout
No ratings yet
Week 5A - Statistics Handout
9 pages
Introduction & Basic Concepts in Statistics
100% (2)
Introduction & Basic Concepts in Statistics
36 pages
Statistical Methods
No ratings yet
Statistical Methods
43 pages
1483082741da Mod10 Q1 e Text
No ratings yet
1483082741da Mod10 Q1 e Text
12 pages
Bio Statistics
No ratings yet
Bio Statistics
55 pages
Stats & HD Reviewer Prelims
No ratings yet
Stats & HD Reviewer Prelims
15 pages
Data Analysis:: Quantitative and Qualitative
No ratings yet
Data Analysis:: Quantitative and Qualitative
73 pages
What Is Statistics?: "Statistics Is A Way To Get Information From Data"
No ratings yet
What Is Statistics?: "Statistics Is A Way To Get Information From Data"
220 pages
Basic Concepts
No ratings yet
Basic Concepts
105 pages
Tutoring Session 2023 - Statistics For Business
No ratings yet
Tutoring Session 2023 - Statistics For Business
65 pages
It0089 Finalreviewer
No ratings yet
It0089 Finalreviewer
143 pages
STATS Lesson 1 A
No ratings yet
STATS Lesson 1 A
2 pages
Chapter One Probability and Statistics
No ratings yet
Chapter One Probability and Statistics
57 pages
Statistics Lecture 1
No ratings yet
Statistics Lecture 1
20 pages
Week1 Statistics Detailed
No ratings yet
Week1 Statistics Detailed
3 pages
Stas Tics
No ratings yet
Stas Tics
129 pages
1 - Basic Concepts
No ratings yet
1 - Basic Concepts
71 pages
WK 1 3
No ratings yet
WK 1 3
5 pages
Statapp Chapter 1 121928
No ratings yet
Statapp Chapter 1 121928
2 pages
Statistics
No ratings yet
Statistics
68 pages
IE 211 - Chapter 1
No ratings yet
IE 211 - Chapter 1
92 pages
Applications of Social Media and Social Network Analysis - Lecture Notes in Social Networks PDF
100% (1)
Applications of Social Media and Social Network Analysis - Lecture Notes in Social Networks PDF
247 pages
Presentation 1
No ratings yet
Presentation 1
9 pages
Data Management (1)
No ratings yet
Data Management (1)
46 pages
Main Title: Planning Data Analysis Using Statistical Data
100% (1)
Main Title: Planning Data Analysis Using Statistical Data
40 pages
3 Matm111
No ratings yet
3 Matm111
3 pages
STATISTICS (Tanya) PG 1 - 28
No ratings yet
STATISTICS (Tanya) PG 1 - 28
35 pages
Module 4 Mathematics As A Tool
100% (1)
Module 4 Mathematics As A Tool
6 pages
Psychology 117 Study Guide
100% (3)
Psychology 117 Study Guide
41 pages
Google Ai ML Virtual Internship Report
No ratings yet
Google Ai ML Virtual Internship Report
29 pages
ZYJ260
No ratings yet
ZYJ260
78 pages
Statistics
No ratings yet
Statistics
116 pages
Icc PDF
100% (1)
Icc PDF
279 pages
Module 2 - Statistical Foundations
No ratings yet
Module 2 - Statistical Foundations
108 pages
Understandingstatisticsinresearch 151026064600 Lva1 App6892
No ratings yet
Understandingstatisticsinresearch 151026064600 Lva1 App6892
37 pages
Math 1f - All Lessons
No ratings yet
Math 1f - All Lessons
81 pages
Note For Int To Statistics
No ratings yet
Note For Int To Statistics
24 pages
Pseudocode Cheat Sheet A4
100% (2)
Pseudocode Cheat Sheet A4
6 pages
Introduction Book 1
No ratings yet
Introduction Book 1
41 pages
Introduction To Statistics
100% (3)
Introduction To Statistics
43 pages
Lesson 1 Basic Concepts of Statistics
No ratings yet
Lesson 1 Basic Concepts of Statistics
9 pages
Statistical Techniques - Bda
No ratings yet
Statistical Techniques - Bda
33 pages
Curriculum Development Prof Ed LET Reviewer
100% (1)
Curriculum Development Prof Ed LET Reviewer
6 pages
Lesson 1: Fundamental Concepts and Summation Notation
No ratings yet
Lesson 1: Fundamental Concepts and Summation Notation
8 pages
Enma 104 Notes
No ratings yet
Enma 104 Notes
27 pages
14 AAU - Level 6 - Test - Challenge - Unit 4
100% (13)
14 AAU - Level 6 - Test - Challenge - Unit 4
5 pages
Statistics Is The Study of The Collection, Organization, Analysis, Interpretation, and
No ratings yet
Statistics Is The Study of The Collection, Organization, Analysis, Interpretation, and
18 pages
Basic Concepts in Statistics
No ratings yet
Basic Concepts in Statistics
42 pages
Bahir Dar University College of Agriculture and Environmental Sciences
No ratings yet
Bahir Dar University College of Agriculture and Environmental Sciences
44 pages
Course Introduction Inferential Statistics Prof. Sandy A. Lerio
No ratings yet
Course Introduction Inferential Statistics Prof. Sandy A. Lerio
46 pages
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
100% (1)
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
33 pages
2018 Book CyberSecurityForCyberPhysicalS PDF
100% (1)
2018 Book CyberSecurityForCyberPhysicalS PDF
189 pages
Cobra C1 FastScanManual
No ratings yet
Cobra C1 FastScanManual
64 pages
PSYCHSTATS
No ratings yet
PSYCHSTATS
9 pages
Introduction Statistics
100% (1)
Introduction Statistics
23 pages
Cargador Frontal WA500-6 (English) Komatsu
100% (1)
Cargador Frontal WA500-6 (English) Komatsu
12 pages
Lecture 1 - Introduction To Statistics
No ratings yet
Lecture 1 - Introduction To Statistics
3 pages
Nursing Care Assignment
No ratings yet
Nursing Care Assignment
8 pages
Alemite Oil Mist Application Manual
100% (1)
Alemite Oil Mist Application Manual
34 pages
Statistics: An Introduction and Overview
No ratings yet
Statistics: An Introduction and Overview
51 pages
EPP Lessonplan
No ratings yet
EPP Lessonplan
6 pages
MasterCast 222 TDS-974770
No ratings yet
MasterCast 222 TDS-974770
2 pages
BUCHI Destilador B-324 LIGAL 489 Operationmanual - SP
No ratings yet
BUCHI Destilador B-324 LIGAL 489 Operationmanual - SP
30 pages
Skill Development Under RKVY-2016-17
No ratings yet
Skill Development Under RKVY-2016-17
10 pages
1.safety Inspection Check List
No ratings yet
1.safety Inspection Check List
2 pages
Asset Holiday Home Work 2
No ratings yet
Asset Holiday Home Work 2
13 pages
Design and Optimization of Spur Gear: Second Review
No ratings yet
Design and Optimization of Spur Gear: Second Review
44 pages
Refrigeration
No ratings yet
Refrigeration
5 pages
Health - Lisa Bouslimani - Mental Wellbeing 2024-06-22
No ratings yet
Health - Lisa Bouslimani - Mental Wellbeing 2024-06-22
2 pages
Nostalgia Funny Car Rules V1
No ratings yet
Nostalgia Funny Car Rules V1
5 pages
Permodelan Proses Bisnis Untuk Procurement Suku Cadang Impor (Studi Pada PT Berkah Industri Mesin Angkat Surabaya)
No ratings yet
Permodelan Proses Bisnis Untuk Procurement Suku Cadang Impor (Studi Pada PT Berkah Industri Mesin Angkat Surabaya)
10 pages
Rapid Serial Visual Presentation in Dynamic Graph Visualization
No ratings yet
Rapid Serial Visual Presentation in Dynamic Graph Visualization
8 pages
159.52 101870341003 101870349999 Heating Climatic Unit
No ratings yet
159.52 101870341003 101870349999 Heating Climatic Unit
5 pages
Graph 2 Worksheet
No ratings yet
Graph 2 Worksheet
2 pages
Admission Form BNU
No ratings yet
Admission Form BNU
2 pages
Ambulong Climatological Extremes (As of 2016)
No ratings yet
Ambulong Climatological Extremes (As of 2016)
1 page
Epic Minigeddon2
No ratings yet
Epic Minigeddon2
1 page
Planning A Lesson Using PRIMM: The Five Stages of PRIMM
No ratings yet
Planning A Lesson Using PRIMM: The Five Stages of PRIMM
2 pages
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
Statistics II Essentials
From Everand
Statistics II Essentials
Emil Milewski
2.5/5 (1)
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet