Apstat - Module 2022 Edition
Apstat - Module 2022 Edition
STATISTICAL ANALYSIS
APSTAT 1
WITH SOFTWARE
APPLICATION
LEARNING OUTCOMES
On completion of this course, students will be able to:
1. Recognize and demonstrate comprehension of basic statistical terms
2. Construct descriptive analysis and design presentation of data
3. Summarize distributions by diagrams and statistics
4. Engage in problem solving using probability distributions
5. Construct and apply normal probability distributions
6. Construct a confidence interval for population means and various hypothesis test
MODULE 1
ed by
UNIT 1: INTRODUCTION
Objectives:
1. Define Statistics
2. Differentiate a sample from population; inferential from descriptive statistics
3. Distinguish a parameter from statistic
4. Differentiate qualitative from quantitative data
5. Contrast a discrete from a continuous data
6. Give examples of nominal, ordinal, interval, and ratio data
7. State the uses of Statistics
8. Discuss the brief historical development of Statistics
Statistics plays a vital role in every field of human activity. Statistics helps in determining the existing
position of per capita income, unemployment, population growth rates, housing, schooling medical
facilities, etc., in a country.
Now statistics holds a central position in almost every field, including industry, commerce, trade,
physics, chemistry, economics, mathematics, biology, botany, psychology, astronomy, etc., so the
application of statistics is very wide. Now we shall discuss some important fields in which statistics is
commonly applied
(1)Business
Statistics plays an important role in business. A successful businessman must be very quick and
accurate in decision making. He knows what his customers want; he should therefore know what to
produce and sell and in what quantities.
Statistics helps businessmen to plan production according to the taste of the customers, and the
quality of the products can also be checked more efficiently by using statistical methods. Thus, it can
be seen that all business activities are based on statistical information. Businessmen can make
correct decisions about the location of business, marketing of the products, financial resources, etc.
(2) Economics
hypotheses. The relationship between supply and demand is studied by statistical methods; imports
and exports, inflation rates, and per capita income are problems which require a good knowledge of
statistics.
(3) Mathematics
Statistics plays a central role in almost all natural and social sciences. The methods used in natural
sciences are the most reliable but conclusions drawn from them are only probable because they are
based on incomplete evidence.
Statistics helps in describing these measurements more precisely. Statistics is a branch of applied
mathematics. A large number of statistical methods like probability averages, dispersions, estimation,
etc., is used in mathematics, and different techniques of pure mathematics like integration,
differentiation and algebra are used in statistics.
(4) Banking
Statistics plays an important role in banking. Banks make use of statistics for a number of purposes.
They work on the principle that everyone who deposits their money with the banks does not withdraw
it at the same time. The bank earns profits out of these deposits by lending it to others on interest.
Bankers use statistical approaches based on probability to estimate the number of deposits and their
claims for a certain day.
In auditing, sampling techniques are commonly used. An auditor determines the sample size to be
audited on the basis of error.
(8) Astronomy
Astronomy is one of the oldest branches of statistical study; it deals with the measurement of
distance, and sizes, masses and densities of heavenly bodies by means of observations. During
these measurements errors are unavoidable, so the most probable measurements are found by using
statistical methods.
Example: This distance of the moon from the earth is measured. Since history, astronomers have
been using statistical methods like method of least squares to find the movements of stars.
Types of Statistics
a.) Descriptive Statistics – the method of organizing, summarizing, and presenting data in an
informative way
- Data can be organized in a frequency distribution
- Various charts and graphs maybe used to describe a data
b.) Inferential Statistics (statistical inference) – the method used to estimate a property of a
population on the basis of a sample
Elements of Statistics
a.) Population – the entire set of individuals or objects of interest or the measurement obtained
from all individuals or objects of interest
b.) Sample – a portion, or part of the population of interest
Data can be classified according to levels of measurement. The level of measurement of the data
dictates the calculations that can be done to summarize and present the data. It will also determine
the statistical test that should be performed.
1. Nominal level data – the data are sorted into categories with no particular order to the
categories
Properties:
a. The variable of interest is divided into categories or outcomes
b. There is no natural order to the outcomes
2. Ordinal level data – this measurement presumes that one classification is ranked higher than
another
Properties:
a. Data classified are represented by sets of labels or names (high, medium, low) that have
relative values
b. Because of relative values, the data classified can be ranked or ordered
3. Interval level data – this measurement has the ranking characteristics of the ordinal level of
measurement plus the characteristics that the distance between values is a constant size
Properties:
a. Data classification are ordered according to the amount of the characteristic they possess
b. Equal differences in the characteristics are represented by equal differences in the
measurements
4. Ratio level data – this measurement has all the characteristics of the interval level plus there
is a 0 (zero) point and the ration of two values is meaningful
Properties:
a. Data classification are ordered according to the amount of the characteristic they possess
b. Equal differences in the characteristics are represented by equal differences in the number
assigned to the classification
c. The zero point is the absence of the characteristics and the ration between two numbers is
meaningful
ASSESSMENTS:
1. Place these variables in the following classification tables. For each table, summarize your
observations and evaluate if the results are generally true. For example, salary is reported as a
continuous quantitative variable. It is also a continuous ration scaled variable. Use letters only (36
points)
Objectives:
1. State the different methods in collecting data
2. Enumerate and discuss the sources of data
3. Differentiate Random from Non- Random Sampling
4. Compute accurately for the sample size
5. Discuss the different features of a good questionnaire
6. Determine the appropriate sampling technique used in the study to be conducted
The choice of method is influenced by the data collection strategy, the type of variable, the accuracy
required, the collection point and the skill of the enumerator. Links between a variable, its source and
practical methods for its collection can help in choosing appropriate methods. The main data
collection methods are:
- Direct observations: This method is often referred to as interview method. Interviews: forms which
are completed through an interview with the respondent. More expensive than questionnaires, but
they are better for more complex questions, low literacy or less co-operation.
- Questionnaires: forms which are completed and returned by respondents. An inexpensive method
that is useful where literacy rates are high and respondents are co-operative.
· Registration: registers and licences are particularly valuable for complete enumeration, but are
limited to variables that change slowly, such as numbers of fishing vessels and their characteristics.
· Observation Method – This method is used to collect data pertaining attitudes, behavior, values, and
cultural patterns of the samples under investigation. Subjects may be taken individually or collectively.
It is usually used when the subjects cannot talk. This can be done directly or indirectly.
· The experiment method: this method is used if the researcher would like to determine the cause and
effect relationship of certain phenomena under investigation. This is used in making scientific inquiry
General Tips
Step 1: Conduct a census if you have a small population. A “small” population will depend on your
budget and time constraints. For example, it may take a day to take a census of a student body at a
small private university of 1,000 students but you may not have the time to survey 10,000 students at
a large state university.
Step 2: Use a sample size from a similar study. Chances are, your type of study has already been
undertaken by someone else. You’ll need access to academic databases to search for a study
(usually your school or college will have access). A pitfall: you’ll be relying on someone else correctly
calculating the sample size. Any errors they have made in their calculations will transfer over to your
study.
Step 5: Use a formula. There are many different formulas you can use, depending on what you know
(or don’t know) about your population. If you know some parameters about your population (like a
known standard deviation), you can use the techniques below. If you don’t know much about your
population, use Slovin’s formula..
𝑁
n=
( 1+𝑁 𝑒 2 )
Where:
• n = Number of samples,
• N = Total population and
• e = Error tolerance (level)
•
Sample question: Use Slovin’s formula to find out what sample of a population of 1,000 people you
need to take for a survey on their soda preferences.
Step 2. Plug your data into the formula. In this example, we’ll use a 95 percent confidence level
with a population size of 1,000.
n = N / (1 + N e2) =
1,000 / (1 + 1000 * 0.05 2) = 285.714286
Step 3: Round your answer to a whole number (because you can’t sample a fraction of a person or
thing!)
285.714286 = 286
You can use a sample size calculator to determine how big your sample should be. In
general, the larger the sample size, the more accurately and confidently you can
make inferences about the whole population.
MODULE 4
• Probability sampling involves random selection, allowing you to make statistical inferences
about the whole group.
To conduct this type of sampling, you can use tools like random number generators or other
techniques that are based entirely on chance.
Example
You want to select a simple random sample of 100 employees of Company X. You assign a number
to every employee in the company database from 1 to 1000, and use a random number generator to
select 100 numbers.
2. Systematic sampling
Systematic sampling is similar to simple random sampling, but it is usually slightly easier to conduct.
Every member of the population is listed with a number, but instead of randomly generating numbers,
individuals are chosen at regular intervals.
Example
All employees of the company are listed in alphabetical order. From the first 10 numbers, you
randomly select a starting point: number 6. From number 6 onwards, every 10th person on the list is
selected (6, 16, 26, 36, and so on), and you end up with a sample of 100 people.
If you use this technique, it is important to make sure that there is no hidden pattern in the list that
might skew the sample. For example, if the HR database groups employees by team, and team
3. Stratified sampling
This sampling method is appropriate when the population has mixed characteristics, and you want to
ensure that every characteristic is proportionally represented in the sample.
You divide the population into subgroups (called strata) based on the relevant characteristic (e.g.
gender, age range, income bracket, job role).
From the overall proportions of the population, you calculate how many people should be sampled
from each subgroup. Then you use random or systematic sampling to select a sample from each
subgroup.
Example
The company has 800 female employees and 200 male employees. You want to ensure that the
sample reflects the gender balance of the company, so you sort the population into two strata based
on gender. Then you use random sampling on each group, selecting 80 women and 20 men, which
gives you a representative sample of 100 people.
4. Cluster sampling
Cluster sampling also involves dividing the population into subgroups, but each subgroup should
have similar characteristics to the whole sample. Instead of sampling individuals from each subgroup,
you randomly select entire subgroups.
If it is practically possible, you might include every individual from each sampled cluster. If the
clusters themselves are large, you can also sample individuals from within each cluster using one of
the techniques above.
This method is good for dealing with large and dispersed populations, but there is more risk of error in
the sample, as there could be substantial differences between clusters. It’s difficult to guarantee that
the sampled clusters are really representative of the whole population.
Example
The company has offices in 10 cities across the country (all with roughly the same number of
employees in similar roles). You don’t have the capacity to travel to every office to collect your data,
so you use random sampling to select 3 offices – these are your clusters.
Non-probability sampling techniques are often appropriate for exploratory and qualitative research. In
these types of research, the aim is not to test a hypothesis about a broad population, but to develop
an initial understanding of a small or under-researched population.
1. Convenience sampling
A convenience sample simply includes the individuals who happen to be most accessible to the
researcher.
This is an easy and inexpensive way to gather initial data, but there is no way to tell if the sample is
representative of the population, so it can’t produce generalizable results.
Example
You are researching opinions about student support services in your university, so after each of your
classes, you ask your fellow students to complete a survey on the topic. This is a convenient way to
gather data, but as you only surveyed students taking the same classes as you at the same level, the
sample is not representative of all the students at your university.
Example
You send out the survey to all students at your university and a lot of students decide to complete it.
This can certainly give you some insight into the topic, but the people who responded are more likely
to be those who have strong opinions about the student support services, so you can’t be sure that
their opinions are representative of all students.
3. Purposive sampling
This type of sampling involves the researcher using their judgement to select a sample that is most
useful to the purposes of the research.
It is often used in qualitative research, where the researcher wants to gain detailed knowledge about
a specific phenomenon rather than make statistical inferences. An effective purposive sample must
have clear criteria and rationale for inclusion.
Example
You want to know more about the opinions and experiences of disabled students at your university,
so you purposefully select a number of students with different support needs in order to gather a
varied range of data on their experiences with student services.
4. Snowball sampling
If the population is hard to access, snowball sampling can be used to recruit participants via other
participants. The number of people you have access to “snowballs” as you get in contact with more
people.
Example
You are researching experiences of homelessness in your city. Since there is no list of all homeless
people in the city, probability sampling isn’t possible. You meet one person who agrees to participate
in the research, and she puts you in contact with other homeless people that she knows in the area.
ASSESSMENTS:
3. Given the population size with their corresponding margin of error, determine the sample size
of the following: (24 points)
N e Sample size
( show your
solution)
1. 10,000 5%
2. 5,000 10%
4. Determine what method/s of collection of data is/are appropriate to be used for the following
data: (40 points)
Objectives:
1. State the different methods in presenting data
2. Enumerate the different graphical presentations
3. Draw a graphical presentation of frequency distribution
4. Construct a frequency distribution table
5. Define basic concepts in frequency distribution
6. Graph ogives
A frequency table is a grouping of qualitative data into mutually exclusive classes showing the
number of observation in each class.
Solution:
Step 1: identify the highest value (HV) and lowest value (LV)
HV = 40; LV = 5
Step 2: compute the Range (R = HV – LV)
R = HV – LV
= 40 – 5
= 35 + 1 = 36
Step 3: Compute the interval
36 36
𝟏𝐬𝐭 𝐨𝐩𝐭𝐢𝐨𝐧 ∶ 𝑖 = = 4.5 ≈ 5 2nd option : 𝑖 = = 3.6 ≈ 4
8 10
The denominator indicates the number of classes
2. The expressway police have been monitoring high speedy vehicles to prevent road accidents
especially at night. However there have been doubts expressed on possibility of over speeding
due to excessive traffic jams. There are sixty recorded speeds obtained and as shown below.
23 60 79 32 57 74 52 70 82 60 64 90
95 41 65 92 85 55 76 52 36 64 72 75
80 98 81 67 41 71 83 54 80 75 88 79
73 60 78 89 76 84 88 84 77 78 96 34
71 82 69 74 63 80 85 61 81 25 72 67
a.) Organize the data into a frequency distribution
b.) Determine the relative frequency distribution
Solution:
Step 1: identify the highest value (HV) and lowest value (LV)
HV =___ ; LV = _____
3. CEZC Specialty Food and Beverage Company has been serving a cola drink with an additional
flavoring, Cola -Plus that is very popular among its customers. The company is interested in
customers preference for Cola-Plus versus Coca Cola, Pepsi, and a Lemon-Lime beverage.
They ask 100 randomly sampled customers to take a taste test and select the beverage they
preferred most. The results are shown in the table below.
Beverage Number
Cola-Plus 40
Coca-Cola 25
Pepsi 20
Lemon-Lime 15
Total 100
4. The following histogram shows the scores on the first grading exam for a statistical class
0
50 60 70 80 90
SCORE
Assessments:
6.The food services division of Enchanted Kingdom is studying the amount customer who visit the
park spend per day on food and drinks. (40 points)
₱217 342 116 76 434 240 192 295 319 250 279 205 279 266
199 363 279 232 227 123 474 181 321 309 246 278 50 41
335 177 159 151 303 337 325 297 170 188 320 429 294 570
a. Organize the data into a frequency distribution
b. Determine the relative frequency distribution
c. Determine the cumulative frequency and its corresponding percentages
vanilla, 15 chocolate,
35
𝛴𝑥
𝜇=
𝑁
b. The formula for the mean of a sample is
𝛴𝑥
𝑥̅ =
𝑛
B. The weighted mean is found by multiplying each observation by its corresponding weight.
1. The formula for weighted mean is
𝑤1 𝑥1 + 𝑤2 𝑥2 + 𝑤3 𝑥3 + ⋯ . +𝑤𝑛 𝑥𝑛
𝑥𝑤 =
̅̅̅̅
𝑤1 + 𝑤2 + 𝑤3 + ⋯ + 𝑤𝑛
D. The mode the value that occurs most often in a set of data
1. The mode can be found for a nominal-level data
2. A set of data can have more than one mode
n 𝑥̅ Md Mo
1. 12,13,13,14,15,17,19,20,22,24,24,27,29,30,33 15
2. 22,23,24,25,25,25,27,28,29,29,29,30,31,33,33,33,34,35 18
3. 43,44,45,46,47,48,50,51,53,54,57,59,60 14
4. 31,31,33,34,35,36,39,41,41,43,44,46,46,47,49,51,53 17
5. 23,45,21,13,42,23,34,54,34,16,19,35,25,19,22,36 16
2. Metro Bank is studying the number of times the ATM located in Assumption Road is used per day. The
following are the number of times the machine was used daily over each of the last two weeks.
Determine the mean, the median number of times the ATM was used, and the modal number of times
the ATM was used.
83,64,84,76,84,54,75,59,70,61,63,80,84,73
𝑥̅ = Md = Mo =
ASSESSMENTS:
B. The annual salaries of quality-control managers in selected areas are shown below. What is the
a.) mean annual salary b.) median annual salary c.) modal annual salary (10 pts)
C. Listed below are several indicators of long-term economic growth in the Philippines. The projections
are through the year 2008 (10 pts)
Economic Indicator Percent Change Economic Indicator Percent Change
Inflation 4.5 Real GNP 2.9
Exports 4.7 Investment (residential) 3.6
Imports 2.3 Investment (non-residential) 2.1
Real Disposal Income 2.9 Productivity (total) 1.4
Consumption 2.7 Productivity (Manufacturing) 5.2
D. The CEZC Food, Inc pay its hourly employees ₱16.50, ₱19.00, or ₱25 per hour. There 26 hourly
employees, 14 of which are paid at the ₱16.50 rate, 10 at the ₱19.00 rate, and 2 at the ₱25 rate.
(10 pts)
MODULE 8
GROUPED DATA
Central Variable Formula Where:
tendency
Mean ∑ 𝑓𝑥 ∑ 𝑓𝑥 = summation of f
𝑥̅ 𝑥̅ = multiplied by x
𝑁
N= sample size
Example 1:
Checking: 𝑓1 + 𝑓𝑚 + 𝑓2 = N
Solution:
Solution:
ASSESSMENTS:
MODULE 9
FORMULA
Quartiles Deciles Percentiles
𝑸(𝒏+𝟏) 𝑫(𝒏+𝟏) 𝑷(𝒏+𝟏)
Q= D= P=
𝟒 𝟏𝟎 𝟏𝟎𝟎
n=15
Solve for:
Solution
Formula Substitution Answer Interpolation Final Answer
a. 𝑸𝟏 𝑸(𝒏+𝟏) 𝟏(𝟏𝟓 + 𝟏) th
4 item in the No interpolation, the 𝑸𝟏 =14
𝑸𝟏 = array value is exact
𝟒 𝟒
SOLUTION:
𝑸𝟏 1 3 ( 5.5 − 2) 23.13
𝑖 ( 𝑁 − 𝑓1 ) 20.5 +
4 4
𝑸𝟏 = 𝐿𝐶𝐵 +
𝑓𝑚
3 3 ( 16.5 − 16) 23.13
𝑖 ( 𝑁 − 𝑓2 ) 23.5 −
4 4
𝑸𝟏 = 𝑈𝐶𝐵 −
𝑓𝑚
𝑸𝟑 3 3( 16.5 − 15) 33.63
𝑖 ( 𝑁 − 𝑓1 ) 32.5 +
4 4
𝑸𝟑 = 𝐿𝐶𝐵 +
𝑓𝑚
1 3( 5.5 − 3) 33.63
𝑖 ( 𝑁 − 𝑓2 ) 35.5 −
4 4
𝑸𝟑 = 𝑈𝐶𝐵 −
𝑓𝑚
𝑫𝟐 2 3( 4.4 − 2) 22.3
𝑖 ( 𝑁 − 𝑓1 ) 20.5 +
10 4
𝑫𝟐 = 𝐿𝐶𝐵 +
𝑓𝑚
8 3( 17.6 − 16) 22.3
𝑖 ( 𝑁 − 𝑓2 ) 23.5 −
10 4
𝑫𝟐 = 𝑈𝐶𝐵 −
𝑓𝑚
𝑫𝟔 6 3 ( 13.2 − 11) 31.15
𝑖 ( 𝑁 − 𝑓1 ) 29.5 +
10 4
𝑫𝟔 = 𝐿𝐶𝐵 +
𝑓𝑚
4 3( 8.8 − 7) 31.15
𝑖 ( 𝑁 − 𝑓2 ) 32.5 −
10 4
𝑫𝟔 = 𝑈𝐶𝐵 −
𝑓𝑚
𝑷𝟒𝟐 42 3 ( 9.24 − 9) 26.86
𝑖 ( 𝑁 − 𝑓1 ) 26.5 +
100 2
𝑷𝟒𝟐 = 𝐿𝐶𝐵 +
𝑓𝑚
ASSESSMENTS:
For #10 and #11 solve for skewness for both ungrouped data (30 points) and grouped data (50points) using
:
a.
b.
c.
d.
10.
(30 pts.)
11.
(50 pts.)
MODULE 11
5.3 Computation of Range, Semi-Interquartile Range, Mean Absolute Deviation and Standard Deviation
from Grouped Data
GROUPED DATA
Variable FORMULA Where:
used
RANGE R R = UBUC - LBLC UBLC = upper boundary of the
upper class
LBLC = lower boundary of the
lower class
MEAN ABSOLUTE f = frequency
DEVIATION 𝛴 𝑓𝑑 d = difference of x and 𝑥̅
MAD MAD = where x = midpoint
𝑁
𝑥̅ = mean
∑ 𝑓𝑥
where : 𝑥̅ =
𝑁
N = sample size
STANDARD
DEVIATION
𝑁 ∑ 𝑓𝑥 2 − ( ∑ 𝑓𝑥 )2
𝛿 𝛿=√
𝑁 (𝑁 − 1)
VARIANCE 𝛿2 𝛿 2 = (𝛿)2
Semi-Interquartile SIR 𝑄𝐷 𝑄3 − 𝑄1
Range SIR = = 2
2
Class interval f
40 – 44 1
35 – 39 4
30 – 34 9
25 – 29 11
20 – 24 12
15 – 19 11
10 – 14 6
5–9 6
i=5 N= 60
MODULE 13
SOLUTION:
a. More than ₱4,250
Solution:
𝑥− 𝜇 4,250−5,000
Z= = = −0.75 𝛿
𝛿 1000
𝐴1 = 0.2734
𝐴 𝑇 = 𝐴1 + 0.50
Solution:
𝑥− 𝜇 6,700−5,000
Z= = = 1.7𝛿
𝛿 1,000
Use table 5 to locate the equivalent area of 1.7
𝐴1 = 0.50 − 0.4554
𝐴1 = 0.0446
𝑥− 𝜇 3,500−5,000 𝑥− 𝜇 7,770−5,000
𝑧1 = = 𝑧2 = =
𝛿 1,000 𝛿 1,000
𝑧1 = -1.5𝛿 𝑧2 = 2.77𝛿
𝑥2
𝑥1
𝑥1 − 𝜇 6,500−5000
Z 1= = = 1.5𝛿 ; 𝐴1= 0.4332
𝛿 1,000
𝑥2 − 𝜇 7,750−5000
Z 2= = = 2.75𝛿; 𝐴2 = 0.4970
𝛿 1,000
𝐴𝑡 = 𝐴2 - 𝐴1 = 0.0638
Z value Area
0.39 = 0.1517
0.01 d z = 0.15 0.002 0.0037
0.38 = 0.1480
0.002 𝑥 0.01
d= = 0.0054
0.0037
z= 0.38 + 0.0054
z= 0.3853
Z value Area
0.26 = 0.1026
0.01 d z = 0.10 0.0013 0.0039
0.25 = 0.0987
0.0013 𝑥 0.01
d= = 0.0033
0.0039
z= 0.25 + 0.0033
z= 0.2533
Solve for the value of x:
x=± 𝑧𝛿 + 𝜇
x= 𝑧𝛿 + 𝜇
= (0.2533 x 1,000) + 5,000
x= ₱5,253.30
Z value Area
1.44 = 0.4251
0.01 d z = 0.425 0.0014 0.0015
1.43 = 0.4236
0.0014 𝑥 0.01
d= = 0.0093
0.0015
z= 1.43+ 0.0033
±𝑧= 1.4333
From past records the number of complaints received per day by the complaint department of a department
store has a 𝛿 = 5.5 𝑐𝑜𝑚𝑝𝑙𝑎𝑖𝑛𝑡𝑠. Assuming that the number of complaints is normally distributed, (60 points)
a. What is the mean number of complaints received per day if the probability that the complaint department
received less than 25 complaints is 0.063?
b. What is the probability that the complaint department will receive
b.1. less than 45 complaints in a day?
b.2. between 35 and 40 complaints in a day?
c. How many complaints did the complaint department receive if the probability is
c.1. less than 0.3825
c.2. more than 0.4596
ASSESSMENT # 17:
Example 3:
(𝒙− 𝝁)√𝒏
t=
𝒔
̅𝟏 − 𝒙
(𝒙 ̅𝟐 )−(𝝁𝟏 − 𝝁𝟐 )
t=
(𝒏 −𝟏)𝒔𝟏 𝟐 +(𝒏𝟐 −𝟏)𝒔𝟐 𝟐 𝟏 𝟏
√ 𝟏 ∗√ +
𝒏𝟏 +𝒏𝟐−𝟐 𝒏𝟏 𝒏𝟐
(𝒙
̅− 𝝁)√𝒏
z=
𝜹
Two sample mean test: A sample mean with another sample mean
̅𝟏 − 𝒙
(𝒙 ̅𝟐 )−(𝝁𝟏 − 𝝁𝟐 )
z=
𝟐 𝟐
√𝜹𝟏 +𝜹𝟐
𝒏𝟏 𝒏𝟐
MODULE 16
https://www.socscistatistics.com/tests/chisquare2/default2.aspx
MODULE 17
Three brands of infant powdered milk (infant formula) were given to three of 8 infants each and the results
were monitored for a certain period of time. The result in terms of weight gain are tabulated below.
Respondents Brand A Brand B Brand C
1 4.5 3.2 3.0
2 4.1 3.0 2.8
3 3.0 3.8 3.2
4 5.3 3.9 3.6
5 4.8 4.2 3.5
6 2.7 3.1 3.5
7 4.3 4.0 3.9
8 3.8 3.3 3.6
Test the hypothesis that there is a significant difference in the average weight gain in (lb) among the three
groups of respondents using three brands, at 0.01 level.
Trend Line
Variable Meaning Formula Where:
𝑦𝑡 Forecast for period t 𝑦𝑡 = a+bt 𝑦𝑡 = Forecast for period t
b Slope of the line 𝑛𝛴𝑡𝑦− 𝛴𝑡𝛴𝑦 b = Slope of the line
b= a = Value of 𝑦𝑡 , 𝑎𝑡 𝑡 = 0
𝑛(𝛴𝑡 2 )−(𝛴𝑡)2
a Value of 𝑦𝑡 , 𝑎𝑡 𝑡 = 0 𝛴𝑦−𝑏𝛴𝑡
a=
𝑛
Solution:
REGRESSION
Variable Formula Substitution Final Answer
b 𝑛𝛴𝑥𝑦− 𝛴𝑥𝛴𝑦 10(2184) − (190)(111.5) 0.78
b=
𝑛(𝛴𝑥 2 )−(𝛴𝑥)2 10(3694) − (190)2
a 𝛴𝑦−𝑏𝛴𝑥 190 − 0.78(190) 4.18
a=
𝑛 10
𝑦𝑡 𝑦𝑡 = a + bx 𝑦𝑡 = 4.18 + 0.78 (30) ₱27.58 million
𝑦𝑡 = 4.18 + 0.78 (33) ₱29.92 million
Trend Line
Variable Formula Substitution Final Answer
Jefferson, the Store Manager of Bench, Glorietta Branch, wants to determine the sales for the 13th,14th,15th and
16th month.
Month TV Ad. Sales
(This year) ($000) ($000)
1 8 24
2 5 25
3 13 28
4 9 31
5 6 27
6 16 39
7 10 26
8 15 37
9 11 29
10 14 32
11 12 35
12 7 30
REQUIRED:
• REGRESSION Predict the sales at…. a. 35 ($000 TV Ad)
b. 9 ($000 TV Ad)
c. 18 ($000 TV Ad)
d. 42 ($000 TV Ad)
• TREND LINE Determine the sales for the… e. 13th month
f. 15th month
g. 20th month
h. 35th month
https://www.slideshare.net/rdhaker2011/introduction-to-statisticsppt-rahul
https://www.amstat.org/ASA/History-of-Statistics.aspx
https://www.emathzone.com/tutorials/basic-statistics/functions-or-uses-of-statistics.html
https://www.emathzone.com/tutorials/basic-statistics/importance-of-statistics-in-different-fields.html
https://www.statisticshowto.com/how-to-use-slovins-formula/
https://www.statisticshowto.com/probability-and-statistics/find-sample-size/
https://www.scribbr.com/methodology/sampling-methods/
https://www.slideshare.net/rubyocenar/presentation-of-data-37973327
https://www.slideshare.net/rajkumarteotia/skewness-40437601
https://www.slideshare.net/nilanjanbhaumik9/measures-of-dispersion-43610696
https://www.slideshare.net/joyjoyfradejas/mean-deviation?next_slideshow=1
https://www.dummies.com/education/math/statistics/understanding-the-statistical-properties-of-the-normal-
distribution/
https://www.slideshare.net/dsaadeddin/normal-distribution-77299816
https://statisticsbyjim.com/basics/normal-distribution/
https://www.slideshare.net/mmirfattah/practice-final-page-13?next_slideshow=1
https://www.slideshare.net/nenevie/chapter9-the-normal-curve-distribution
https://examples.yourdictionary.com/examples-of-hypothesis.html
http://www.authorstream.com/Presentation/bharatmewara-1922386-research-hypothesis/
https://www.statisticshowto.com/probability-and-statistics/t-test/
https://www.investopedia.com/terms/t/t-test.asp
https://www.slideshare.net/parth241989/chi-square-test-16093013
http://onlinestatbook.com/2/chi_square/distribution.html
https://www.slideshare.net/RamakanthGadepalli/chi-squared-test-52755609
http://websupport1.citytech.cuny.edu/Faculty/mbessonov/MAT1272/Worksheet%20November%2021%20Solutions.pdf
https://slideplayer.com/slide/5131935/
https://www.slideshare.net/MisabPT/correlation-analysis-67866929
https://www.slideshare.net/rajukammari/hypothesis-and-its-types
https://www.slideshare.net/sonustar3/anova-analysis-of-variance-90850361
https://www.slideshare.net/oxygen024/regression-analysis-8672920
ASSESSMENT # ?
UNIT
TOPIC
NAME OF STUDENT
GROUP NUMBER
GROUP MEMBERS
REFLECTION PAPER
TITLE
NAME OF STUDENT
Instructions:
1. Use standard long bond paper (8.5 inches by 13 inches) in writing your case analysis. Follow the margin 1.5
inches left margin and 1 inch on the right side. Use Arial font 11. Line spacing at 1.5
2. Always label properly your picture as Picture 1; table as Table 1; figure as Figure 1; and chart as Chart 1.
Together with the brief description of the picture, table, figure, and chart.
3. The first page should bear the following information written on the upper left-hand corner:
4. The contents of your given task are found in the succeeding text below
Reflection Paper Template (This outline template follows 5 paragraphs but you can add more paragraphs and rearrange
the body paragraphs to fit your needs)
Paragraph 1: Introduction
a. Hook – give a quick preview of the most exciting part of the lesson learned
b. Problem statement – without giving too much away, write how this experience influenced you
Paragraph 2- 4: Body Paragraph
• What lesson or activity did I enjoy most? Why?
• What is the most important lesson which I can apply in my daily life?
• What are the new insights / discoveries that I learned?
• What topic/s do I find least important?
• What possible topic/s should have been included?
Paragraph 5: Conclusion
a. Summary of the experience
b. Overall impact / lesson learned