Lesson 1 STATISTICAL ANALYSIS
Lesson 1 STATISTICAL ANALYSIS
Statistics is a branch of mathematics that deals with the collection, classification, description, and interpretation of data obtained by the
conduct of surveys and experiments. Its fundamental purpose is to describe and draw inferences about the numerical properties of a
population.
STATISTICS refer to numerical observations of almost any kind.
- refers to the science that deals with the collection, tabulation or presentation, analysis, and interpretation of numerical or quantitative
data.
Collection of data - refers to the process of obtaining numerical measurements,
Tabulation or presentation of data - refers to the organization of data into tables, graphs or charts, so that logical and statistical
conclusions can be derived from the collected measurements.
Analysis of data - pertains to the process of extracting from the given data relevant information from which numerical description can be
formulated.
Interpretation of data - refers to the task of drawing conclusions from the analyzed data.
- normally involves the formulation of forecasts or predictions about larger groups on the data collected from small groups.
Two important terms that you should understand in studying statistics are population and sample.
In statistics, population does not only mean a group of people. Population may also mean a defined group or aggregates of objects,
animals, materials, measurements, “things”, “events” or “happenings” of any kind. Thus, a sack of rice, a whole pizza pie, or a set of
weights and heights are considered population.
Since it would be impractical to study the whole population as in the case of a sack of rice, then it is necessary to just take a sample of
the population. Thus, a handful of rice is a sample of the population in a sack of rice. Thus, sample is defined as any subgroup of the
population drawn by some appropriate method from the population. It should be a representative of the population, that is, the sample
will show the properties of the population.
Originally, statistical data took the forms of
1. Theater attendance
2. Basketball results
3. Car sales in in a month
4. Heights
5. Weights
6. So many others that can be expressed numerically
2. One of the most common exposures of the youth today to statistics is in the existing world of sports- in basketball, for
instance. After each quarter of a game, the newscaster would report numerical figures and their averages to millions of thrilled
basketball fans watching the game on television. These figures normally consist of points made out of so many attempts from
the field or from the foul line. These “statistics” would eventually decide whether a player deserves to be paid more or is being
paid more than he deserves.
3. In education, statistical tools are used to get information on enrollment, finance,physical facilities, and so on. Such data are
needed for intelligent administration and management.
4. Statistics are gathered for the purpose of providing government heads with data necessary to guide them in managing the
affairs of the State. From earlier times, most civilized countries have compiled large-scale “statistics” in order to ascertain the
manpower and material strength of the nation. These data are needed for military and fiscal reasons. A large amount of
organized records on the movement of population, cost of living, taxes, wages, and material resources is necessary for
intelligent policy-making and administration.
Methods for the statistical design of experiments are valuable to researchers in medicine and the physical sciences. Causes
and effects of factors which affect experiments are best evaluated using statistical techniques.
5. Psychologists are able to understand the human person better if they are able to systematize, analyze, and interpret data
on intelligence scores, aptitudes, personality trait ratings and attitudes.
6. In sociology, Statistics is used in the study of the conditions of the society in which man lives. Observations, when properly
analyzed and interpreted, may effect positive action toward the improvement of society.
7. In business and economics, statistics plays an important role in the exploration of new markets for a product, forecasting of
business trends, control on the quality of goods produced, and improvement of personnel relations. Decisions and policies for
efficient business and economic management must be based on data which have been properly analyzed and interpreted.
Everyday life is influenced more and more by decision based on quantitative information.
A good survey research paper relies on the precision of the methods and procedures of conducting the study. This includes
reliability of the selected subjects or respondents of the study. The validity of information gathered out of the distributed
questionnaires and the accuracy of measurements used in answering the research questions and other observations.
A study which was conducted in the entire population assures us of 100% reliability since the responses are obtained from all
members of the population. This means that data was collected by a complete enumeration method or the so-called census
taking. However, it is impossible for many types of research to conduct a survey to all members of the population especially if
the population size is infinite or finite but very large. To minimize the time and cost involved in conducting the survey to a
large population, it has been accepted that the information about the population will be based only from a small portion of the
population, called sample. On the other hand considering only the responses of a small portion of the population may result
into some possible biases due to improper selection of the samples and errors due to the manner of measuring the desired
observations since the selected sample may not have equally represented the characteristics of the entire population.
It also normally involves the formulation of forecasts or predictions about larger groups based on the data collected from small
groups.
Descriptive Statistics is concerned with the gathering, classification, and presentation of data and the collection of summarizing values
to describe group characteristics of the data. The summarizing values most commonly used in descriptive statistics are the measures
of central tendency, of variability, and of skewness and kurtosis.
Inferential Statistics demands a higher order of critical judgment and mathematical methods. It aims to give information about large
groups of data without dealing with each and every element of these groups. It uses only a small portion of the total set of data in order
to draw conclusions or judgments regarding the entire set.
Topics included in the study of Inferential Statistics:
1. To prepare and conduct periodic censuses on population, housing, agriculture, fisheries, industry, business, and other sectors of the
economy.
2. To prepare and conduct statistical surveys, researches, and studies on all aspects of socioeconomic conditions.
3. To collect and process for statistical purposes data and records from the different departments, bureaus, offices and agencies of the
government.
4. To conduct researches and studies on census in cooperation with national or local statistical organizations.
5. To develop a well- integrated, consolidated, and coordinated program of up-to-date statistical collection, production, analysis, and
publication for the use of the government and the public.
6. To maintain an efficient system of civil registration.
At present, statistics is a reliable means of describing accurately the values of economic, political, social, psychological, biological, and
physical data. Statistics serves as a tool to correlate and analyze collected data. It is no longer confined to gathering and tabulating
data. Now, it is also a process of interpreting the information that serves as a basis for preparing plans.
1. if the population being considered is the length of life of all the batteries produced by company, then we would have to use up all
batteries in order to get data on their length of life.
2. Imagine that we are interested in the daily food consumption quantity of all families in Makati. So much time, energy, and oney will
be wasted just to get the needed information from each and every family.
This method is time consuming and requires too much effort and money
This is where statistical methods and techniques could come in. When the mass of data is too great to be handled in its entirety, the
sampling method is used. This is the method of getting facts from a small but representative cross-section pf the population. This
representative part of the population is called sample.
The sample is used to describe the population from which it was taken.
For convenience, variables will be used to Student No. Score present the data.
Let x = score obtained by each student 1 36
xi = different values or observations of x 2 28
Xi is read as “x sub i” where i is a subscript 3 46 which indicates the position of each value in the
series. In the given data, there are 10 4 65 observations denoted as x1, x2, x3, x4, x5, x6, x7, x8, x9,
x10. 5 26
Hence, ∑ = x1+ x2+ x3+ x4+ x5+ x6+ x7+ x8+ x9+ 6 38 x10.
I - 1 10 7 52
The symbol ∑ ix is read as “the sum of 10 8 47 observations x1 to x10 ”.
10 9 39
To substitute the data: ∑ ix = 36 + 28 +46 + 10 35 65 + 26 + 38 + 52 + 47 + 39 + 35
I-1
= 412
For large observations, say 50, the summation will be expressed as: ∑ xi = x1+ x2+ x3
i – I 50
A variable is an observable characteristics of a person or objects which is capable of taking several values or of being expressed in
several different categories.
Kinds of Variables:
1. Continuous Variable
A variable which may take any value within a specified range of values.
Example:
Weight height
2. Discontinuous/discrete variable
A variable that can take specific values only.
Values have breaks, gaps or jumps.
Example:
Number of BSA students enrolled in Statistical Analysis
Family size
MEASURENENT OF SCALES:
Four levels or types of measurement:
1. Nominal measurements
Most limited type of measurement.
Merely used to differentiate classes or categories for purely classification or identification purposes.
Example:
Sex ( male, female) the two groups formed can be identified by using numbers like 1 for male group and 0 for female group or vice
versa. These numbers are merely used for identification purposes. We cannot give meaning to the magnitude or size of such numbers.
Although numbers may be used to designate categories or groups, these numbers have very few of the usual properties of numbers.
We cannot use the four fundamental operations on these numbers because these numbers are merely labels or codes for categories.
2. Ordinal Measurements
These do not only classify but also order the classes.
Expressed in ranks is possible if different degrees of an attribute or property are present
Example:
Ranks 1, 2, 3 given by judges to the 3 finalist in a beauty contests
However we are usually unable to determine the degree of difference between any consecutive ordinal measures.
We cannot determine by just how much the beauty contest winner is more beautiful than the second place winner and to what degree
the second place winner is more beautiful than the third placer.
3. Interval Measurements
Has the attributes of ordinal measure plus one more: it can differentiate between any two classes in terms of degrees of differences.
Example:
Mental ability scores
Achievement scores
Temperatures in degrees Celsius
82º C is bigger than 80º C
68º C is lower than 72º C by 4º C
Addition and subtraction have meanings
Zero point of the interval scale is arbitrary and does not reflect the absence of the attribute.
4. Ratio Measurement
Differs from the interval measurement only in one aspect. It has a true zero point which indicates a total absence of the property being
measured.
Example:
Length (0 length means no length at all)
Age
Number of children in a family
Ratios of the numbers assigned in the type of measurement reflect ratios in the amounts of the property being measured.
Example:
If Lea is 180 centimetres tall and Lyka is 90 centimetres tall, we say that Lea is twice as tall than Lyka. Their heights can be
expresses in the ratio 2:1 (two is to one)
Multiplication and division have meanings.