Lesson 1 Intro To Statistics
Lesson 1 Intro To Statistics
Statistics is defined as a branch of science which deals with the collection, presentation, analysis of
data and interpretation of results.
Collection of data refers to the different data gathering techniques.
Presentation of data refers to the process of organizing data such as tabulation, presenting through the
use of charts or graphs.
Analysis of data refers to the method of obtaining necessary, relevant and noteworthy information from
the given data through the use of statistical tool.
Interpretation of results refers to the tools of drawing out of conclusions, descriptions, inferences, or
generalization from the analyzed data.
Main division of statistics:
1. Descriptive statistics pertains to the methods dealing with the collection, organization and analysis
of a set of data without making conclusions, predictions or inferences about a larger set. Here the
main goal is simply to provide a description of a particular data set for which the conclusion or the
important characteristics apply only to the data set on hand.
2. Inferential statistics pertains to the methods of dealing with making inferences (a conclusion reached
on the basis of evidence and reasoning), estimations or predictions about a larger set of data using
the information gathered from a subset of this larger set. Here the main goal is not merely
providing a description of a particular data set but also to make prediction and inferences based on
the available information gathered. The conclusions or the important characteristics apply to a
larger set from which the data on the hand is only a subset.
Example:
Descriptive Statistics
A college dean wants to determine the average semestral enrolment in the past 5 school years.
A businessman wants to know his profit for the past three months.
A chief nurse wants to determine the average number of dengue patients during the last 6 months.
Inferential Statistics
A college dean wants to forecast the average semestral enrolment based on the enrolment for the last five
school years.
A businessman would like to estimate his profit for the next five months.
A chief nurse wants to predict the number of dengue patients for the next two years.
In research activities, the word population refers to the collection of all cases in which the researcher is
interested. In its general sense, this pertains to the totality of objects, animals, persons, materials,
happenings or events under investigation.
Example:
a) All students of ABC University.
b) All patients of Surgical Ward in a hospital.
c) All companies manufacturing electronic parts.
A sample refers to a portion of a subset of the population from which the information is gathered.
Theoretically, a sample must be a representative of the population from which it is taken.
Example:
a) Engineering students of ABC University.
b) Patients of surgical ward in a hospital with respiratory complications.
c) Companies manufacturing electronic parts that have employees less than 300.
If a researcher for instance wants to determine the grade point average of all students of ABC
University in a particular semester, then this value would normally be different if he were to determine
the grade point average only of Engineering students in that particular semester. Thus, it is plausible to
conclude that a population measure is different from a sample measure.
In statistics, population measures are called parameters. These are the numerical characteristics
of a population which is usually denoted by small Greek letters.
On the other hand, sample measure or values related to samples are called statistics. These are
usually denoted by the lower- case letters of the English alphabet.
Example:
Types of Data
A variable is a characteristic, description, or attribute of persons or objects which assumes different
values or labels.
Example:
a) Height, age, weight are variables which assume numerical responses or values.
Height: 5 feet and 4 inches
Weight: 120 kilograms
Age: 20 years old
b) Gender, religious affiliation, civil status, are also variables which assumes not values but different
labels or categories.
Gender: Male (M) Female (F)
Religious Affiliation: Roman Catholic Adventist
Mormon Protestant
Civil Status: Single Married
Widow Separated
Variables are generally classified into two namely qualitative and quantitative variables.
A qualitative variable yields categorical responses while a quantitative variable yields numerical
responses representing an amount or quantity.
Example: Civil status and religious affiliation are qualitative variables. Number of children in the family,
blood pressure, temperature are quantitative variables.
4. A Ratio data possesses all the properties of the nominal, ordinal and interval data. It is also numeric
in nature and has an absolute zero point. Thus, in a ratio data, we can classify, order/rank them and
likewise we can also compare their magnitudes.
Example:
Age, income, exam scores are examples of ratio data.
Classification of Data
1. Raw Data are those which are in their original form and structure. Responses out from surveys,
taped interviews, recorded observations are examples of raw data.
2. Grouped Data are those placed and summarized in tabular form