Statsprob Reviewer
Statsprob Reviewer
- it establishes the foundation for deeper understanding later on down the road. KEY POINTS
- ease of customization is key when setting up online learning - Difference between questions that could be and those that could not be answered
using Statistics.
- use your professional judgement and tailor the content as needed for your students. - Statistics is a science that studies data.
You want to manage assignment availability options, levels of feedback, and links to - There are many uses of Statistics but its main use is in decision-making.
the text to best meet student needs. - Logical decisions or solutions to a problem could be attained through a statistical
process
- select assessment questions that hit varying levels of difficulty—easy, moderate,
and hard—and build on previous learning.
- most educational institutions currently use digital spaces to post grades, submit Data Collection Activity
assignments, discuss material, and exchange ideas. Leverage these items as an
educator to support student learning outside of the classroom Basic Terms in Statistics
- Facts are collected for purposes of getting aggregate information, but confidentiality
should be protected
Origin: Christian Bible - Agencies mandated to collect data is bound by law to protect the confidentiality of
information provided by respondents
- Book of Numbers
- God reported to have instructed Moses to carry out a census. Data Collection
- Census ordered by Caesar Augustus throughout the entire Roman Empire before
the birth of Christ. - provide the needed information to the best of your knowledge
- before responding to the questionnaire provided, it is recommended that each item - collection of respondents from whom one obtain the data.
in the questionnaire should be clarified
- such as Student Information Sheet
Variable
Types of Data Collection
- a collection of facts from experiments, sample surveys and censuses, and Broad Classification of Variables
administrative reporting systems
- facts and figures that are presented, collected and analyzed Qualitative
- either numeric or non-numeric and must be contextualized
- variables express a categorical attribute
- do not strictly take on numeric values
- answer questions “what kind.”
Contextualize Data - e.g. sex (male or female), religion, marital status, region of residence, highest
- Identify its six W’s or to put meaning on the data educational attainment
1. Who? Who provided the data? Quantitative
2. What? What are the information from the respondents and What is the unit of the
measurement used for each of the information (if there is any)? - otherwise called numerical data, whose sizes are meaningful
3. When? When was the data collected? - answer questions such as “how much” or “how many”
4. Where? Where was the data collected? - have actual unit of measure
5. Why? Why was the data collected? - e. g. height, weight, number of registered cars
6. HoW? HoW was the data collected?
Once the data are contextualized, there is now meaning to the collection of number
and symbols.
Quantitative Classification
Discrete Data
Ordinal Level
- a universe is a collection of units from which the data were gathered Likert Scale
- a variable is a characteristic we observed or measured from every element of the
universe
- a population is a set of all possible values of a variable
- a sample is a subgroup of a universe or a population
- in a study there is only one universe but could have several populations
- variables could be classified as qualitative or quantitative, and the latter could be
further classified as discrete or continuous
LEVELS OF MEASUREMENT
- Plan or design the collection of data to verify the validity of the statement in a way
that maximizes information content and minimizes bias;
- Collect the data as required in the plan;
- Verify the quality of the data after it was collected;
- Summarize the information extracted from the data; and
- Examine the summary statistics so that insight and meaningful information can be
produced to support your decision whether to believe or not the given statement NOTE
Variable Level of
Measurement
Class Student Number Nominal
Sex Nominal
Number of Siblings Ratio
Weight (in kilograms) Ratio
Height (in centimeters) Ratio
Use of Existing Records
Age of Mother Ratio
Usual Daily Allowance in Ratio - data collected by other entities for certain purposes these data are secondary data
School (in pesos) e.g., data from the Philippine Statistics Authority, administrative records,
Usual Daily Food Ratio news articles, internet
Expenditure in School (in
the latest series of the Consumer Price Index (CPI) generated by the
pesos)
Philippine Statistics Authority was downloaded from PSA website
Usual Number of Text Ratio
Messages Sent in a Day
Usual Sleeping Time Nominal
Most Preferred Color Nominal NOTE: Use of Existing Records
Happiness Index of the Day Ordinal
- we must be confident of the quality of the data we are using by knowing how the
data were gathered
METHODS OF DATA COLLECTION - we must remember to request permission and acknowledge the source of the data
when using data gathered by other agency or people
Objective
- uses any or combination of the five senses (sense of sight, touch, hearing, taste,
and smell) to measure the variable
- provides the label of what is being presented in a column
Row Header
Body
- are the information in the cell intersecting the row and the column
Graphical Presentation
Tabular Presentation
Table Title
- includes the number and a short description of what is found inside the table
Column Header
FDT
Qualitative FDT
- the non-overlapping categories of the variable are identified and frequencies the
percentages of observations falling into the categories are computed
Quantitative FDT
Ungrouped FDT
- is constructed when there are only a few observations or if the data set contains only
few possible values
Grouped FDT
- is constructed when there is a large number of observations and when the data set
involves many possible values
- distinct values are grouped into class intervals
- identify the largest data value or the maximum (max) and smallest data value or the
minimum (min) from the data set and compute the range (R)
- the range is the difference between the largest and smallest value
The Frequency Distribution Table (FDT) and Histogram R = MAX – MIN
- determine the number of classes, k using k = √N, where N is the total number of
- a special type of tabular and graphical presentation observations in the data set
- used to depict the distribution of the data - round off k to the nearest whole number; it should be noted that the computed k
- most of the time, these are used in technical reports might not be equal to the actual number of classes constructed in an FDT
- calculate the class size, (C), using C = R/k;
- round off C to the nearest value with precision the same as that with raw data
- construct the classes or the intervals;
- a class interval is defined by a lower limit (LL) and an upper limit (UL) - refers to the total number of observations greater than or equal to the LL of the class
the LL of the lowest class is usually the MIN of the data set (>CF) or the total number of observations less than or equal to the UL of the class
the LL’s of the succeeding classes are then obtained by adding C to the LL of the (<CF)
preceding classes
the UL of the lowest class is obtained by subtracting one unit of measure (1/10 x),
where x is the maximum number of decimal places observed from the raw data from
Relative Cumulative Frequency (RCF)
the LL of the next class
the UL’s of the succeeding classes are then obtained by adding C to the UL of the - refers to the fraction of the total number of observations greater than or equal to the
preceding classes LL of the class (>RCF) or the fraction of the total number of observations less than
the lowest class should contain the MIN, while the highest class should contain the or equal to the UL of the class (<RCF)
MAX - both the <RCF and >RCF can also be expressed in percent
- tally the data into the classes constructed in step to obtain the frequency of each
class
- each observation must fall in one and only one class
Histogram
- add distributional characteristics
- a graphical presentation of the frequency distribution table in the form of a vertical
bar graph
True Class Boundaries (TCB) - frequency – vertical axis
- true class boundaries – horizontal axis
- the TCBs reflect the continuous data
- defined by a lower TCB (LTCB) and an upper TCB (UTCB)
- these are obtained by taking the midpoints of the gaps between classes or by using
the following formulas:
LTCB = LL – 0.5 (one unit of measure)
UTCB = UL + 0.5 (one unit of measure)
- the midpoint of a class and is obtained by taking the average of the lower and upper
TCB’s
CM = (LTCB + UTCB) / 2