0% found this document useful (0 votes)
50 views76 pages

Module 1 - Introduction To Statistical Concepts, Basic Concepts On Data Management, Methods of Data Collection-JARR

This document provides an introduction to basic statistics concepts and data management. It outlines topics such as the importance of statistics, population vs sample, steps in a statistical inquiry, classification of variables, and data collection methods.

Uploaded by

mho.aglipay
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
50 views76 pages

Module 1 - Introduction To Statistical Concepts, Basic Concepts On Data Management, Methods of Data Collection-JARR

This document provides an introduction to basic statistics concepts and data management. It outlines topics such as the importance of statistics, population vs sample, steps in a statistical inquiry, classification of variables, and data collection methods.

Uploaded by

mho.aglipay
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 76

Introduction to

Introduction to Basic Statistics Concepts

Basic Statistics
and Data Management

Concepts
and Data
Management
Jazel Ann R. Raper
Resource Person
PSRTI
1
MODULE OUTLINE
Introduction to Basic Statistics Concepts

▪ Statistics
and Data Management

▪ Importance of Statistics
▪ Population vs Sample
▪ Steps in a Statistical Inquiry
▪ Variable, Observation, Data
▪ Classification of Variables
▪ Levels of Measurement
2
2
MODULE OUTLINE
Introduction to Basic Statistics Concepts

▪ Data Collection Methods


and Data Management

▪ Defining Data Management


▪ Strategies in Data Management

3
3
Introduction to Basic Statistics Concepts

6
and Data Management

Basic Concepts in Statistics.


6
Introduction to Basic Statistics Concepts

“STATISTICS is the science


of collecting,
collecting organizing,
organizing
and Data Management

presenting,
presenting analyzing,
analyzing
and interpreting numerical data
to assist in making more
effective decisions.”

7
7
Introduction to Basic Statistics Concepts

8
and Data Management

figures.
STATISTICS
is a set of numerical

8
Other Examples of Statistics
Introduction to Basic Statistics Concepts
and Data Management

Monthly salaries
Population count of the different agencies
of government employees

Number of establishments Weight of newborn babies No. of teachers in K12 in Public Schools
9
within the city
Introduction to Basic Statistics Concepts

10
and Data Management

IN DAY TO DAY LIFE


IMPORTANCE OF STATISTICS

10
Statistics is a useful aid
in decision making
Introduction to Basic Statistics Concepts

▪ Describe the general characteristics of the


individuals in the different barangays
and Data Management

▪ Compare incomes and expenses of the different


municipalities in the province

▪ Explain the surge of dengue cases in a certain


municipality

▪ Forecast the population of the province in 2025

▪ Describe and Predict the factors that affect the


production of palay

▪ Study cause-and-effect of drugs among the youth

11
11
Statistics provide information
Introduction to Basic Statistics Concepts

Data
and Data Management

Data

Data Processing Information

Data

Data

12
12
Example:


Introduction to Basic Statistics Concepts
and Data Management

Average peso dollar exchange rate

Average monthly interest


Student Enrollment rates of loans

13
13
Actual Application of Statistics in Decision Making
Among families, the proportion of poor
Introduction to Basic Statistics Concepts

families in the first semester of 2023


was estimated at 16.4%, which is
and Data Management

equivalent to around four million, five


SDG
hundred and ten thousand families.
Meanwhile, the subsistence incidence
among families was recorded at 5.9%, or
around 1.62 million food poor families in
the first semester of 2023. (PSA, 2023)

DSWD and other government agencies


can create more programs to feed the
poor or provide cash transfers to aid
them.
14
14
POPULATION VS SAMPLE
Introduction to Basic Statistics Concepts
and Data Management

Population Sample
A population is a collection of all the A sample is a part (or subset) of the
elements under consideration in any population from which information is
statistical study collected

Example: All government employees Example: A subset of government


in the province employees in the province

15
15
STEPS IN A STATISTICAL INQUIRY
1 2 3
Introduction to Basic Statistics Concepts

Identify the Problem and


Plan the Study Collect the Data
and Data Management

State the Hypothesis

6 5 4
Analyze and
Present the Results Interpret the Explore the Data
Results
16
16
Introduction to Basic Statistics Concepts

17
and Data Management

Areas of Statistics

17
Descriptive Statistics
Methods concerned with Example:
Introduction to Basic Statistics Concepts

collecting, describing, and


analyzing a set of data
and Data Management

without drawing conclusions Four Out of Ten Persons Were


(or inferences) about a large Never Married (2020 Census
group of Population and Housing)

18
18
Inferential Statistics
Methods concerned with the analysis of a subset of data leading to predictions or
inferences about the entire set of data.
Introduction to Basic Statistics Concepts
and Data Management

Sample
Selection

Population Sample

Inferences and
Generalizations

19
19
Example on Inferential Statistics
A new milk formulation designed to improve infant brain development
Introduction to Basic Statistics Concepts

was tested on randomly selected infants.


and Data Management

Sampling

Inferences

Based on the results, it was concluded that the new milk formulation is
effective in improving brain development of infants.

20
20
Example of Inferential Statistics
Introduction to Basic Statistics Concepts

▪ The efficacy of etoricoxib vs mefenamic acid in the treatment


of primary dysmenorrhoea in a randomized comparative
and Data Management

trial.

▪ Authors: Nor Azlin et al. (2008)

▪ Conclusions: “Etoricoxib is a better treatment for primary


dysmenorrhea with better pain relief, less menstrual blood
loss and fewer side-effects compared with mefenamic acid.”

21
21
Introduction to Basic Statistics Concepts

22
and Data Management

WHAT IS A VARIABLE?

22
VARIABLE
• A variable is a characteristic or attribute of the elements in a collection
that can assume different values for different elements
Introduction to Basic Statistics Concepts

• Usually denoted by capital letters of the English Alphabet (X, Y, Z)


and Data Management

• Variable names can also be used to denote a variable (age, income,


children)

Examples of Variables:
X or age = age of pregnant women in the LGU (25 yrs, 34 yrs, 18 yrs, etc.)
Y or income = monthly income of pregnant women P23,400, P42,180, etc.)
Z or children = number of children of pregnant women (3, 5, 0, etc.)

23
23
Variable
Pregnant
Women Age Income Children
Introduction to Basic Statistics Concepts

1 25 15000 3 observations

2 34 23400 5 observations
and Data Management

3 28 42180 0 observations

4 39 21000 1 observations DATA


5 16 15234 4 observations Collection of
6 22 12768 2 observations observations
7 34 10234 2 observations

8 33 32111 1 observations

9 19 13234 4 observations

10 21 15211 3 observations

24
24
Classification of Variables
Introduction to Basic Statistics Concepts

Qualitative Variable Quantitative Variable


and Data Management

have labels or names assigned to is any characteristics that can be


their respective categories. measured or counted.

Also called Categorical Variable Also called Numerical Variable

Examples: Examples:
salary grade level – SG 1, SG 2, etc. weight- 160 lb, 125 lb, 140 lb, etc.
LGU income level – 1st, 2nd, 3rd, etc. no. of personnel per barangay – 5,
type of agency – public, private 10, 8, etc.
estimated monthly income of
households – PhP4,000, PhP 10,500,
PhP18,500, etc.
25
Examples of Variables
Introduction to Basic Statistics Concepts

A study is conducted to determine the socio-demographic characteristics


and Data Management

and waste management practices of households in a certain barangay.


Some information obtained include sex, highest educational attainment
of head, monthly income (in pesos), household size, type of waste (dried
leaves, recyclables, food scraps, etc.) and type of disposal (compost pits,
dump in open pits, garbage collector, etc.)

26
26
Levels of Measurement
Nominal Ordinal
Introduction to Basic Statistics Concepts

▪ Numbers or symbols are ▪ Has all the properties of


Nominal
and Data Management

used to classify
▪ Categories are distinct and ▪ Accounts for order
non-overlapping ▪ No indication of distance
between positions
▪ Categories are of equal
importance Example:
Size of Shirt (Small, Medium,
Example: Large)
Sex (1-Male, 0-Female)

27
27
Levels of Measurement
Interval Ratio
Introduction to Basic Statistics Concepts

▪ Has all the properties of ▪ Has all the properties of


Ordinal
and Data Management

Interval
▪ Equal intervals ▪ May be divided or
▪ May be added or multiplied
subtracted ▪ Has an absolute zero
▪ No absolute zero
Example:
Examples: Income, Weight, Height
Temperature, IQ
28
28
Variable Classification and Levels of
Measurement
NOMINAL
Introduction to Basic Statistics Concepts

Attributes are only named;


weakest
QUALITATIVE
and Data Management

VARIABLES

ORDINAL
Attributes can be
ordered

INTERVAL
Distance is meaningful

QUANTITATIVE
RATIO
Absolute zero; strongest

29
29
Exercise: Classification of Variables
Identify the classification of the variables whether it is Qualitative or
Quantitative (From waste management practices example):
Introduction to Basic Statistics Concepts

1. sex QUALITATIVE
and Data Management

2. highest educational attainment of head QUALITATIVE


3. household size QUANTITATIVE
4. monthly income (in pesos) QUANTITATIVE
5. type of waste (dried leaves, recyclables, food scraps, etc.)
QUALITATIVE
6. type of disposal (compost pits, dump in open pits, garbage collector,
etc.) QUALITATIVE
30
30
RECAP
1. _____________
STATISTICS is the collection, organization, presentation, analysis,
Introduction to Basic Statistics Concepts

and interpretation of a set of numerical figures.


and Data Management

2. _____________
POPULATION is the collection of all the elements under consideration
in the study, where parameters are drawn from. While statistics are
produced from a __________,
SAMPLE which is a subset of this collection.

3. The characteristic or attribute of the elements in a collection that can


assume different values for different elements is called a ____________.
VARIABLE

4. Qualitative variable is also called as _______________


CATEGORICAL variable while
Quantitative variable can also be called as ______________
NUMERICAL variable.

31
31
Introduction to Basic Statistics Concepts

32
and Data Management

Data Collection Methods


32
Introduction to Basic Statistics Concepts
and Data Management

SURVEYS OBSERVATION EXPERIMENTATION

REGISTRATION ADMINISTRATIVE DATA USE OF DOCUMENTED DATA

33
33
Obtains information through:

Personal Interviews
Introduction to Basic Statistics Concepts

Survey Self-administered questionnaires


and Data Management

Data are obtained from Telephone surveys


individuals either by verbal or
written communication Mailed questionnaire

Census Survey vs Sample Survey Online surveys

Focus group discussions


Key Informant Interviews
34
34
PERSONAL INTERVIEW
Interviewers personally ask the questions to the respondents and
Introduction to Basic Statistics Concepts

record the answers of the respondents on the questionnaire


and Data Management

▪ PAPI – pencil and paper


interview
▪ CAPI – computer assisted
personal interview
▪ CATI - computer assisted
telephone interview

35
35
SELF-ADMINISTERED QUESTIONNAIRES

Respondents fill up the questionnaires


themselves without any assistance from an
Introduction to Basic Statistics Concepts

interviewer
and Data Management

TELEPHONE INTERVIEW

Interviewers ask the questions to the


respondents through the telephone

36
36
MAILED QUESTIONNAIRE

A set of questions mailed to the respondent


Introduction to Basic Statistics Concepts
and Data Management

ONLINE SURVEYS

Respondent reads the questions and sends


his responses via the internet or electronic
mail (e-mail)

37
37
FOCUS GROUP DISCUSSION
A moderator follows a question guide to direct a
freewheeling discussion among a small group of
Introduction to Basic Statistics Concepts

people
and Data Management

KEY INFORMANT INTERVIEWS

In-depth interviews with people who know what is going on


in the community - including community leaders,
professionals, or residents—who have first hand
knowledge about the community.
38
38
OBSERVATION
Introduction to Basic Statistics Concepts

▪ Data are obtained by recording


and Data Management

the behavior of an individual or


a phenomenon at the time of
occurrence

▪ Used to gather nonverbal


behavior in a natural setting

39
39
EXPERIMENTATION
Introduction to Basic Statistics Concepts
and Data Management

▪ Direct human intervention on the conditions that may affect the values of the variable of
interest
▪ Goal of an experiment is to isolate the effect(s) of a treatment or intervention on the
variable of interest
40
40
REGISTRATION
Introduction to Basic Statistics Concepts

Refers to enumeration and recording


of vital statistics
and Data Management

▪ Registration of birth
▪ Voter registration
▪ Inventory management
▪ Registration of participants in a
program
▪ Signing-up to a social media platform

41
41
ADMINISTRATIVE DATA
▪ Collected by governments or other organizations for non-statistical reasons to provide
Introduction to Basic Statistics Concepts

overviews on registration, transactions, and record keeping.

Examples of administrative data


and Data Management

▪ Insurance enrollment and claims data


(e.g. Medicare, PhilHealth)
▪ Hospital discharge data (billing data)
▪ Pharmaceutical claims data (billing data)
▪ National health surveys
▪ Surveillance data
▪ Medical record data
▪ Vital statistics
42
42 ▪ Census data
USE OF DOCUMENTED DATA
▪ Data which is developed in accordance with scientifically valid analytical
Introduction to Basic Statistics Concepts

protocols including field trials where appropriate, is subjected to peer


review, results from more than one study, and consistent with other
and Data Management

credible research.
▪ Documentation is the combining
collection of the data.
▪ Documentation that researchers
get from process of observation,
interviews, field notes and file
from institution.

43
43
Sources of Data
Introduction to Basic Statistics Concepts

Primary Data Secondary Data


and Data Management

▪ Data that are collected first- ▪ Data used by a researcher that


hand by the researchers is collected by another
themselves institution
▪ This could be done through ▪ It is usually gathered from
survey, experimentation, documented data and the like
observation, etc.

44 44
RECAP
1. The methods of collecting data are ___________,
SURVEY _____________,
OBSERVATION
Introduction to Basic Statistics Concepts

_________________,________________,________________________,
EXPERIMENTATION REGISTRATION USE OF DOCUMENTED DATA and
____________________.
ADMINISTRATIVE DATA
and Data Management

2. __________
CENSUS is a method of collecting data from every element
of the well-defined population, while a
________________collects
SAMPLE SURVEY from a group of people selected from
the well-defined population.

3. Data gathered first-hand from the source are


_________________,
PRIMARY DATA while data used by researchers but
collected by another institution are _____________________.
SECONDARY DATA

45
45
Introduction to Basic Statistics Concepts

46
and Data Management

Defining
Data Management

46
Introduction to Basic Statistics Concepts

Data Management
and Data Management

Practice of managing data as a


valuable resource to unlock its
potential for an organization.

47
47
Importance of Data Management
1. Productivity - Data management makes it so that it is easier and faster to find,
Introduction to Basic Statistics Concepts

understand, and relay information.


2. Cost Efficiency - When data is managed well then there is no need to re-do data
and Data Management

gathering, but rather just update data on-hand.


3. Operational Agility - Use data to calculate statistics that will be helpful in responding
to incoming events or even make predictions on what may happen.
4. Security - Data management will teach us to how to secure the data collected.
5. Reduced Data Loss - Data management will introduce ways to back-up data and make
it easier to access with proper data identification.
6. Accurate Decisions - Good data management will assist us in coming up with statistics
that will help us make wise and accurate decisions.
48
48
Introduction to Basic Statistics Concepts

49
and Data Management

Strategies in
Data Management

49
Effective Data Management
Introduction to Basic Statistics Concepts

• Developing a data strategy and


reliable methods to:
and Data Management

✔Collect
✔Cleanse
✔Process
✔Store
✔Govern
✔Prepare data for analytics

50
50
Basic strategies
in data management Train and
execute
Introduction to Basic Statistics Concepts

Practice data
governance
and Data Management

Use the right


technology
Create strong
processes
Set
objectives

51
51
1. Set objectives
▪ Consider how it will help contribute to your
Introduction to Basic Statistics Concepts

office’s vision, mission, and goals:


and Data Management

▪ Improve protection of client information


▪ Reduce duplication of information on target
clients
▪ Faster access to existing account records for
efficient processing of documents
▪ Faster computation of indicative program budget
▪ Identification of potential communities for setting
up different programs
52
52
More considerations on objective
setting:
Introduction to Basic Statistics Concepts

Ask:
▪ SMART (Specific,
and Data Management

▪ How often do you


Measurable, need it?
Achievable, ▪ Who needs access?
Relevant, and
▪ What variables are
Time-Bound) important?
▪ Where is the scope?

53
53
2. Create Strong Process
Analyze &
Collect Prepare Store
Introduction to Basic Statistics Concepts

Distribute

• Data sources • Clean and • Data extension • Statistics


and Data Management

• Method of data transform raw type development


collection data • Physical or • Identification of
• Validate data, cloud storage teams or offices
• Time of
handle • Ensure easy
collection • Keeping data access and
incomplete or secure analysis
mismatched
• Communicating
data insights
• Guidelines for
naming, and
data dictionary

54
Collection Stage
Introduction to Basic Statistics Concepts

• Data sources – primary or secondary data


• Method of data collection – surveys, administrative data,
and Data Management

registry data
• Time of collection – ad hoc or with regular schedule

55
55
Preparation Stage
Introduction to Basic Statistics Concepts

• Entry or encoding
• Cleaning, formatting, and validation
and Data Management

• Organizing Data
• Updating

56
56
Data Entry: Tips
▪ Instead of encoding the entire
question, you can give it a shorter
Introduction to Basic Statistics Concepts

variable name.
and Data Management

▪ Qualitative data are better


represented using “codes”
▪ Use codes for missing and non-
response answers, e.g., 9, 99, 999…
etc
▪ Names, codes and definitions
should be made available in the
data dictionary and metadata.

57
57
Data Cleaning, Formatting
and Validation
Introduction to Basic Statistics Concepts

• Confirming missing, skipped entries from the


questionnaire
and Data Management

• Formatting dates, decimals, numerical values, etc.


• Checking response for invalid answers, e.g., married
but aged 12, answered questions meant for another
type of respondent, codes outside the allowed
values.
• Checking answers for too big or too small values,
e.g., age of 130, monthly income of 50,000 for a poor
household. 58
58
Organizing Data
Introduction to Basic Statistics Concepts

• Use of Pivot Tables to display frequency counts of


categorical variables.
and Data Management

Table 1. Number of Employees by Sex in


Company A, 2021

Sex No.
Female 19
Male 11
Total 30

59
59
Organizing Data
Introduction to Basic Statistics Concepts

• Use of Frequency Distribution Tables which summarizes


data by showing the number of observations that belong in
and Data Management

groups or “class intervals”.


Table 2. Distribution of Respondents’ Age in Company A,
2021
Age (in years) No.
20-29 12
30-39 16
40-49 32
50-59 22
60-69 18
Total 100
60
60
Other Organization Techniques
Introduction to Basic Statistics Concepts

▪ Use of tools such as sorting, filtering, conditional formatting


and Data Management

visualize and Helps to quickly…


understand the data organize and find
better specific data

consequently make
more effective
decisions
61
61
Updating Data
Introduction to Basic Statistics Concepts

• Consolidating worksheets - collection


and integration of data from multiple
and Data Management

sources into a single destination

• Protecting worksheets – limiting access


or changes to data files, by adding
passwords to datasets, or making sure
than updates or new entries can only be
done on selected areas.

62
62
Storage Stage
• Data extension type – consistency of file types for data files,
Introduction to Basic Statistics Concepts

e.g., XLS
and Data Management

• Use of either physical or cloud storage


• Develop processes and use tools to keep data secure

63
Analysis and Dissemination Stage
• Analysis and dissemination is collaboration of
Introduction to Basic Statistics Concepts

several offices such as: Local Chief Executive’s Office,


Planning and Development, Office of Strategy
and Data Management

Management, Management Information Systems


Office, Sectoral Offices, and Administration Offices.
• Data management is ensuring that access to data
and analysis is made easy for all those involved.
• The importance of all these is that data and
statistics serve as evidence for policy and
program planning, and therefore data
management is an important practice to achieve
that.

64
3. Technology
Introduction to Basic Statistics Concepts

• Hardware and software


and Data Management

• File-Based vs Database
Management System
• Storage and back-ups.

65
Good Practices in Storing Data
✓ Choose the appropriate storage medium
Introduction to Basic Statistics Concepts
and Data Management

Desktop Portable Cloud


Computer Devices Storage

66
Good Practices in Storing Data
Introduction to Basic Statistics Concepts

✓ Manage versions and


copies of your data MASTE
carefully R
and Data Management

• Protect raw data ORIGIN


• Keep temporary and AL
master copies apart TEMP
• Back-up master copy in
physically distinct
locations
• Set up a strategy for
version control
FINAL!
v1 v2
!
67
Good Practices in Storing Data
Introduction to Basic Statistics Concepts

✓ Structure names and folders


• Focus on easy navigation and finding of
and Data Management

files, e.g., by program then year; by


sector; by function
• Keep it short but meaningful, e.g.,
recognizable acronyms for programs,
people, dates
• Keep a record for the naming and
folder navigation

68
Good Practices in Storing Data
Introduction to Basic Statistics Concepts

✓ Use standard file formats


and Data Management

• Commonly used file formats: .pptx,


.xlsx, .docx, .pdf, .csv, .txt
✓ Secure your data files
• Protection of data files using computer
system security or physical data
security

69
4. Data Governance
▪ A set of processes that ensures that important data assets are
Introduction to Basic Statistics Concepts

formally managed throughout the organization.


▪ Provides direction and oversight.
and Data Management

✓Data dictionary, metadata,


statistical standards
✓Quality Assurance
✓Data security and privacy

70
Definition: Data
Dictionary
Introduction to Basic Statistics Concepts
and Data Management

• Collection of names, definitions, and


attributes about data elements that
are being used or captured in a
database, information system, or
part of a research project.

71
Definition: Metadata
• Data that provides information about other data, or, "data about data".
Introduction to Basic Statistics Concepts

• Metadata can describe:


and Data Management

✓ what data an organization has ✓ what is the data format


✓ what it represents ✓ how is it defined
✓ how it is classified ✓ what are the coding rules
✓ where it came from

72
72
Definition: Statistical Standards
• Agreed rules or guidelines on concepts and definitions for consistency
Introduction to Basic Statistics Concepts

and comparability of data. Standards are used for statistical


purposes such as:
and Data Management

Variables Methodologies
Classification
and and
Systems
Statistics Procedures

73
73
Definition:
Data Quality
Assurance
Introduction to Basic Statistics Concepts
and Data Management

• Planned and systematic actions


needed to provide adequate
confidence that a product, service or
result will satisfy given requirements
for quality and be fit for use.

74
Definition:
Introduction to Basic Statistics Concepts

Data Security and Privacy


and Data Management

• Data security ensures that data privacy


and confidentiality are maintained, that
data is not breached, and that data is
accessed appropriately.
• Ask for identifying information only
when needed, for example, in
beneficiary targeting.

75
Definition: Data Security and Privacy
Introduction to Basic Statistics Concepts

• Data privacy or
and Data Management

information privacy is a
branch of data security
concerned with the
proper handling of data –
consent, notice, and
regulatory obligations.

76
5. Train and execute
Introduction to Basic Statistics Concepts
and Data Management

• Train your Local Planning


Officers, Programmers, Data
analysts.
• Attend training courses.
• Share reliable resources.
• Execute and sustain learnings.

77
THANK
YOU!

You might also like