Data Management (MMW)

Statistics is a branch of applied mathematics focused on collecting, organizing, and interpreting data to make inferences about populations based on samples. It includes descriptive statistics, which summarizes data, and inferential statistics, which generalizes findings from samples to larger populations. Key concepts include population vs. sample, types of variables, data collection methods, and measures of central tendency and variability.

Uploaded by

asassin831

MATHEMATICS AS A TOOL
DATA MANAGEMENT
What is STATISTICS?
■ Statistics is a branch of applied mathematics
concerned with collecting, organizing, and
interpreting data. It attempts to infer the
properties of a large collection of data from
inspection of a sample of the collection, thereby
allowing educated guesses to be made with a
minimum of expense.
MAIN BRANCHES OF STATISTICS
■ Descriptive Statistics refers to the collection, presentation,
and summary of data (either using charts and graphs or using a
numerical summary).
■ Inferential Statistics refers to generalizing from a sample to a
population, estimating unknown population parameters,
drawing conclusions, and making decisions.
Population and Sample
Population refers to all the items (infinite or finite) that we
are interested in. It consists of the totality of the
observations, individuals, or objects in which the
investigator/researcher is interested.
Sample is a subset or portion of the population. It involves
looking only at some items selected from a population.
Parameter – is a value calculated using all the data from a
population.
Statistic – is a value calculated using the data from the sample.

What is a Variable?
A VARIABLE is a characteristic of interest about an object under
investigation that can take on different possible outcomes, such as age,
hair color, height, weight, and religious preference.
Two kinds of Variables
1. QUALITATIVE VARIABLES – These are variables that can be placed into
distinct categories, according to some characteristics or attributes.
2. QUANTITATIVE VARIABLES – These are numerical and
can be ordered or ranked.
■ Quantitative variables consist of two types: Discrete and Continuous.
■ Discrete variables take values obtained by counting, such as frequencies.
■ Continuous variables take values obtained by measurement.
DATA
■ Data are the values of a variable collected from each of
the subjects that belong to the sample. The term refers to a collection
of descriptors of natural phenomena, such as results from
experiences, observations or experiments, or a set of premises.
Data may consist of numbers, words, or images.
■ Data can be classified according to the type of variable from
which they were drawn. There are two general types of data
according to how the data vary across cases:
Types of Statistical Data
■ 1. Numerical data: These data have meaning as a measurement,
such as a person’s height, weight, IQ, or blood pressure, or
the shares of stock a person owns.
■ 2. Categorical data: Categorical data represent characteristics
such as a person’s gender, marital status, hometown, or the
types of movies they like. Categorical data can take on
numerical values (such as “1” indicating male and “2”
indicating female), but those numbers don’t have mathematical
meaning.
Four levels of Measurement
■ 1. Nominal – the lowest of the four ways to characterize data. It
deals with names, categories, or labels (e.g., colors of eyes, yes
or no responses to a survey, favorite breakfast cereal, and
the number on the back of a football jersey).
■ 2. Ordinal – the data at this level can be ordered, but the
differences between the data are not meaningful (e.g., ten cities are
ranked from one to ten, but differences between the ranks don’t make much
sense; letter grades can be ordered so that A is
higher than B, but without any other information).
■ 3. Interval – deals with data that can be ordered, and
in which differences between the data do make
sense. But data at this level have no true zero starting point (e.g.,
Fahrenheit and Celsius scales of temperature).
■ 4. Ratio – the highest level of measurement. Data
possess all of the features of the interval level, in
addition to an absolute zero. Due to the presence of an
absolute zero, it now makes sense to compare the ratios of
measurements.
DATA COLLECTION METHOD
■ Methods of Collecting Data
1. In-Person Interviews
Pros: In-depth and a high degree of confidence in the data
Cons: Time consuming, expensive, and can be dismissed as
anecdotal.
2. Mail Surveys
Pros: Can reach anyone and everyone – no barrier
Cons: Expensive, data collection errors, lag time
3. Phone Surveys
Pros: High degree of confidence in the data collected, can reach almost
anyone
Cons: Expensive, cannot self-administer, need to hire an agency
4. Web/Online Surveys
Pros: Cheap, can self-administer, very low probability of data errors.
Cons: Not all your customers might have an email address or be on the
internet; customers may be wary of divulging information online.
Three Ways of Presenting Data
1. Textual presentation uses words, statements, or paragraphs with
numerals to describe data.
Example:
There are 42,036 barangays in the Philippines. The largest
barangay in terms of population size is Barangay 176 in Caloocan
City with 247 thousand persons. It is followed by Commonwealth
in Quezon City (198,295) and Batasan Hills in Quezon City
(161,409). Twelve other barangays posted a population size of
more than a hundred thousand.
2. Tabular Presentation of Data
Tables present clear and organized data. A table must be clear and
simple but complete.
A good table should include the following parts.
Table number and title – these are placed above the table. The
title is usually written right after the table number.
Caption subhead – this refers to the column and row headings.
Body – it contains all the data under each subhead.
Source – it indicates if the data is secondary, in which case it should be
acknowledged.
3. Graphical Method of Presenting the Data
A graph or chart presents data visually using symbols such
as lines, dots, bars or slices. It depicts the trend of a certain set of
measurements or shows a comparison between two or more sets of data or
quantities.
Frequency Distribution
■ Frequency measures how often something occurs.
■ Example 1
Jack joins football practice every Wednesday morning, Sunday
morning and Sunday afternoon. The frequency of Jack’s football practice
every week is 3 (2 on Sunday and 1 on Wednesday). By counting
frequencies we can make a Frequency Distribution Table.
■ Example 2
Jack’s team has scored the following numbers of goals in
their games:
3, 1, 2, 1, 3, 2, 4, 2, 3, 2, 5, 4, 3, 2
Jack put the numbers in order, then counted:
how often 1 occurs (2 times),
how often 2 occurs (5 times),
how often 3 occurs (4 times),
how often 4 occurs (2 times),
how often 5 occurs (1 time).
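The tally in Example 2 can also be done programmatically. The following is a minimal Python sketch, not part of the original slides; it uses the goal counts from Example 2, and the variable names `goals` and `frequency` are illustrative choices.

```python
from collections import Counter

# Goals scored by Jack's team, copied from Example 2.
goals = [3, 1, 2, 1, 3, 2, 4, 2, 3, 2, 5, 4, 3, 2]

# Counter tallies how often each distinct value occurs.
frequency = Counter(goals)

# Print a simple frequency distribution table, ordered by goals scored.
print("Goals  Frequency")
for value in sorted(frequency):
    print(f"{value:>5}  {frequency[value]:>9}")
```

The frequencies add up to 14, the number of games listed, which is a quick check that no observation was missed.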
Graphical Representation of Frequency Distribution
A. Bar Graph is a pictorial representation of statistical data in such a way that the length of the
rectangles in the graph represents the proportional value of the variable. Bar graphs are
generally used to compare the values of several variables at a time to analyze data. The length
of the bars (horizontal or vertical) represents the frequency of the variable and is applicable to
discrete categories only.
B. Line Graph or Line Chart is a graphical display of information that changes continuously
over time. Within a line graph, lines connect the data points to show a continuous
change. The lines in a line graph can descend and ascend based on the data. We can also use it to
compare different events, situations, and information.
C. Pie Chart is a type of graph that displays data in a circular graph. The pieces of the graph are
proportional to the fraction of the whole in each category. Each slice of the pie is relative to the
size of that category in the group as a whole. The entire “pie” represents 100 percent of a
whole, while the pie “slices” represent portions of the whole.
MEASURES OF CENTRAL TENDENCY
Types of Measures for Center
■ Once the data are collected, it is useful to summarize the data set by
identifying a value around which the data are centered.
Mean – is the numerical balancing point of the data set.
Example:
Add all the numbers, then divide by how many numbers there are.
9, 3, 1, 8, 3, 6
9 + 3 + 1 + 8 + 3 + 6 = 30
30 ÷ 6 = 5
The mean is 5.
Median – is the middle number or the mean of the two middle
numbers in an ordered set of data.
Example:
Order the set of numbers. With an even count, the median is the mean of
the two middle numbers.
1, 3, 3, 6, 8, 9
The two middle numbers are 3 and 6, so the median is (3 + 6) ÷ 2 = 4.5.
Mode – is the most frequently occurring number in a data set.
Example:
The most common number
9, 3, 1, 8, 3, 6
The mode is 3
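The three worked examples above can be reproduced with Python's standard `statistics` module. This is only an illustrative sketch, not part of the original slides; the variable names are arbitrary.

```python
import statistics

# Data set used in the mean, median, and mode examples.
data = [9, 3, 1, 8, 3, 6]

mean = statistics.mean(data)      # sum of the values divided by their count
median = statistics.median(data)  # averages the two middle values when the count is even
mode = statistics.mode(data)      # most frequently occurring value

print("Mean:", mean)
print("Median:", median)
print("Mode:", mode)
```

Note that `statistics.median` sorts the data internally, so the list does not need to be ordered first.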
Types of Measures of Dispersion or Variability
■ Another important feature that can help us understand more
about a data set is the manner in which the data are distributed.
■ Range is the difference between the largest value
(maximum) and the smallest value (minimum) in the data.
■ Standard deviation is an extremely important measure of
spread that is based on the mean. It measures the
typical deviation of the data points from the mean.
■ Variance is the square of the standard deviation of the data.
It does not use the same unit of measure as the original data;
it is expressed in squared units.
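As a rough illustration of these three measures, the sketch below computes them in Python for the data set 9, 3, 1, 8, 3, 6 used in the central tendency examples. Reusing that data set is an assumption made here for illustration. The slides do not say whether the sample or the population formula is intended, so both variances are shown.

```python
import math
import statistics

# Data set borrowed from the central tendency examples (an illustrative choice).
data = [9, 3, 1, 8, 3, 6]

# Range: largest value minus smallest value.
data_range = max(data) - min(data)

# Sample variance divides the squared deviations by n - 1;
# population variance divides by n.
sample_variance = statistics.variance(data)
population_variance = statistics.pvariance(data)

# The standard deviation is the square root of the variance,
# so it is back in the original unit of measure.
sample_sd = statistics.stdev(data)

print("Range:", data_range)
print("Sample variance:", sample_variance)
print("Population variance:", population_variance)
print("Sample standard deviation:", sample_sd)
```

Here the squared deviations from the mean of 5 sum to 50, giving a sample variance of 50 ÷ 5 = 10 and a population variance of 50 ÷ 6.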
Measures of Relative Position
■ Used to describe the position of a data value in
relation to the rest of the data.
■ Types:
■ 1. Quartiles 2. Percentiles 3. Deciles
■ Quartiles
■ Quartiles divide an ordered data set into four equal parts
(quarters). We use subscript notation to label the quartiles: Q1,
Q2 and Q3. The first quartile, Q1, is 1/4 (or 25%) of the way
through the data – the lower quartile. The second quartile, Q2,
is 1/2 (or 50%) of the way through the data – the median. The
third quartile, Q3, is 3/4 (or 75%) of the way through the data – the
upper quartile.
E.g.:
3 4 4 5 6 8 10
4 is the lower quartile
5 would be the median
8 is the upper quartile
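The quartile example can be checked with a short Python sketch. It assumes the "median of each half" convention, where the overall median is left out of both halves when the count is odd. Textbooks and software use several slightly different quartile conventions, so other methods may give different values on other data sets. The function name `quartiles` is illustrative.

```python
import statistics

def quartiles(values):
    """Return (Q1, Q2, Q3) using the median-of-halves convention."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # For an odd count, skip the middle value so it belongs to neither half.
    upper = ordered[half + 1:] if n % 2 else ordered[half:]
    return (statistics.median(lower),
            statistics.median(ordered),
            statistics.median(upper))

q1, q2, q3 = quartiles([3, 4, 4, 5, 6, 8, 10])
print("Q1:", q1, "Q2:", q2, "Q3:", q3)
```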
■ Percentiles
■ Values of the variable that divide a ranked
set into 100 subsets.
For example, P30 would be at 30%.
Percentile Example
The 78th percentile means 78% of the values are
smaller than the given value.
Does making the 80th percentile mean that
you made an 80% on the test? No. It means that your score is
higher than 80% of the scores, whatever the score itself is.
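To make the difference between a percentile and a percentage score concrete, here is a small Python sketch. The list of test scores is hypothetical and invented purely for illustration, and `percentile_rank` is an illustrative helper based on the "percent of values smaller than the given value" reading used above.

```python
def percentile_rank(values, x):
    """Percentage of values that are strictly smaller than x."""
    below = sum(1 for v in values if v < x)
    return 100 * below / len(values)

# Hypothetical test scores (not from the original slides).
scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]

# A raw score of 95 is higher than 8 of the 10 scores.
rank = percentile_rank(scores, 95)
print("Percentile rank of a score of 95:", rank)
```

In this made-up class, a score of 95% sits at the 80th percentile, so the percentile and the test percentage are clearly different quantities.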
DECILES
A decile is a quantitative method of splitting up a set of ranked data into 10 equally
large subsections.

Z-scores
A z-score represents the number of standard deviations a data value falls
above or below the mean. It is used as a way to measure relative position.
Z-score formula
z = (data value - mean) / standard deviation
■ Example
■ A student scored 65 on a math test that
had a mean of 50 and a standard deviation of 10. She
scored 30 on a history test with a mean of 25 and a
standard deviation of 5. Compare her relative position on
the two tests.
Answer
■ Math: z = (65 - 50)/10 = 15/10 = 1.5
■ History: z = (30 - 25)/5 = 5/5 = 1
The student did better in math, relative to the other test takers, because
the z-score was higher.
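The same comparison can be sketched in Python. The function name `z_score` and its parameter names are illustrative, and the formula is the one implied by the worked answer: the data value minus the mean, divided by the standard deviation.

```python
def z_score(value, mean, standard_deviation):
    """Number of standard deviations that value lies above or below the mean."""
    return (value - mean) / standard_deviation

# Figures taken from the example above.
math_z = z_score(65, 50, 10)
history_z = z_score(30, 25, 5)

print("Math z-score:", math_z)
print("History z-score:", history_z)
print("Better relative position:", "math" if math_z > history_z else "history")
```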
