
MODULE 1:

DATA AROUND US

What is DATA?
• Data is the gathered body of facts.
• Data is the central thread of any activity.
• Understanding the nature of data is most fundamental to the proper and effective use of statistical skills.

DATA

• Data is often considered the building block of information systems.
• Data is defined as a gathered body of facts.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
EVOLUTION OF DATA

1960
• Master files, reports

1965
• Lots of master files
- Complexity of maintenance and development
- Synchronization of data
- Hardware

1970
• Database – “a single source of data for all processing”

1975
• Online, high-performance transaction processing

1980
• PCs, 4GL technology
• The single-database-serving-all-purposes paradigm


EVOLUTION OF DATA

1960s
• Individual applications that were run using master files
• Reports and programs built in an early language (e.g. Fortran or COBOL)
• Punched cards and paper tape
• Magnetic tape


EVOLUTION OF DATA

Mid-1960s
• The use of master files and magnetic tape exploded
• Huge amounts of redundant data


EVOLUTION OF DATA

Mid-1960s
• Proliferation of master files and redundant data presented some very insidious problems:
- Need to synchronize data upon update
- Complexity of maintaining programs
- Complexity of developing new programs
- Need for extensive amounts of hardware to support all the master files


EVOLUTION OF DATA

1970s
• Direct Access Storage Device (DASD)
- Time to locate a record could be measured in milliseconds
• Database Management System (DBMS) was introduced
Purpose:
- Make it easy to store, index and access data on a DASD
- Single source of data for all processing


EVOLUTION OF DATA

Mid-1970s
• Online Transaction Processing (OLTP)
• Faster access to data possible
• Opening whole new vistas for business and processing


EVOLUTION OF DATA

1980s
• PCs and fourth-generation languages (4GLs)
• More could be done with data than simply processing online transactions
• Management Information System (MIS) could be implemented

EVOLUTION OF DATA

Mid-1980s
• Extract program (the simplest of all programs)
• Rummages through a file or database
• Uses some criteria for selecting data and, on finding qualified data, transports the data to another file or database
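
The behavior of an extract program can be sketched in a few lines of Python. This is only an illustration: the record layout, the field names (`region`, `amount`), the sample rows, and the `extract` helper are all hypothetical, and in-memory buffers stand in for the source and target files.

```python
import csv
import io

def extract(source, target, qualifies):
    """Copy every row of `source` that meets the criteria into `target`."""
    reader = csv.DictReader(source)
    writer = csv.DictWriter(target, fieldnames=reader.fieldnames)
    writer.writeheader()
    copied = 0
    for row in reader:          # rummage through the source
        if qualifies(row):      # apply the selection criteria
            writer.writerow(row)  # transport qualified data to the target
            copied += 1
    return copied

# In-memory stand-ins for the source and target files (hypothetical data).
source_file = io.StringIO("id,region,amount\n1,north,250\n2,south,90\n3,north,40\n")
target_file = io.StringIO()

copied = extract(source_file, target_file, lambda row: row["region"] == "north")
```

The selection rule is passed in as a function, so the same loop can serve any criteria.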



KINDS OF DATA
• Observational data
• Laboratory experimental data
• Computer simulation
• Physical artifacts
• Textual analysis


TYPES OF DATA

QUALITATIVE (CATEGORICAL) DATA


• Objects being studied are grouped into categories based on some qualitative trait.
• Resulting data are merely labels or categories.

QUALITATIVE (CATEGORICAL) DATA

NOMINAL DATA
• A type of categorical data in which objects fall into UNORDERED categories.
EXAMPLE:
• Hair color
• Race
• Status

ORDINAL DATA
• A type of categorical data in which ORDER is important.
EXAMPLE:
• Class
• Degree of Illness
• Opinion of Students About Riots

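
A short Python sketch of how the two kinds of categorical data behave differently. The sample values and the three illness levels (`mild`, `moderate`, `severe`) are assumptions made for illustration only.

```python
from collections import Counter

# Nominal: labels with no inherent order; counting is meaningful, ranking is not.
hair_colors = ["black", "brown", "black", "blond", "black"]
counts = Counter(hair_colors)
most_common_color = counts.most_common(1)[0][0]

# Ordinal: labels with a defined order (these illness levels are assumed).
SEVERITY_ORDER = ["mild", "moderate", "severe"]
rank = {label: position for position, label in enumerate(SEVERITY_ORDER)}

patients = ["severe", "mild", "moderate", "mild"]
sorted_patients = sorted(patients, key=rank.__getitem__)  # sort by rank, not alphabet
worst_case = max(patients, key=rank.__getitem__)
```

Sorting the ordinal labels needs the explicit rank table, because the default string sort would order them alphabetically.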
TYPES OF DATA

QUANTITATIVE (MEASUREMENT) DATA


• Objects being studied are measured based on some quantitative trait.
• Resulting data are a set of numbers.

QUANTITATIVE (MEASUREMENT) DATA

DISCRETE DATA
• Only certain values are possible (there are gaps between the possible values).
• Common examples are numerical values that are integers.

CONTINUOUS DATA
• Theoretically, any value within an interval is possible with a fine enough measuring device.
• Common examples are numerical values that are fractions and decimals.

QUANTITATIVE (MEASUREMENT) DATA

DISCRETE DATA
EXAMPLES:
• Number of students present in class
• Number of people who agree on a certain issue
• Number of times the word “number” is used

CONTINUOUS DATA
EXAMPLES:
• Cholesterol level
• Height
• Age
• Time
• Length
• Weight

QUANTITATIVE (MEASUREMENT) DATA
CONTINUOUS DATA

INTERVAL DATA
• Represent ordered units that have the same difference.
• A variable that contains numeric values that are ordered and where we know the exact differences between the values.
• Do NOT have a “true zero”.

RATIO DATA
• Ordered units that have the same difference.
• DO have an absolute zero.

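
The practical consequence of a true zero can be shown with temperature, an example that is not in the slides: Celsius is an interval scale, while Kelvin is a ratio scale with an absolute zero. The readings below are made up for illustration.

```python
def celsius_to_kelvin(celsius):
    """Convert Celsius (interval scale) to Kelvin (ratio scale, true zero)."""
    return celsius + 273.15

morning_c, noon_c = 10.0, 20.0

# Differences are meaningful on both scales and agree with each other.
diff_c = noon_c - morning_c
diff_k = celsius_to_kelvin(noon_c) - celsius_to_kelvin(morning_c)

# Ratios are only meaningful where zero means "none of the quantity".
naive_ratio = noon_c / morning_c  # 2.0, yet noon is not "twice as hot"
true_ratio = celsius_to_kelvin(noon_c) / celsius_to_kelvin(morning_c)
```

On the Kelvin scale the noon reading is only about 3.5% higher than the morning reading, so "twice as hot" is an artifact of where Celsius places its zero.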
COMPARISON OF CONTINUOUS & DISCRETE DATA

• Continuous data is more precise than discrete data.
• Continuous data is more informative than discrete data.
• Continuous data can remove estimation and rounding of measurements.
• Continuous data is often more time-consuming to obtain.
• Discrete data should be converted to continuous data when possible, so as to obtain a higher level of information and detail.

EXAMPLES OF CONVERSION OF
DISCRETE TO CONTINUOUS DATA

Measuring       Attribute/Discrete          Continuous
Gas Tank        Empty/Full                  Gas Volume
Tree Heights    Tall/Short                  Meters
Performance     Poor/Average/Good           Points/Pieces
Temperature     # Days of Cold              Average Temp.
Delivery        No. of Late                 Time per Delivery
Scrap           # of Pieces > Max Length    Average Length
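
The "Delivery" row can be worked through in Python as a rough sketch. The delivery times and the 30-minute promise are hypothetical values chosen for illustration.

```python
# Hypothetical delivery times in minutes, with an assumed promise of 30 minutes.
delivery_minutes = [22.5, 31.0, 28.0, 45.5, 29.5]
PROMISED_MINUTES = 30.0

# Attribute/discrete view: each delivery is just "late" or "on time".
late_count = sum(1 for minutes in delivery_minutes if minutes > PROMISED_MINUTES)

# Continuous view: keep the measured time per delivery.
average_minutes = sum(delivery_minutes) / len(delivery_minutes)
worst_overrun = max(delivery_minutes) - PROMISED_MINUTES
```

The discrete count reports only that two deliveries were late. The continuous measurements also show that one of them was barely late and the other was late by more than fifteen minutes.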


TYPES OF VARIABLE

DEPENDENT VARIABLE
• The variable of interest
• The outcome of which is dependent on something else

INDEPENDENT VARIABLE
• The variable that is being tested for an effect on the dependent variable


EXAMPLE

DOES HIGH-DOSE CIPROFLOXACIN LEAD TO SEIZURE?

DEPENDENT VARIABLE: SEIZURE
INDEPENDENT VARIABLE: DOSE


IMPORTANCE OF DATA TYPES

• A given statistical method can only be used with certain data types.
• Because of this, continuous data must be analyzed differently from categorical data; otherwise the analysis will be wrong.
• Knowing the type of data enables you to choose the correct method of analysis.

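
One way to picture this is a helper that picks a summary statistic according to the data type. The `summarize` function and its rules are a simplified rule of thumb assumed for illustration, not a complete guide to choosing statistical methods.

```python
from statistics import mean, mode

def summarize(values, data_type, order=None):
    """Return a summary suited to the data type (a simplified rule of thumb)."""
    if data_type == "nominal":
        return mode(values)  # only counting makes sense for unordered labels
    if data_type == "ordinal":
        ranked = sorted(values, key=order.index)
        return ranked[(len(ranked) - 1) // 2]  # lower median is a real category
    if data_type == "continuous":
        return mean(values)  # arithmetic on the values is meaningful
    raise ValueError(f"unknown data type: {data_type}")

hair = summarize(["black", "brown", "black"], "nominal")
illness = summarize(["severe", "mild", "moderate"], "ordinal",
                    order=["mild", "moderate", "severe"])
height = summarize([150.0, 160.0, 170.0], "continuous")
```

Taking the mean of hair colors is impossible, and taking the mean of ordinal codes would assume equal spacing between categories, which is why each type gets its own summary here.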
SOURCES OF DATA

PRIMARY DATA
• Uses surveys, experiments or direct observations

SECONDARY DATA
• Collection may be conducted by gathering information from diverse sources of documents or electronically stored information, census and market studies
• “Data Mining”


SOURCES OF DATA

PRIMARY DATA
• Original data that has been collected specifically for the purpose in mind
• Someone collected the data from the original source
• Has not been published yet and is more reliable, authentic and objective
• Has not been changed or altered by human beings, therefore its validity is greater

EXAMPLES OF PRIMARY DATA
SURVEY
The most commonly used method in the social sciences, management, marketing and, to some extent, psychology. Surveys can be conducted using different methods.

QUESTIONNAIRE
The most commonly used method in surveys. A questionnaire is a list of questions, either open-ended or close-ended, to which the respondent gives answers. It can be conducted via telephone, mail, live in a public area or in an institute, through electronic mail, through fax, and by other methods.

INTERVIEW
This is a face-to-face conversation with the respondent. It is slow and expensive and takes people away from their regular jobs, but it allows in-depth questioning and follow-up questions.

OBSERVATIONS
These can be done while letting the observed person know that he is being observed, or without letting him know. They can also be made in a natural setting as well as in an artificially created environment.

ADVANTAGES AND DISADVANTAGES OF PRIMARY DATA

ADVANTAGES OF PRIMARY DATA
• Data interpretation is better.
• Targeted issues are addressed.
• Efficient spending for information.
• Decency of data.
• Addresses specific research issues.
• Greater control.
• Proprietary issues.

DISADVANTAGES OF PRIMARY DATA
• High cost
• Time consuming
• Inaccurate feedback
• More resources are required

SOURCES OF DATA

SECONDARY DATA
• Data that has already been collected by, and is readily available from, other sources.
• Data being reused from the primary data.
• Data are more quickly obtainable than the primary data.
• Obtained from many sources, including literature, industry surveys, compilations from computerized databases and information systems, and computerized or mathematical models of environmental processes.

EXAMPLES OF SECONDARY DATA
PUBLISHED PRINTED SOURCES
• Books
• Journals & periodicals
• Magazines & newspapers

PUBLISHED ELECTRONIC SOURCES
• E-journals
• Weblogs
• General websites

ADVANTAGES AND DISADVANTAGES OF SECONDARY DATA

ADVANTAGES OF SECONDARY DATA
• Inexpensive
• Easily accessible
• Immediately available
• Will provide essential background and help to clarify or refine the research problem – essential for the literature review
• Secondary data sources will provide research method alternatives.
• Will also alert the researcher to any potential difficulties

DISADVANTAGES OF SECONDARY DATA
• Some are expensive
• Some are not immediately available, and it takes time to define the problem, sampling frame, method and analysis
• Some are not as readily accessible.
• Incomplete information

IMPORTANCE OF DATA AND DATA COLLECTION

• Data is one of the most important and vital aspects of any research study.
• Research conducted in different fields of study can differ in methodology, but every research study is based on data, which is analyzed and interpreted to get information.
• Data is the basic unit in statistical studies. Statistical information like census, population variables, health statistics and other related studies are all developed from data.

DATA VS. INFORMATION

DATA
• Raw numbers that we capture according to some agreed-to standards.
• Go through the client’s current data structure and normalize the data, making sure that all things are measured in the same way.

INFORMATION
• Collection of data points we can use to understand something about the thing being measured.
• Data that have been recorded, classified, organized, related, or interpreted within a framework so that meaning emerges.
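
The step from data to information can be sketched in Python. The names, the height records, and the `TO_CM` conversion table are hypothetical and serve only to illustrate normalizing raw data before interpreting it.

```python
from statistics import mean

# Raw data: heights captured in mixed units (hypothetical records).
raw_heights = [("Ana", 160, "cm"), ("Ben", 1.75, "m"), ("Cara", 168, "cm")]

TO_CM = {"cm": 1, "m": 100}

# Normalize so that everything is measured in the same way.
heights_cm = {name: value * TO_CM[unit] for name, value, unit in raw_heights}

# Information: an interpreted summary that says something about the group.
average_cm = mean(list(heights_cm.values()))
tallest = max(heights_cm, key=heights_cm.get)
```

Without the normalization step, averaging 160, 1.75 and 168 directly would give a number that means nothing, even though every raw value is correct.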


MODULE 1: DATA AROUND US

THANK YOU!
UNDERSTANDING DATA AROUND US
