Module 1 Data Around Us
Module 1 Data Around Us
DATA AROUND US
Image or icon Image or icon Image or icon
DATA?
nature of data
gathered central
is the most
body of thread of fundamental or
facts. any activity. proper and
effective use of
statistical skills.
DATA
DATA is often
considered the
building block of
information systems.
Data is defined as a
gathered body of
facts.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
EVOLUTION OF DATA
1960s
• Individual application that were run
using master files.
• Reports and programs built in an
early language (e.g. Fortran or
COBOL)
• Punched cards and paper tape
• Magnetic tape
Mid - 1960s
• Master files and
magnetic tape exploded
• Huge amounts of
redundant data
Mid - 1960s
• Proliferation of master files and redundant data presented some very insidious
problems:
- Need to synchronize data upon update
- Complexity of maintaining programs
- Complexity of developing new programs
- Need for extensive amounts of hardware to support all the master files
1970s
• Direct Access Storage Device (DASD)
- Time to locate a record could be measured in
milliseconds
• Database Management System (DBMS) was
introduced
Purpose:
- Make it easy to store, index and access data on a
DASD
- Single source of data for all processing
Mid-1970s
• Online Transaction Processing
(OLTP)
1980s
• PCs and fourth-generation
languages (4GLs)
• More could be done with data
than simply processing online
transactions
• Management Information
System (MIS) could be
implemented.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
EVOLUTION OF DATA
Mid-1980s
• Extract program (simplest of all
programs)
• Rummages through a file or
database
• Uses some criteria for selecting
data, and, on finding qualified
data, transports the data to
another file or database
DEPENDENT INDEPENDENT
VARIABLE VARIABLE
• The variable of interest • The variable that is being
• The outcome of which is tested for an effect on the
dependent on something dependent variable
else
DOES HIGH-DOSE
CIPROFLOXACIN LEAD TO SEIZURE?
PRIMARY DATA
• Means original data that has been
collected specially for the purpose in mind
• Someone collected the data from the
original source
• Has not been published yet and is more
reliable, authentic and objective
• Has not been changed or altered by
human beings therefore validity is greater.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
EXAMPLES OF PRIMARY DATA
SURVEY QUESTIONNAIRE
Most commonly used Most commonly used method in
method in social survey. These are a list of
sciences, management, questions either an open-ended
marketing and or close-ended for which the
psychology to some respondent give answers.
extent and can be These can be conducted via
conducted in different telephone, mail, live in a public
Icon methods. Icon area, or in an institute, through
electronic mail or through fax
INTERVIEW and other methods.
This is a face-to-face OBSERVATIONS
conversation with the
Icon respondent. It is slow, Icon These can be done while letting
expensive and take the observing person know that
people away from their he is being observed or without
regular jobs, but this letting him know. These can
allow in-depth also be made in natural setting
questioning and follow- as well as in artificially created
up questions. environment.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
ADVANTAGES AND DISADVANTAGES OF PRIMARY DATA
ADVANTAGES OF DISADVANTAGES OF
PRIMARY DATA PRIMARY DATA
• Data interpretation is better.
• Targeted issues are addressed. • High cost
• Efficient spending for • Time consuming
information. • Inaccurate feedbacks
• Decency of data. • More number of resources is
• Addresses specific research required
issues.
• Greater control.
• Proprietary issues.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
SOURCES OF DATA
SECONDARY DATA
• Data that has been already collected by
and readily available from other sources.
• Data being reused from the primary data.
• Data are more quickly obtainable than the
primary data.
• Obtained from many sources, including
literature, industry surveys, compilations
from computerized databases and
information systems, and computerizes or
mathematical models of environmental
processes.
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
EXAMPLES OF SECONDARY DATA
PUBLISHED PUBLISHED
PRINTED SOURCES ELECTRONIC SOURCES
Icon Icon
ADVANTAGES OF DISADVANTAGES OF
SECONDARY DATA SECONDARY DATA
• Inexpensive • Some are expensive
• Easily accessible
• Immediately available • Some are not immediately
• Will provide essential background and available and takes time to
help to clarify or refine research
problem – essential for literature
define problem, sampling
review frame, method and analysis
• Secondary data sources will provide • Some are not as readily
research method alternatives.
• Will also alert the researcher to any accessible.
potential difficulties • Incomplete information
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
IMPORTANCE OF DATA AND DATA COLLECTION
DATA INFORMATION
Raw numbers that we Collection of data points we
capture according to can use to understand
some agreed to something about the thing
Icon standards. Icon being measured.
References
Buendia, M. (2016). Introduction to Data Warehousing. Special Training in Business Analytics (Module 2) for Teachers of HEI
(pp. 1-5). Manila: University of the East.
Campaign, D. Q. (2011, June 24). Data Is Power. Retrieved from Youtube: https://www.youtube.com/watch?v=77UPUxB2b7o
Data vs. Information vs. Insight. (2018). Retrieved from Benedictine University:
https://online.ben.edu/programs/mba/resources/data-vs-information-vs-insight
Data, Information and Statistics. (2013, July 23). Retrieved from Statistics Canada: https://www.statcan.gc.ca/edu/power-
pouvoir/ch1/definitions/5214853-eng.htm
Donges, N. (2018, March 18). Data Types in Statistics. Retrieved from Towards Data Science:
https://towardsdatascience.com/data-types-in-statistics-347e152e8bee
Hinglish, T. (2018, January 22). What is Data | What is Information | Difference Between Data and Information. Retrieved
from Youtube: https://www.youtube.com/watch?v=F20qEwXBQaE
M, M. K. (2013, May 9). Source of Data in Research. Retrieved from SlideShare:
https://www.slideshare.net/manukumarkm/source-of-data-in-research
Roberts, T. (2015, December 6). Data vs. Information. Retrieved from YouTube: https://www.youtube.com/watch?
v=bitUrAmXTnI
Smith, P. (n.d.). Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data. Retrieved from University at
Buffalo: https://www.accp.com/docs/bookstore/biosampl.pdf
Sridhar, M. S. (2014, November 25). Types of data. Retrieved from Slideshare: https://www.slideshare.net/mssridhar/types-
of-data-42010881
Types of Data. (n.d.). Retrieved from Albany: https://www.albany.edu/~msz03/sta552/pennstate/types_of_data.pdf
VelactionVideos. (2013, April 24). Data Collection: Understanding the Types of Data. Retrieved from YouTube:
https://www.youtube.com/watch?v=Coe0N2xb8kk
What is Data. (2018). Retrieved from University of Minnesota: https://www.lib.umn.edu/datamanagement/whatdata
CENTRO ESCOLAR UNIVERSITY FOR INTERNAL CIRCULATION ONLY 2018
MODULE 1: DATA AROUND US
THANK YOU!
UNDERSTANDING DATA AROUND US