Data Collection
Data Collection
Research
Methodology
Data Collection
Jane Labadin
• I’m just a mathematician
• Always a first time
• Deliver based on experience
Caveat
original information which is
collected, stored, accessed, used or
disposed of during the course of the
research, and the final report of the
research findings
What is data?
• Can you do research without data?
• How can you resolve the research problem without
supporting data?
• How do you convince others, that your data are sufficient
to support the solution?
• Where do you go to find data?
• Can you have imaginary data in research?
• Can you have data simulation for research?
Motivation
collection of information (data) which
can be interpreted or analysed to
frame answers to your research
questions or increase knowledge of
your research topic.
Purpose of Collecting
Data
1. Sources – where will you get the information
SOURCE of Information
• Testimonials
• Survey
• Physical evidence
• Interview
• Time series
• Observation
• Tests
• Group assessment
• Photographs, videos
• Expert or peer
reviews • Diaries, journals, logs
• Portfolio reviews • Document review and
analysis
METHODS of data
collection
• Obtained via
• surveys of populations
• repeated experimental procedures
• …
• When recording, include
• detailed information
• Dates
• place of collection
• methods of measurement
• units of measurement) to minimise confusion
• Recorded on
• printed datasheets, then stored in spreadsheet format.
• In some cases, data may be recorded by handheld computers or specialised data
recorders which can later be downloaded to more secure devices.
• Data recorders can often be set up to record data remotely, without the
requirement that researchers be present. Such techniques are frequently used in
meteorological research or in situations where it would be too hazardous for a
researcher to be present (eg industrial chemistry applications, space research).
Quantitative Information
• May be in the form:
• Recordings of interviews
• Transcribed into written form
• Supporting notes
• Description of text/artefacts/system
• Interpretation of text/artefacts/system
Qualitative Information
• Source of Data
In a nutshell
Source of Data
Source of data
Quantitative Qualitative
(numerical) (categorical)
Quantitative data
• Primary data: data observed and recorded or collected
directly from respondents
Types of Data
Primary Secondary
Data Collection Data
Compilation
Print or Electronic
Observation Survey
Experimentation
Types of Data
Ratio Data Differences between
measurements, true
Height, Age, Weekly
zero exists Food Spending
Categorical Data
• Information?
Things to consider
• How much information should you collect?
Sampling
“The natural state of most engineering
information contains significant
variability”
Ang and Tang, Probability Concepts in Engineering Planning and
Design
Research Methods in
Engineering & Computer
Science
Qualitative Research Quantitative Research
• Concentrates on collecting and • Concentrates on what can be measured.
analyzing subjective data • Involves collecting and analyzing objective data
• Usually the perceptions of the • Usually involves some form of math
people involved – Statistical
• Intention is to illuminate – Calculus
perceptions and, thus, gain – Discrete
– greater insight (explain why) and
– Knowledge (reproduce or
recognize).
Research Methods
s monkey doing?
Branch of Quantitative
Research
An Example…
ACR 4
Data Collection Method 1: Retrospective Study
ACR 6
Data Collection Method 2: Observational Study
Probability
Population Sample
Inferential
Statistics
ACR 12
Table 1 Questions asked by software engineering researchers (column 2) that can be answered
by field study techniques
Used by researchers Also used
when their goal is Volume by software
Technique to understand: of data engineers for
Direct techniques
Brainstorming Ideas and general Small Requirements
and focus background about gathering, project
groups the process and product, planning
general opinions
(also useful to enhance
participant rapport)
Interviews and General information Small Requirements
questionnaires (including opinions) to large and evaluation
about process, product,
personal knowledge etc.
Conceptual Mental models of Small Requirements
modeling product or process
Work diaries Time spent or frequency of certain Medium Time sheets
tasks (rough approximation,
over days or weeks)
Think-aloud Mental models, goals, Medium UI evaluation
sessions rationale and patterns to large
of activities
Shadowing and Time spent or frequency of tasks Small Advanced
observation (intermittent over relatively approaches to
short periods), patterns of use case or task
activities, some goals and analysis
rationale
Participant Deep understanding, goals and Medium
observation rationale for actions, time to large
(joining the spent or frequency over
team) a long period
Indirect techniques
Instrumenting Software usage over a long Large Software
systems period, for many participants usage analysis
Fly on the wall Time spent intermittently in one Medium
location, patterns of activities
(particularly collaboration)
Independent techniques
Analysis of work Long-term patterns relating to Large Metrics
databases software evolution, faults etc. gathering
Analysis of Details of tool usage Large
tool use logs
Documentation Design and documentation Medium Reverse
analysis practices, general engineering
understanding
Static and dynamic Design and programming Large Program
analysis practices, general comprehension,
understanding metrics, testing,
etc.
Ask Question
Do background Research
Report results
//www.sciencebuddies.org/ml
Engineering & Computer Science
Why experiment?
• Will I be using a formal, objective, systematic process
where data are utilized to test my research hypothesis?
• What are the variables I will consider in my study?
• Independent Variable(s)
• Dependent Variable(s)
• Extraneous Variable(s)
• What type of quantitative investigation will I pursue?
Ask yourself
Insist on precision only
when is needed
L
r
Montgo
Faculty of Science
LEADERS OF TOMORROW Personal Errors
Master of Inform
Faculty of Comp
• Ensure no distraction– eg:
• Check list
• Place the apparatus at a c
Acknowledgement
• General lighting should be
• Ventilation should be adeq