Data Analysis Lecture Notes
Data Analysis Lecture Notes
Data Analysis
02 situations where
variables are
manipulated and
controlled to
establish clear cause
and effect
Observations relationships.
This includes observing the
subjects within their natural
environment where variables
cannot be controlled.
03
Example
You survey 200 students at a university and ask them questions such as “On a scale of 1 to 10, how satisfied are
they with the wi-fi facility”?
You can then perform statistical analysis with the data obtained and draw concrete and productive conclusions.
Checklist for Surveys
• A: Planning – thinking through ‘who’, ‘where’, ‘when’, ‘how’ and ‘what’.
• ☑ Who you plan to speak about (population) – and gather data from (sample)
• ☑ How you will reach your sample
• ☑ What aspects of your research question that can be answered through a questionnaire.
• ☑ When to get the data. Contingencies – that is, having a back-up plan if response rates are low.
• ☑ Where to get the Ethics/ethics approval
• C: Piloting
• ☑ Done a run-through
• ☑ Reflected on the piloting process
• ☑ Sought feedback
• D: Modifying
• ☑ Made appropriate modifications
• ☑ Trialled your stats package
• E: Implementing
• ☑ Distribute your questionnaires
• ☑ Sent reminder letters/emails – send these out if response rates are low
• E: Refining.
• ☑ Kept reviewing and refining – observation takes practice; keep refining until you are comfortable with the process
and the data collected
• ☑ Made modifications – based on your own review of the process; any confirmation strategies you have attempted;
and the quality of the data generated
10
Field Experiments
▪ Field experiments are experimental
investigations carried out in the
natural environment of those being
studied
▪ The field experiment is used in
situations where it is considered
particularly important for research
to take account of the natural
environment.
▪ Advantages
▪ Improved ecological validity
▪ Reduction of demand
characteristics
▪ Weaknesses
▪ Establishing controls
▪ Generalizing to other situations
▪ Cost
▪ Use of technical equipment
11
Parametric Study Objective 3:
Infill modification and
Material / Composition modified putty Graphene
potential improvement
Properties modified grout
No filler With filler on repair
and
behaviour
Composition modified putty:
Compression Very low modulus; High modulus; High modulus, Only grout without filler show
moderate strength; moderate moderate remarkable improvement in strength
very ductile strength; brittle strength; brittle for tensile, flexural, lap-shear
Tensile Very low modulus; High modulus; High modulus;
high strength; low strength; low strength;
brittle brittle brittle
Data Collection
Interview with
experts
Literature
review Impact
Risk
Risk
Identify all type of Risk
risks in PPP projects
Literature Frequency
review
Matrix
Risk
4
Frequency
1
1 2 3 4 5
Impact
Frequency-Impact Matrix for 83 Risk Factors
14
Results
Significant risks
Design and
construction
Operating Social Macroeconomic
15
Results
Significant risks
16
Research Methodology
2. Identification and evaluated risk allocation criteria
and barriers
Literature
Barriers Questionnai
review Identify re
risk
allocation
Criteria and Criteria Comparison
Barriers matrix ANP network
Interview
Rank Criteria
weight
Obtain rank of each
Criteria and Barriers
17
Results
Risk assessment
network
Construction
completion
delay
Availability of
Construction
material /
cost overrun
labor
18
Results
Rank of each risk group
C1: Construction group 0.4014
C2: Legal group
C3: Operating group
C4: Political group
C5: Social group
0.1916
C6: Organization group 0.157
C7: Project group
C8:Macroeconomic group 0.0861
0.0713
C9: Natural group 0.0366 0.0319 0.0187 0.0098
C1 C2 C3 C4 C7 C6 C5 C8 C9
19
Results
Network structure for criteria and barriers
20
Results
E31 E21 E13 E32 E11 E24 E23 E12 E22 E15 E14 C14: Bear the risk at the lowest price
C17: Control the chance of risk
E31: Different sets of information about project risk C32: Risk attitude
E21: Lack of efficient risk allocation mechanisms C21: Obtain reasonable
E13: Lack of understanding of the benefits of optimal allocation C23: Level of governmental support
E32: Poor risk management communication C16: Resources of risk control
E11: Aversion to risk by project participants C22: Obtain intangible assess
E24: Staging involvement of project participants C11: Identification of risk
E23: Static risk allocation C18: Minimize loss if risk occurs
E12: Imbalance and abuse of power C31: Assume the direct
E22: Contract complexity C13: Evaluation of risk
E15: Competitive attitude C110: Expertise of control risk
E14: Lack of trust among project participants C19: Sustain the consequence
C12: Foreseeing risk
C15: Capability of control risk
21
Research Methodology
3. Identify shared risks
ANP
Criteria • Questionnaire • Check consistency
• Pairwise • Obtain score of each
• Make network risk based on public
Risks comparisons structure – private and shared
matrix • Insert data in
Barriers super decision
Fuzzy Limit super
matrix
R R R
Identify Private
R Shared
shared
risks Public
R R
22
Research Methodology
Acceptability
• Determine index • Perception on the MATLAB
weight of each Contribution of Risks to
risk (A) • Acceptable risk Increase Project Cost,
ANP Rating thresholds (B) Time and Decrease
• Make function
Quality (C)
for risk
allocation (D)
Mean index
A
Determine the
KP
Shared B Public percentage of
risks risk allocation
Public
C GA between
D partners
MATLAB
23
Conclusion
24
Qualitative Research
• Qualitative research is expressed in words and non-
numeric data.
• It is used to understand concepts, thoughts or
experiences to discover and gain deep insights into
individual thoughts, experiences, opinions, trends
and dive deep into the problem at hand. This type
of research enables you to gather in-depth insights
on topics that are not well understood.
• A qualitative research analyst collects non-numeric
data from multiple sources ranging from entirely
unstructured to partly structured data. Common
methods include interviews with open-ended
questions, observations described in words, and
literature reviews that explore concepts and
theories, focus group discussions and direct
behavioural observations.
Quantitative vs.Qualitative Data - YouTube
What is Qualitative Research
Used for?
• Qualitative researchers study things in their
natural setting, attempting to make sense of or
interpret phenomenon in terms of the meanings
people bring to them (Denzin & Lincoln, 2000).
• Qualitative research is a form of inquiry that
analyzes information conveyed through language
and behaviour in natural settings.
• It is used to capture expressive information not
conveyed in quantitative data about beliefs,
values, feelings, and motivations that underlie
behaviours.
Some of the common quantitative data collection methods are:
Focus Literature
Interviews Groups Ethnography Review
These include This involves This method entails This includes a
asking respondents discussion amongst being a part of an survey of the works
open-ended a group of people organization or of published
questions verbally. regarding a specific community for a long authors
. topic to gather period to closely
opinions for future observe its culture and
research. mannerisms
Example:
You conduct interviews of 100 employees of a company, asking them open-ended questions like
“Are they satisfied with their job” and “What they think is the most positive aspect of their job”?
Etc.
The answers can help you follow up with further questions and clarifications.
How to Choose the Right
Statistical Method to Use
• Data type
• Nominal, Ordinal, Interval, Ratio?
• Data shape
• Normal Distribution? Skewed?
• Research questions
• Specific research questions
• Hypothesis
When to use Qualitative vs.
Quantitative Research
• For most research requirements,
you can choose quantitative,
qualitative, or mixed approaches.
• The research method you choose
depends upon:
• whether you are undertaking
inductive or deductive research,
• what your research questions are,
• whether you are doing descriptive,
experimental, or correlational
research, and
• the factors into consideration,
namely, money, time, data availability,
and access to respondents.
Data Analysis
Portal of Research Methodology and
Methods
Quantitative Research Qualitative Research
Data Analysis
• Data Analysis is the process of systematically applying statistical
and/or logical techniques to describe and illustrate, condense and “.. various analytic
recap, and evaluate data. procedures provide a way
• While data analysis in qualitative research can include statistical of drawing inductive
procedures, many times analysis becomes an ongoing iterative
process where data is continuously collected and analyzed almost inferences from data and
simultaneously. distinguishing the signal
• The form of the analysis is determined by the specific qualitative (the phenomenon of
approach taken (field study, ethnography content analysis, oral interest) from the noise
history, biography, unobtrusive research) and the form of the data
(field notes, documents, audiotape, videotape). (statistical fluctuations)
present in the data”..
• Improper statistical analyses distort scientific findings, mislead
Shamoo and Resnik (2003)
casual readers (Shepard, 2002), and may negatively influence the
public perception of research.
https://ori.hhs.gov/education/products/n_illinois_u/datamanagement/ 32
Quantitative Data Analysis
Descriptive
Technique
Inferential
Technique
Descriptive Statistics
• Descriptive statistics describe a sample.
Use descriptive statistics to summarize and
graph the data for a group that you
choose. This process allows you to
understand that specific set of
observations.
• You simply take a group that you’re
interested in, record data about the group
members, and then use summary statistics
and graphs to present the group
properties.
Why they matter:
• With descriptive statistics, there is no 1. Provide a macro and micro
uncertainty because you are describing view of the data
only the people or items that you actually 2. Help identify errors and
measure. You’re not trying to infer
properties about a larger population. anomalies in the data
3. Inform which inferential
• Mode of analysis: Mean (average), median statistics you will use
(midpoint), mode (most common),
Standard deviation, skewness
Inferential Statistics
• Inferential statistics takes data from a sample and makes inferences
about the larger population from which the sample was drawn.
• Because the goal of inferential statistics is to draw conclusions from a
sample and generalize them to a population, we need to have
confidence that our sample accurately reflects the population.
Common uses:
• This requirement affects our process. At a broad level, we must do
the following:
1. Predictions about
• Define the population we are studying. differences
• Draw a representative sample from that population. between two or
• Use analyses that incorporate the sampling error. more groups.
• You can use the following methods to collect a representative 2. Predictions about
sample: relationship
• Simple random sampling between two or
• Stratified sampling
more variables
• Cluster sampling
• Systematic sampling
• Correlation
• To assess the relationship between variables. E.g. if
Variable A goes up, does Variable B go up or down?
• Regression
• To assess the causal relationship between variables. E.g.
Does Variable A’s movement…cause Variable B’s
movement? Or do they just happen to move together
naturally?
Data management
This involves familiarizing yourself with
CONTENT ANALYSIS
THEMATIC ANALYSIS
This involves tracking the position,
occurrence, and meaning of words This involves carefully examining the
or phrases. data to identify the key themes and
patterns.
DISCOURSE GROUNDED
ANALYSIS THEORY
This includes exploring the Grounded theory refers to a set of
working of communication systematic inductive methods for
within social contexts. conducting qualitative research
aimed toward theory development.
NARRATIVE IMPERATIVE
ANALYSIS PHENOMENAL
Narrative analysis to understand how ANALYSIS (IPA)
research participants construct story and
narrative from their own personal This is a qualitative approach which aims
experience, through journals, letters, to provide detailed examinations of
conversations, autobiographies, personal lived experience. It produces
transcripts of in-depth interviews, focus an account of lived experience in its own
groups, or other types of narrative terms rather than one prescribed by
qualitative research. pre-existing theoretical preconception.
.
5 Steps to Qualitative Data Analysis
1 2 3 4 5
DATA COLLECTION DATA PROCESS DATA ANALYSIS VALIDATE DATA REPORT
Step 1: Arrange your Step 2: Organize all your Step 3: Set a Code to Step 4: Validate your Step 5: Concluding the
Data Data the Data Collected Data Analysis Process
First, transcribe the Based on the research Code the qualitative Accuracy of your It is important to finally
data collected. objectives, reorganise data by categorizing research design or conclude your data,
Arranging data by the data based on the and assigning methods. Reliability, which means
converting all the data questions asked. properties and patterns which is the extent to systematically
(visual & audio to the collected data. which the methods presenting your data, a
recording, field notes, After assigning codes to produce accurate data report that can be
jotted notes etc) into a your data, you can then consistently. readily used. The report
text format. begin to build on the should state the
patterns to gain in- method that you used
depth insight into the to conduct the research
data that will help make studies, the positives,
informed decisions. and negatives and study
limitations.
Steps in Systematic Data
Analysis
1. Formulate the research question – Like any research process, a clear, unambiguous research question
will help set the direction for your study, i.e. what type of health promotions campaigns have been most
effective in reducing smoking rates of Australian teenagers or Does school leadership makes a difference
to educational standards?
2. Develop and use an explicit, reproducible methodology – Key to systematic reviews are that bias is
minimized and that methods are transparent and reproducible.
3. Develop and use clear inclusion/ exclusion criteria – The array of literature out there is vast.
Determining clear selection criteria for inclusion is essential.
4. Develop and use an explicit search strategy – It is important to identify all studies that meet the
eligibility criteria set in #3. The search for studies need to be extensive should be extensive and draw on
multiple databases.
5. Critically assess the validity of the findings in included studies – This is likely to involve critical
appraisal guides and quality checklists that cover participant recruitment, data collection methods, and
modes of analysis. Assessment is often conducted by two or more reviewers who know both the topic
area and commonly used methods.
6. Analysis of findings across the studies – This can involve analysis, comparison, and synthesis of results
using methodological criteria. This is often the case for qualitative studies. Quantitative studies generally
attempt to use statistical methods to explore differences between studies and combine their effects (see
meta analysis below). If divergences are found, the source of the divergence is analysed.
7. Synthesis and interpretation of results – synthesized results need to be interpreted in light of both the
limitations of the review and the studies it contains. An example here might be the inclusion of only
studies reported in English. This level of transparency allows readers to assess the review credibility and
applicability of findings.
Considerations/issues in data
analysis
There are a number of issues that researchers should be cognizant of with respect to data analysis. These
include:
• Having the necessary skills to analyze
• Concurrently selecting data collection methods and appropriate analysis
• Drawing unbiased inference
• Inappropriate subgroup analysis
• Following acceptable norms for disciplines
• Determining statistical significance
• Lack of clearly defined, objective and accurate outcome measurements
• Manner of presenting data
• Environmental/contextual issues
• Data recording method
• Partitioning ‘text’ when analyzing qualitative data
• Reliability and Validity
• Extent of analysis
Questions for Keeping the Bigger
Picture in Focus
• Questions related to your own expectations
• ☑ What do I expect to find, i.e. will my hypothesis bear out?
• ☑ What don’t I expect to find, and how can I look for it?
• ☑ Can my findings be interpreted in alternative ways? What are the implications?
• Questions related to research question, aims and objectives
• ☑ How should I treat my data in order to best address my research questions?
• ☑ How do my findings relate to my research questions, aims, and objectives?
• Questions related to theory
• ☑ Are my findings confirming my theories? How? Why? Why not?
• ☑ Does my theory inform/help to explain my findings? In what ways?
• ☑ Can my unexpected findings link with alternative theories?
• Questions related to methods
• ☑ Have my methods of data collection and/or analysis coloured my results? If so, in what
ways?
• ☑ How might my methodological shortcomings be affecting my findings?
References:
• https://study.sagepub.com/oleary3e/student-resources/analysing-data/steps-in-quantitative-analysis
• Gottschalk, L. A. (1995). Content analysis of verbal behavior: New findings and clinical applications. Hillside, NJ: Lawrence
Erlbaum Associates, Inc
• Jeans, M. E. (1992). Clinical significance of research: A growing concern. Canadian Journal of Nursing Research, 24, 1-4.
• Lefort, S. (1993). The statistical versus clinical significance debate. Image, 25, 57-62.
Kendall, P. C., & Grove, W. (1988). Normative comparisons in therapy outcome. Behavioral Assessment, 10, 147-158.
• Nowak, R. (1994). Problems in clinical trials go far beyond misconduct. Science. 264(5165): 1538-41.
• Resnik, D. (2000). Statistics, ethics, and research: an agenda for educations and reform. Accountability in Research. 8:
163-88
• Schroder, K.E., Carey, M.P., Venable, P.A. (2003). Methodological challenges in research on sexual risk behavior: I. Item
content, scaling, and data analytic options. Ann Behav Med, 26(2): 76-103.
• Shamoo, A.E., Resnik, B.R. (2003). Responsible Conduct of Research. Oxford University Press.
• Shamoo, A.E. (1989). Principles of Research Data Audit. Gordon and Breach, New York.
• Shepard, R.J. (2002). Ethics in exercise science research. Sports Med, 32 (3): 169-183.
• Silverman, S., Manson, M. (2003). Research on teaching in physical education doctoral dissertations: a detailed
investigation of focus, method, and analysis. Journal of Teaching in Physical Education, 22(3): 280-297.
• Smeeton, N., Goda, D. (2003). Conducting and presenting social work research: some basic statistical considerations. Br J
Soc Work, 33: 567-573.
• Thompson, B., Noferi, G. 2002. Statistical, practical, clinical: How many types of significance should be considered in
counseling research? Journal of Counseling & Development, 80(4):64-71.