CAMAD - Data Analysis
CAMAD - Data Analysis
Data Analysis
1
How to Analyze the Data?
1. Data Editing : All questionnaire which are filled
up and received from respondent may not be
accurate, precise, complete, uniform and
consistent. There is some lacking of accuracy, to
find these inaccuracy and incompleteness and to
eliminate the errors of filled questionnaire, data
should be edited.
2
3. Data Tabulation : After classification of
data on the basis of its characteristics like
age, gender, caste, literacy, income level,
religion, nationality, occupation, etc. data
should be summarized. Tabulation is the
process of summarization of data in rows
and columns having special characteristics
in a piece of paper. Such representation of
data in orderly and easily comprehensible
fashion is called tabulation.
3
4. Data Analysis : It is a mathematical
operation to investigate the properties of
data. Such examination of data relies on a
range of statistical theory as well as
mathematical calculation. Analysis of data
is a process of inspecting, cleaning,
transforming, and modeling data with
the goal of highlighting useful
information, suggesting conclusions,
and supporting decision making. Data
analysis has multiple facets and approaches,
encompassing diverse techniques under a
variety of names, in different business,
science, and social science domains.
4
Why is it necessary to analyze the data?
1. To measure the variables,
2. To explore phenomena and explain the phenomenon
3. To summarize data / describe data,
4. To identify relationship between variables and compare variables,
5. To examine the association between variables,
6. To forecast outcomes,
7. To test hypothesis
An analysis must have four elements:
1. Data/information (what)
2. Scientific reasoning/argument (what? who? where? how? what
happens?)
3.Finding (what results?)
4.Lesson/conclusion (so what? so how? therefore,…) 5
Basic guide to data analysis
“Analyse” NOT “narrate”, Go back to research flow
chart, Break down into research objectives and
research questions, Identify phenomena to be
investigated, Visualise the “expected” answers,
Validate the answers with data, Don’t tell something
not supported by data.
Data mining : Data mining is a particular data
analysis technique that focuses on modeling and
knowledge discovery for predictive rather than
purely descriptive purposes. Business intelligence
covers data analysis that relies heavily on
aggregation, focusing on business information.
Qualitative data analysis : Qualitative research
uses qualitative data analysis (QDA) to analyze text,
interview transcripts, photographs, art, field notes of
6
The process of data analysis
1. Assemble the data
2. Bringing data into order ( editing, coding,
tabulations, percentage)
3. Summarize the data ( statistical
summarization, measures of central
tendency, dispersion, graphical
presentation)
4. Select appropriate statistical methods
a. Examine difference
b. Analyze causal relationship
c. Investigate associations 7
Statisti
cs
Descriptive Inferential
Statistics Statistics
8
Descriptive Statistics
Descriptive statistics is the term given to the
analysis of data that helps describe, show or
summarize data in a meaningful way such
that, for example, patterns might emerge
from the data.
It does not make conclusions beyond the data
we have analyzed or reach conclusions
regarding any hypotheses we might have
made.
It provides a clear, concise, useful and
informative picture of a mass of numerical
figures.
9
Descriptive Statistics…
12
Importance / Advantages
Inferential statistics are frequently used to answer cause-and-
effect questions and make predictions.
They are also used to investigate differences between and among
groups.
Provide more detailed information than descriptive statistics.
Yield insight into relationships between variables.
Reveal causes and effects and make predictions.
Generate convincing support for a given theory.
Generally accepted due to widespread use in business and
academia.
Disadvantages
It is quite difficult to learn and use properly
Vulnerable to be misused and abused
13
Types of Inferential Analysis
Tools of Non-parametric Test : Inferential
procedures that are not based on parameters, which
require fewer requirements be satisfied to perform the
tests. They do not require that the population follow a
specific type of distribution.
Distribution : It measures the distribution of
population.
Correlation : It describes the correlation between two
variables. Positive, negative and no linear correlation.
Association : It describes the relationship between
two variables. Coefficient of association vary from 0 to
1 and 0 to -1.
Variance: It is used to test for difference among more
than two groups or data.
14
Types of Inferential Analysis (Contd…)
16
Importance of qualitative analysis
A more accurate reflection of complex reality
More balanced representation of different stakeholders
A better understanding of processes
Better relationship with respondents and more continuous
contact leading to more accurate information
Better understanding of difference and ability to get
sensitive information
Information may be more reliable if the investigation is
not influenced by expectations or fear of consequences.
19
Contd…
Stacked bar chart: It is drawn only for categorical
data. The section of the bar show the proportion of
the variables they represent in the relation to one
another.
Line diagram or trend curve: A line diagram is a
useful way of visually conveying the changes when
long term trends in a phenomena or situation need
to be studied or the changes in the sub category of a
variable is measured on an interval or a ratio scale.
It is useful to illustrating trends in birth or death
rates, and changes in population size.
The scattergram: The scattergram is effective tool
to display the how one variable changes in relation
to a change in the other variable. Data should be
measured on either interval or ratio scales. 20
Thank
You
21