Data Analysis
OBJECTIVES FOR THIS DAY
After the lesson, you should be able to:
Understand the importance and relevance of data analysis
Understand the importance of data quality and how to handle missing values, outliers,
and inconsistencies.
Understand measures of central tendency, dispersion, and distribution.
Before Data Collection
Determine the method of data analysis
Determine how to process the data
Consult a statistician
Prepare dummy tables
After Data Collection
Process the data
Prepare tables and graphs
Analyze and interpret findings
Consult again the statistician
Prepare for editing
Prepare for presentation
Data Analysis
Data analysis is the process of inspecting, refining, and modeling
data with the goal of discovering useful information, informing
conclusions, and supporting decision-making. It involves various
techniques and methods to interpret raw data and extract
meaningful insights.
Data analysis is the backbone of modern decision-making
processes across various domains, ranging from business and
finance to healthcare and scientific research. At its core, data
analysis involves the systematic examination of raw data to
uncover valuable insights, trends, and patterns that can inform
strategic decisions and actions.
Types of Data Analysis
Descriptive Analysis: Descriptive analysis involves summarizing and
describing the basic features of a dataset, such as its central tendency,
dispersion, and distribution. Descriptive statistics, histograms, and
frequency tables are often used in this type of analysis.
Inferential Analysis: Inferential analysis involves making inferences and
predictions about a population based on a sample of data. It uses
statistical methods such as hypothesis testing, confidence intervals, and
regression analysis to draw conclusions about relationships and
differences between variables.
Types of Descriptive Analysis
Frequency Distribution Analysis: This analysis technique organizes data
into intervals or categories and counts the frequency of observations
falling into each interval. It helps in understanding the distribution and
pattern of the data.
• Formula: Ef = N
E = sum of f= frequency N= sample size
Measures of central tendency : Statistical measures used to describe
the center or typical value of a dataset. They provide a single value that
represents the "center" of the data distribution. The three main
measures of central tendency are the :
Mode - a numeric value in a distribution that occurs most frequently.
Median - an index of average position in a distribution of numbers.
Mean - the point on the score scale that is equal to the sum of the
scores divided by the total number of scores.
Types of Inferential Analysis
Hypothesis Testing: Hypothesis testing involves making inferences
about population parameters based on sample data. This typically
involves formulating a null hypothesis (H0) and an alternative
hypothesis (H1), collecting sample data, and using statistical tests
to determine whether there is enough evidence to reject the null
hypothesis in favor of the alternative hypothesis.
Regression Analysis: Regression analysis is used to model the
relationship between one or more independent variables and a
dependent variable. It helps in making predictions about the
dependent variable and understanding the strength and direction
of the relationship between variables in the population.
How to conduct Data Analysis
Define Objectives: Clearly define the goals and objectives of the
analysis. Understand the questions you're trying to answer or the
problems you're trying to solve. This step sets the direction for the
entire analysis process.
Data Collection: Gather relevant data from diverse sources, ensuring
its accuracy, completeness, and relevance to the objectives of the
analysis. Utilize appropriate data collection methods and tools.
Choose Appropriate Data Analysis Methods: Select data analysis
methods that are appropriate for your research design and data type.
This could include quantitative methods or qualitative methods.
Data Preparation and Cleaning: Prepare your data for analysis by
cleaning and organizing it. Address issues such as missing values,
outliers, and data inconsistencies to ensure data quality.
Data Analysis: Conduct the data analysis using the selected
methods. Analyze quantitative data using statistical software
such as SPSS. For qualitative data, employ manual coding or
qualitative data analysis software such as NVivo.
Interpretation of Findings: Interpret the results of the data
analysis in relation to your research objectives and hypotheses.
Identify key themes, patterns, or relationships in the data and
discuss their significance within the context of existing literature.
Writing Up Results: Present the findings of your data analysis in
a clear and organized manner in the results section of your
thesis. Use tables, figures, and narrative descriptions to
communicate your findings effectively.