Data Analysis Training Final
Data Analysis Training Final
ሂደት
Training Using Spreadsheet
ውጤት
Adama
Dire International Hotel
Outline
01
Introduction
02
Objective
07
Data analysis using Excel
Zone
Region Woreda
Ministry of
School
Education
TEST
USE
COMMUNICATE REFINE
CALCULATE/DEDUCE
IMPLEMENT
TEST
REAL WORLD
SENSITIVE
SOLUTION
INTERPRETATION MODEL SOLUTION
Input
Process
Overview:
Every formula in Excel starts with the equal sign (“=”).
This sign is very meaningful and tells the computer:
“What’s written next to the Equal sign is not simple text. I wan
t to calculate something.
If a combination of a letter and a number appears next to it (e
g. B6, G78, D13) they refer to addresses of cells in the works
heet.
If arithmetic signs appear (signs like + - * / ) they refer to the
operations they mean (eg. adding, subtracting etc.)”
Formula
Examples of formulas:
=B4
In words: Show me the value of the cell in address B4.
=B4*7+3
In words: Multiply the value of the cell B4 by 7, and add 3 to i
t.
=(B4+B5+B6)*D6
In words: add up the values of the cells B4, B5, and B6, and
multiply their sum by the value of cell D6.
Cont’d
+ plus
- minus
* multiplication
/ division There is also:
^ exponential
for example =2^4 (will give the value of 16)
Cont’d
Examples of formulas:
One note:
You should take into consideration the order in which arithme
tic is calculated, consider the following two formulas:
=1+2*3+4
=(1+2)*3+4
They are not equal, the first one gives the value of 11 (multipl
ication is calculated first), and the second gives the value of
13.
Common mistakes when using formulas
Forgetting to write the Equal sign
Don’t forget to start it always with the “=” sign, otherwise, it will be ju
st written to the cell as text.
Mixing texts with numbers
Formulas can’t be based on cells that have mixed numbers and text
s. For example, if a cell contains something like this:
120 Kg. or 60 Km/h or 2,500 Birr
Formulas can’t be based on such kinds of cells.
You have 2 options:
A. Write the text (signs or letters) in an adjacent cell, and not togethe
r with the number.
B. Write the number only in the cell, and afterward use cell formattin
g to add special signs or letters (but either way don’t type them down
inside the cell with the numbers).
Relative Referencing
What it does?
sums items in a list matching a condition
Syntax:
sumif(in this range,values that meet this criteria,[sum-this-range])
Example:
=sumif(A1:A20,10) = sums the cells with the value of "10"
SUMIF
= CORREL(array1,array2)
Array1 is a cell range of values.
Array2 is a second cell range of values.
Correlation
Returns the correlation coefficient of the array1 and
array2 cell ranges. Use the correlation coefficient to
determine the relationship between two properties. For
example, you can examine the relationship between a
location's average temperature and the use of air
conditioners.
= CORREL(array1,array2)
Array1 is a cell range of values.
Array2 is a second cell range of values.
Rank Function
If
And
Or
Not
Choose
Iferror
Istext
IF
What it does?
checks whether a condition is met and returns one value if TRUE and another if FALSE
Syntax:
if(is-this-true,then do this, or this)
Example:
=if(25<15,"looser","winner") = winner
Other:
=sumif(condition range,condition, sum range)
IFERROR
What it does?
an easy way to handle errors in formulas
IFERROR returns the value you want incase of
an error with the formula
Syntax:
iferror(formula, value to return if there is an error)
Example:
=iferror(1/0,"can't divide by zero") can't divide by zero
=iferror(0/1,"can't divide by zero") 0
Text Formulas
Proper
Trim
Dollar
Rept
Text
Type
Error Types
Error Type When It Happens
#DIV/0! When you divide by ZERO
#N/A! When a formula or a function inside
a formula cannot find the referenced data.
#NAME? When the text in a formula is not recognized.
#NULL! When a space was used instead of a
comma in formulas that reference multiple
ranges. A comma is necessary to separate
range references.
Error Types
Error Type When It Happens
#NUM! When a formula has numeric data
#REF! When a reference is invalid.
#VALUE! When the wrong type of operand or
function argument is used
Absolute and Relative Referencing
• Absolute cell reference contains a ($)
in a Row and/or Column
– Do not change when copied or filled
– Use when you want to consistently refer to a certain cell
A1 Relative
A$1 Column is relative; Row is Constant
$A1 Row is relative; Column is absolute
$A$1 BOTH are Absolute
Show Formulas
Merge & Center
Database In Excel
Introduction
A database is a collection of logically related data designed to m
eet the information needs of one or more users
A database defines a structure for storing information.
Databases are typically organized into tables, which are collections
of related items.
A database is a collection of information that is organized so t
hat it can easily be accessed, managed, and updated
Database Function
Averages the values in a column of a list or database that
matches the conditions you specify.
= DAVERAGE (database,field,criteria)
The database is the range of cells that makes up the list
or database.
The field indicates which column is used in the function.
Field can be given as text with the column label enclosed
between double quotation marks, such as “-----” as a nu
mber that represents the position of the column within
the list.
Criteria are the range of cells that contains the conditions
you specify
DSUM Function
Adds the numbers in a column of a
list or database that match conditions you specify.
= DSUM (database,field,criteria)
The database is the range of cells that makes up
the list or database.
Field indicates which column is used in the functio
n. Field can be given as text with the column label
enclosed between double quotation marks, such as
”------” or as a number that represents the position
of the column within the list
Criteria are the range of cells that contains the co
nditions you specify
Summary
Function What it does
SUM(range) Adds a range of cells
(SUMIF(range,criteria,sum_ran Adds cells from sum_range if the condition specified in criteria on range is met.
ge)
AVERAGE(range) Calculates the mean (arithmetic average) of a range of cells
MEDIAN(range) Calculates the median value for a data set; half the values in the data set are greater t
han the median and half are less than the median
MAX(range) Returns the maximum value of a data set
MIN(range) Returns the minimum value of a data set
SMALL(range,k) Returns the kth smallest or kth largest value in a specified data range
LARGE(range,k)
COUNT(range) Counts the number of cells containing numbers in a range
COUNTA(range) Counts the number of non-blank cells within a range
COUNTBLANK(range) Counts the number of blank cells within a range
COUNTIF(range,value) Counts the number of cells in range that are the same as value.
VAR(range) and Calculates the variance of a sample or an entire population (VARP); equivalent to the
VARP(range) square of the standard deviation
STDEV(range) and STEVP(ra Calculates the standard deviation of a sample or an entire population (STDEVP); the s
nge) tandard deviation is a measure of how much values vary from the mean.
Summary
Mode =Mode(Range)
Median =Median(Range)
Mean =Average(Range)
Variance = var(range)
Standard Deviation =sqrt(var(range))
Correlation Coefficient =CORREL(Range1,Range2)
RAND( ) Returns a random value between 0 and 1.
ROUND( X, Y) Returns the value X with Y digits after decimal point.
SIN( X ) Returns the Sin for X.
COUNT( X:Y ) Returns the count of numerical values in the X:Y.
AVERAGE( X:Y) Returns the mean value in the list X:Y.
STDEV( X:Y) Returns the standard deviation of the list X:Y
Exercise
1. Prepare a table that shows Level 1 to 4 primary and secondary schools in number and
percentage.
2. Show the table in activity number 1 using a bar graph in comparison.
3. Compare all Woredas in levels 1 to 4 using bar graphs.
4. Compare all schools in levels 1 to 4 using input, process, and output standards.
5. Show percentage of schools which fit input, process, and output standards using a bar
graph.
6. Show the percentage of schools by level, urban, rural, and Woredas using a bar graph i
n a comparison format.
7. Using a bar graph compare the average of all school inspections versus the national av
erage.
8. Using a bar graph comparing the average of all school inspections versus the national a
verage
9. Which input, process and output standard more affected the 2006-2008 school inspec
tion?
10. Show all standards in the bar graph of your zone inspection report.
11. Show all Woredas in a bar graph that compares the worst standards.(at least by three s
tandards).
12. Show all standards in a line graph that compares statistical tools of range, variance, sta
ndard deviation, and coefficient of variation in a single table.
Thank-you