Ethio Coders
2. If it is a typo, remove it or fix it.
3. Understand why they exist and their impact on the questions we are trying to answer about
our data.
4. The five-number summary is more informative than the mean and standard deviation when we have
outliers.
5. Be careful in reporting.
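As a rough illustration of point 4, the sketch below compares the five-number summary with the mean and standard deviation before and after adding one outlier. The data values and the helper name five_number_summary are made up for this example, and the quartiles use the "inclusive" interpolation method, which is only one of several conventions.

```python
import statistics

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    ordered = sorted(values)
    # Quartiles by linear interpolation; other tools may use a different rule.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return ordered[0], q1, median, q3, ordered[-1]

# Hypothetical data: the second list adds a single extreme value.
clean = [10, 12, 13, 14, 15, 16, 18]
with_outlier = clean + [200]

for label, data in (("clean", clean), ("with outlier", with_outlier)):
    print(label)
    print("  five-number summary:", five_number_summary(data))
    print("  mean:", statistics.mean(data))
    print("  standard deviation:", round(statistics.stdev(data), 2))
```

In this made-up data the quartiles and median barely move when the outlier is added, while the mean and standard deviation change a lot.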
To check whether your data is normally distributed, use a normal quantile plot and
statistical tests such as the Kolmogorov-Smirnov test.
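A minimal sketch of both checks, using only the Python standard library. The function names and sample data are hypothetical. The code computes the points of a normal quantile plot and the Kolmogorov-Smirnov distance only; it does not compute a p-value, and because the normal parameters are estimated from the same sample, standard Kolmogorov-Smirnov tables would not apply directly.

```python
from statistics import NormalDist, mean, stdev

def normal_quantile_points(values):
    """Pairs (theoretical z-score, observed value) for a normal quantile plot."""
    ordered = sorted(values)
    n = len(ordered)
    standard = NormalDist()
    # Plotting position (i - 0.5) / n is one common convention among several.
    return [(standard.inv_cdf((i - 0.5) / n), x)
            for i, x in enumerate(ordered, start=1)]

def ks_distance_to_normal(values):
    """Largest gap between the empirical CDF and a normal CDF fitted to the data."""
    ordered = sorted(values)
    n = len(ordered)
    fitted = NormalDist(mean(ordered), stdev(ordered))
    largest = 0.0
    for i, x in enumerate(ordered, start=1):
        cdf = fitted.cdf(x)
        # The empirical CDF jumps from (i - 1) / n to i / n at x; check both sides.
        largest = max(largest, i / n - cdf, cdf - (i - 1) / n)
    return largest

# Hypothetical samples: one roughly symmetric, one with an extreme value.
roughly_normal = [10, 12, 13, 14, 15, 16, 18]
skewed = roughly_normal + [200]

print(ks_distance_to_normal(roughly_normal))
print(ks_distance_to_normal(skewed))
```

If the data is close to normal, the quantile points fall near a straight line and the distance is small; a large distance suggests a departure from normality.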
-Spreadsheets have lots of users and can hold large datasets; the maximum sheet size and the
number of sheets allowed vary between programs
-In Excel, press F1 to open the Help menu.
- To customize the Quick Access Toolbar, go to File > Options > Quick Access Toolbar.
- The Data menu has items such as sorting and filtering data.
- Adding a dollar sign to a cell address makes it absolute, depending on where the dollar signs
are placed. A dollar sign in front of the letter fixes the column, while a dollar sign in front of
the number fixes the row; if both are prefixed with a dollar sign, the reference is fixed to the
exact cell.
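The effect of the dollar signs can be sketched by simulating what happens to a reference when a formula is copied to another cell. This is an illustrative model only, not Excel's implementation; the function names are hypothetical and only single-cell references such as A1 or $B$7 are handled.

```python
import re

def column_to_index(letters):
    """Convert column letters to a 1-based index (A -> 1, Z -> 26, AA -> 27)."""
    index = 0
    for ch in letters:
        index = index * 26 + (ord(ch) - ord("A") + 1)
    return index

def index_to_column(index):
    """Convert a 1-based column index back to letters."""
    letters = ""
    while index > 0:
        index, remainder = divmod(index - 1, 26)
        letters = chr(ord("A") + remainder) + letters
    return letters

def copy_reference(ref, rows_down, cols_right):
    """Return the reference as it would read after copying the formula."""
    match = re.fullmatch(r"(\$?)([A-Z]+)(\$?)(\d+)", ref)
    if match is None:
        raise ValueError(f"not a single-cell reference: {ref!r}")
    col_fixed, col, row_fixed, row = match.groups()
    # A part prefixed with $ stays put; a part without it shifts with the copy.
    new_col = col if col_fixed else index_to_column(column_to_index(col) + cols_right)
    new_row = int(row) if row_fixed else int(row) + rows_down
    return f"{col_fixed}{new_col}{row_fixed}{new_row}"

for ref in ("A1", "$A1", "A$1", "$A$1"):
    print(ref, "->", copy_reference(ref, rows_down=1, cols_right=1))
```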
- To save data for the first time, use Save As; after that, Save keeps writing to the same file.
-Excel formulas:
  UPPER(text): converts a text string to upper case letters
  TRUE(): returns the logical value TRUE
  LEN(text): returns the number of characters in a text string
  STDEV(n1, n2, …): estimates standard deviation based on a sample (ignores logical values and text)
  SUM(n1, n2, …): adds all the numbers in a range of cells
  TRIM(text): removes all spaces from a text string except single spaces between words
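For readers who also work in Python, the functions above have rough counterparts. The sketch below is an approximation under stated assumptions, not a faithful reimplementation of Excel: the excel_ prefixed names are hypothetical, and the STDEV helper simply filters out text and logical values before computing the sample standard deviation.

```python
import statistics

def excel_upper(text):
    """Rough counterpart of UPPER(text)."""
    return text.upper()

def excel_len(text):
    """Rough counterpart of LEN(text)."""
    return len(text)

def excel_trim(text):
    """Rough counterpart of TRIM(text): keep single spaces between words only."""
    return " ".join(part for part in text.split(" ") if part)

def excel_stdev(values):
    """Sample standard deviation, skipping text and logical (bool) entries."""
    numbers = [v for v in values
               if isinstance(v, (int, float)) and not isinstance(v, bool)]
    return statistics.stdev(numbers)

def excel_sum(values):
    """Rough counterpart of SUM over a range that contains only numbers."""
    return sum(values)

print(excel_upper("ethio coders"))
print(excel_len("Ethio Coders"))
print(excel_trim("  Ethio    Coders  "))
print(excel_stdev([2, 4, 4, 4, 5, 5, 7, 9, "n/a", True]))
print(excel_sum([1, 2, 3]))
```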
- SUBSTITUTE does not change the original cell; it shows the transformed data in
a new cell. SUBSTITUTE({text}, {old_text}, {new_text}), where {text} is the
cell to change, {old_text} is the string sequence to be replaced, and {new_text} is the new
string put in place of the old one.
- MID(text, start_num, num_chars) extracts characters from the middle of a text string:
start_num: position of the first character you want to extract in the text
num_chars: the number of characters you want MID to return from the text
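The two text functions can be mimicked in a few lines of Python. This is a sketch with hypothetical function names: it covers only the three-argument form of SUBSTITUTE described in these notes, and it shows that MID counts positions from 1 whereas Python slices count from 0.

```python
def substitute(text, old_text, new_text):
    """Return a new string with every old_text replaced; text itself is unchanged."""
    return text.replace(old_text, new_text)

def mid(text, start_num, num_chars):
    """Return num_chars characters of text, starting at 1-based position start_num."""
    if start_num < 1 or num_chars < 0:
        raise ValueError("start_num must be >= 1 and num_chars must be >= 0")
    begin = start_num - 1  # convert the 1-based position to a 0-based index
    return text[begin:begin + num_chars]

original = "2023-01-15"
print(substitute(original, "-", "/"))
print(original)  # the original value is left as it was
print(mid("Ethio Coders", 7, 6))
```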
* Logical functions IF, AND, OR, NOT: =IF(condition, value if TRUE, [value if
FALSE]); for AND: IF(AND({condition1}, {condition2}), …)
*Pivot tables
*Named ranges
- Comparison operators: <> not equal to, > greater than, < less than, >= greater than or
equal to, <= less than or equal to, = equal to
Example: If the maximum number of any one fruit in an order exceeds 10 pieces, or the
total order (all fruit) exceeds 20 pieces, mark it "Special order"; otherwise,
mark it "NO". This is written as =IF(OR(MAX(B2:D2)>10, E2>20), "Special order",
"NO")
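The same rule can be written as a short Python function to make the logic explicit. This sketch assumes that B2:D2 hold the counts of three fruits and that E2 holds their total; the function name and test orders are hypothetical.

```python
def label_order(fruit_counts):
    """Apply the IF(OR(...)) rule to the per-fruit counts of one order."""
    total = sum(fruit_counts)  # assumed to play the role of cell E2
    if max(fruit_counts) > 10 or total > 20:
        return "Special order"
    return "NO"

# Hypothetical orders: counts of three fruits per order.
for order in ([11, 1, 1], [7, 7, 7], [5, 5, 5], [10, 5, 5]):
    print(order, "->", label_order(order))
```

Note that both comparisons are strict, so an order with exactly 10 of one fruit and exactly 20 pieces in total is still marked "NO".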
- Lookup functions: use a keyword and an index to "look up" a value in a table. There
are both horizontal and vertical lookup functions, although we will focus on the vertical
one, called VLOOKUP
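A minimal sketch of the idea behind a vertical lookup, assuming exact matching only: search the first column of a table for the keyword and return the value from the column given by a 1-based index. The function name, the price table, and the choice to return None when nothing matches are all assumptions of this example (Excel reports a #N/A error instead).

```python
def vlookup(lookup_value, table, col_index):
    """Return the value in column col_index (1-based) of the first matching row."""
    for row in table:
        if row[0] == lookup_value:  # the keyword is always searched in column 1
            return row[col_index - 1]
    return None  # no match found

# Hypothetical table: fruit name, unit price, stock.
prices = [
    ["apple", 0.50, 120],
    ["banana", 0.25, 300],
    ["cherry", 3.00, 45],
]

print(vlookup("banana", prices, 2))
print(vlookup("cherry", prices, 3))
print(vlookup("durian", prices, 2))
```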
-Data visualization: pie charts, bar charts, scatter and line plots, histograms, box plots
To draw a pie chart: select the categories at the top of the table > hold down the Control key and
select the bottom row with your mouse > copy the highlighted data to another location > paste
using the transpose feature so it creates columns instead of rows > select the data and choose
"Insert pie chart" as before.
5. Business Metrics
-Marketing: sales: leads, bookings, deals, margins; customers: CAC, CPA, lifetime
value, active users, churn; web traffic: CTR, CPC, CPL, stickiness
-Growth:
-Sales
-Finance
Customer Acquisition Cost (CAC) = (total marketing expenses + total marketing and
sales salaries)/number of customers converted to paid customers
-conversion rate at each level = number at the current level / number at the prior level
-total sales and marketing cost per lead = (marketing cost + sales and marketing salaries +
overhead costs for sales and marketing) / number of leads
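The three formulas above can be checked with a small worked example. All figures below are invented for illustration, and the function names are hypothetical; the last function implements the formula that divides total sales and marketing cost by the number of leads.

```python
def customer_acquisition_cost(marketing_expenses, salaries, paid_customers):
    """CAC = (marketing expenses + marketing and sales salaries) / paid customers."""
    return (marketing_expenses + salaries) / paid_customers

def conversion_rates(funnel):
    """Rate for each level = count at that level / count at the prior level.

    funnel is a list of (level name, count) pairs ordered from first to last.
    """
    rates = {}
    for (_, prior_count), (name, count) in zip(funnel, funnel[1:]):
        rates[name] = count / prior_count
    return rates

def cost_per_lead(marketing_cost, salaries, overhead, leads):
    """Total sales and marketing cost divided by the number of leads."""
    return (marketing_cost + salaries + overhead) / leads

# Hypothetical figures for one period.
funnel = [("visitors", 10000), ("leads", 500), ("paid customers", 50)]

print(customer_acquisition_cost(50000, 30000, 400))
print(conversion_rates(funnel))
print(cost_per_lead(20000, 25000, 5000, 500))
```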