Quantitative Analysis For Business Module
Quantitative Analysis For Business Module
LEARNING MODULE
in
Quantitative Analysis of Business
(MATH 213)
Quantitative Analysis of Business
Module Objectives
The main objective is to ensure that learners appreciate the importance of mathematics
and statistics for successful decision making in a range of business disciplines (e.g.
finance, economics, marketing etc.). They learn how to apply these mathematical skills
to manipulate and interpret numerical data. They are required to use a statistical
package to support them in their application of mathematics and statistics in their
analysis of business data.
Course Outline:
Prelims
I. Aasdasd
a. Sdfdsf
b. Sfdsad
c. Asd
d.
II. Sadsadsd
Midterm
III. Adsdsa
IV. Bgnb
Semi-Finals
V. Fsdasd
VI. Dsfdsf
Finals
VII. Asdsdsa
VIII. Dasd
IX.
Quantitative Analysis of Business
Introduction
Conceptual differences
Conceptual differences refer to what a specific type of analysis does. Here are some conceptual
differences between quantitative analysis and qualitative analysis:
Functional differences
Functional differences are differences in methodology. Here are some functional differences
between quantitative and qualitative analysis:
Quantitative methods
These are some quantitative data analysis methods:
Regression analysis
Regression analysis shows the relationship between different values. It allows data scientists to
show one dependent variable and several independent variables on a graph.
Linear programming
Linear programming is a data modeling technique that uses data and mathematics. In this
modeling technique, you minimize or maximize the linear function depending on the constraints.
Quasi-experiment
This type of analysis doesn't use randomization when evaluating information. It strives to
establish a cause-and-effect relationship within data.
Correlation analysis
Correlation analysis, or bivariate, is a process in which you identify relationships between
variables. After identifying relationships, you measure the function and magnitude of the
relationships.
Descriptive analysis
Descriptive analysis is a process in which you manipulate raw data to create a description. This
makes it simpler for viewers to interpret and understand.
Finance
Financial companies can use quantitative analysis together with unique insights to improve
financial processes and efficiency. Quantitative data can help financial companies determine:
Marketing
Through quantitative data and analysis, marketers have greater access to customer data
through tools like social media. Companies can minimize their marketing budgets by using
quantitative data to optimize media purchases and marketing campaigns.
Lesson 1
Collection and Presentation of data
Lesson 1.1
Data Types and Sampling methods
STATISTICS
Statistics is a collection of methods for collecting, displaying, analyzing, and drawing
conclusions from data.
Statistics is the science of collecting, organizing, presenting, analyzing, and
interpreting data to assist in making more effective decisions
Statistics is a mathematical body of science that pertains to the collection, analysis,
interpretation or explanation, and presentation of data, or as a branch of mathematics.
Some consider statistics to be a distinct mathematical science rather than a branch of
mathematics.
Types of statistics:
Descriptive statistics – Methods of organizing, summarizing, and presenting
data in an informative way
- summarizes data from a sample using indexes such as the
mean or standard deviation
Inferential statistics – The methods used to determine something about a
population on the basis of a sample
- draws conclusions from data that are subject to random
variation (e.g., observational errors, sampling variation.
Statistical data
The collection of data that are relevant to the problem being studied is commonly
the most difficult, expensive, and time-consuming part of the entire research
project.
Statistical data are usually obtained by counting or measuring items.
Primary data are collected specifically for the analysis desired
Secondary data have already been compiled and are available for
statistical analysis
Statistical data are usually obtained by counting or measuring items. Most data can be
put into the following categories:
Qualitative data are measurements that each fail into one of several categories.
(hair color, ethnic groups and other attributes of the population)
Qualitative data are generally described by words or letters. They are not as
widely used as quantitative data because many numerical techniques do not
apply to the qualitative data. For example, it does not make sense to find an
average hair color or blood type.
Qualitative data can be separated into two subgroups:
dichotomic (if it takes the form of a word with two options (gender - male
or female)
polynomic (if it takes the form of a word with more than two options
(education - primary school, secondary school and university).
Quantitative data are observations that are measured on a numerical scale
(distance traveled to college, number of children in a family, etc.)
Quantitative data are always numbers and are the result of counting or
measuring attributes of a population.
Quantitative data can be separated into two subgroups:
discrete (if it is the result of counting (the number of students of a given
ethnic group in a class, the number of books on a shelf, ...)
continuous (if it is the result of measuring (distance traveled, weight of
luggage, …)
Examples:
Interval has values of equal intervals that mean something. For example, a
thermometer might have intervals of ten degrees.
Examples:
Celsius Temperature.
Fahrenheit Temperature.
IQ (intelligence scale).
SAT scores.
Time on a clock with hands.
4. Ratio – consists of numerical measurements where the distance between
numbers is of a known, constant size, in addition, there is a nonarbitrary zero
point.
Age.*
Weight.
Height.
Sales Figures.
Ruler measurements.
Income earned in a week.
Years of education.
Number of children.
Variable
Variable is a logical set of attributes. Variables can “vary” – for example, be high or low.
How high, or how low, is determined by the value of the attribute (and in fact, an
attribute could be just the word “low” or “high”).
A variable is a characteristic of a statistical unit being observed that may assume more
than one of a set of values to which a numerical measure or a category from a
classification can be assigned.
Dependent and Independent Variable
Imagine that a tutor asks 100 students to complete a maths test. The tutor wants to
know why some students perform better than others. Whilst the tutor does not know the
answer to this, she thinks that it might be because of two reasons: (1) some students
spend more time revising for their test; and (2) some students are naturally more
intelligent than others. As such, the tutor decides to investigate the effect of revision
time and intelligence on the test performance of the 100 students. The dependent and
independent variables for the study are:
Therefore, the aim of the tutor's investigation is to examine whether these independent
variables - revision time and IQ - result in a change in the dependent variable, the
students' test scores. However, it is also worth noting that whilst this is the main aim of
the experiment, the tutor may also be interested to know if the independent variables -
revision time and IQ - are also connected in some way.
Population
Population –The entire set of individuals or objects of interest or the measurements
obtained from all individuals or objects of interest. For example, the population of
German people share a common geographic origin, language, literature, and genetic
heritage, among others traits, that distinguish them from people of different nationalities.
As another example, the Milky Way galaxy comprises a star population.
Typically, the population is very large, making a census or a complete enumeration of
all the values in the population impractical or impossible.
Sample – A portion, or part, of the population of interest. It is a set of data collected
and/or selected from a statistical population by a defined procedure. The sample usually
represents a subset of manageable size. Samples are collected and statistics are
calculated from the samples so that one can make a inferences or extrapolations from
the sample to the population.
Sampling is concerned
with the selection of a
subset of individuals
from within a statistical
population to estimate
characteristics of the
whole population.
A sample should have the same characteristics as the population it is representing.
Sampling can be:
with replacement: a member of the population may be chosen more than once
(picking the candy from the bowl)
without replacement: a member of the population may be chosen only once
(lottery ticket)
Sampling methods
Sampling methods can be:
random (each member of the population has an equal chance of being selected)
nonrandom
The actual process of sampling causes sampling errors. For example, the sample may
not be large enough or representative of the population. Factors not related to the
sampling process cause non-sampling errors. A defective counting device can cause a
non-sampling error.
The following may be applied to one-dimensional data. Depending on the circumstances, it may
be appropriate to transform the data before calculating a central tendency. Examples are squaring
the values or taking logarithms. Whether a transformation is appropriate and what it should be,
depend heavily on the data being analyzed.
Arithmetic mean or simply, mean
the sum of all measurements divided by the number of observations in the data
set.
Median
the middle value that separates the higher half from the lower half of the data set.
The median and the mode are the only measures of central tendency that can be
used for ordinal data, in which values are ranked relative to each other but are
not measured absolutely.
Mode
the most frequent value in the data set. This is the only central tendency
measure that can be used with nominal data, which have purely qualitative
category assignments.
Geometric mean
the nth root of the product of the data values, where there are n of these. This
measure is valid only for data that are measured absolutely on a strictly positive
scale.
Harmonic mean
the reciprocal of the arithmetic mean of the reciprocals of the data values. This
measure too is valid only for data that are measured absolutely on a strictly
positive scale.
Weighted arithmetic mean
an arithmetic mean that incorporates weighting to certain data elements.
Truncated mean or trimmed mean
the arithmetic mean of data values after a certain number or proportion of the
highest and lowest data values have been discarded.
Interquartile mean
a truncated mean based on data within the interquartile range.
Midrange
the arithmetic mean of the maximum and minimum values of a data set.
Midhinge
the arithmetic mean of the two quartiles.
Trimean
the weighted arithmetic mean of the median and two quartiles.
Winsorized mean
an arithmetic mean in which extreme values are replaced by values closer to the
median.
Any of the above may be applied to each dimension of multi-dimensional data, but the
results may not be invariant to rotations of the multi-dimensional space. In addition,
there are the
Geometric median
which minimizes the sum of distances to the data points. This is the same as the
median when applied to one-dimensional data, but it is not the same as taking
the median of each dimension independently. It is not invariant to different
rescaling of the different dimensions.
Quadratic mean (often known as the root mean square)
useful in engineering, but not often used in statistics. This is because it is not a
good indicator of the center of the distribution when the distribution includes
negative values.
Simplicial depth
the probability that a randomly chosen simplex with vertices from the given
distribution will contain the given center
Tukey median
a point with the property that every halfspace containing it also contains many
sample points
Frequency Distribution
A frequency distribution is an orderly arrangement of data classified according to the
magnitude of the observations. When the data are grouped into classes of appropriate
size indicating the number of observations in each class we get a frequency distribution.
By forming frequency distribution, we can summarize the data effectively. It is a method
of presenting the data in a summarized form. Frequency distribution is also known as
Frequency table.
Classes: A large number of observations varying in a wide range are usually classified
in several groups according to the size of their values. Each of these groups is defined
by an interval called class interval. The class interval between 10 and 20 is defined as
10-20.
Class limits: The smallest and largest possible values in each class of a frequency
distribution table are known as class limits. For the class 10-20, the class limits are 10
and 20. 10 is called the lower class limit and 20 is called the upper class limit.
Class limit: Class limit is the midmost value of the class interval. It is also known as the
mid value.
If the class is 0-10, lower limit is 0 and upper limit is 10. So the mid value is
(0+10)/2 = 10/2 = 5
.
Magnitude of a class interval: The difference between the upper and lower limit of a
class is called the magnitude of a class interval.
Class frequency: The number of observation falling within a class interval is called
class frequency of that class interval.
A frequency distribution table is one way to organize data so that it makes more sense.
The data so distributed is called frequency distribution and the tabular form is called
frequency distribution table. Let us see with the help of example how to construct
distribution table.
The frequency distribution table lists all the marks and also show how many times
(frequency) they occurred.
The number which tells us how many times a particular data appears is called the
frequency. For example, 2 marks have been scored by five students which means
marks 2 occurs five times. Therefore, the frequency of score 2 is five. Similarly, the
frequency of marks 5 is three because three students scored five marks.
If the frequency of the frequency distribution table is changed into relative frequency
then frequency distribution table is called as relative frequency distribution table. For a
data set consisting of n values. If f is the frequency of a particular value then the ratio 'fn
' is called its relative frequency.
Solved Example
Question: Find the relative frequency from the data given below:
Class interval Frequency
20-25 10
25-30 12
30-35 8
35-40 20
40-45 11
45-50 4
50-55 5
Solution:
Here n = 70
Class interval Frequency (f) Relative Cumulative Frequency (fn)
20-25 10 10 / 70 = 0.143
25-30 12 12 / 70 = 0.171
30-35 8 8 / 70 = 0.114
35-40 20 20 / 70 = 0.286
40-45 11 11 / 70 = 0.157
45-50 4 4 / 7 0 = 0.057
50-55 5 5 / 70 = 0.071
Total n = 70