Assignment Stats
Assignment Stats
Statistics may be defined as the science of the Collection, Presentation, Analysis and Interpretation of Numerical Facts and data. In other words we can say Statistics are the numerical statement of facts capable of analysis and interpretation and the science of statistics is the study of the principles and the methods applied in Collecting, presenting, analysis and interpreting the numerical data in any field of Inquiry.
Statistics is essential for a country. Different policies of the government are based on statistics. Suppose if the government wants to revise the pay scales of employees in view of an increase in the living cost, statistical methods will be used to determine the rise in the cost of living. Preparation of federal and provincial government budgets mainly depends upon statistics because it helps in estimating the expected expenditures and revenue from different sources. So statistics are the eyes of administration of the state.
In Economics:
Statistics play an important role in economics. Economics largely depends upon statistics. National income accounts are multipurpose indicators for the economists and administrators. Statistical methods are used for preparation of these accounts. In economics research statistical methods are used for collecting and analysis the data and testing hypothesis. The relationship between supply and demands is studies by statistical methods, the imports and exports, the inflation rate, the per capita income are the problems which require good knowledge of statistics
In Banking:
Statistics play an important role in banking. The banks make use of statistics for a number of purposes. The banks work on the principle that all the people who deposit their money with the banks do not withdraw it at the same time. The bank earns profits out of these deposits by lending to others on interest. The bankers use statistical approaches based on probability to estimate the numbers of depositors and their claims for a certain day
In auditing sampling techniques are commonly used. An auditor determines the sample size of the book to be audited on the basis of error.
Statistics In Finance:
Statistics provide help to investor in forecasting the future prospects of an investment before investing in it. So that he can expect maximum return for his investment and save himself from any loss.
TYPES OF AVERAGES:
The most common types of averages are: 1. 2. 3. 4. 5. Arithmetic Mean (or simple mean) Geometric Mean Harmonic Mean Median Mode
Arithmetic Mean:
The arithmetic mean is defined as the sum of all observations divided by the number of observations. If a set of n observations, then arithmetic mean is denoted by Formula for Ungrouped Data (read X bar).
Note:
Arithmetic mean for a population data is computed in the same way but it is denoted by (the
Geometric Mean:
Geometric mean can be defined in the following terms: Geometric mean is the nth positive root of the product of n positive given values Hence, geometric mean for a value of and given as under: containing values such as is denoted by
positive values with repeated values such as times respectively then geometric mean will becomes:
(For Grouped Data)
are
Where When n is large (i.e. greater than two) the computation of geometric mean becomes difficult by this formula. Making use of logarithms, the formula of geometric mean becomes as
For Ungrouped Data For Grouped Data
Note: If any value is zero or negative, then the geometric mean is not computed
The harmonic mean is one of several kinds of Average. Typically, it is appropriate for situations when the average of rates is desired. Harmonic mean in mathematical terms is defined as follows:
For Ungrouped Data For Grouped Data
MEDIAN:
The most middle value of an arranged data is called Median. In other words median is that quantity which divides the data into two equal parts, i.e. 50%of the data lie below the median and 50% lie above the median. It is also known as a positional average. It is denoted by read as X- tilde.
Median = Value of
item
number
. The median lies in the group (class) which corresponds to the cumulative frequency in which
Where = Lower class boundary of the model class = Frequency of the median class = Number of values or total frequency = Cumulative frequency of the class preceding the median class = Class interval size of the model class
MODE:
Mode is the value which occurs the greatest number of times in the data. When each value occur the same numbers of times in the data, there is no mode. If two or more values occur the same numbers of time, then there are two or more modes and distribution is said to be multi-mode. If the data having only one mode the distribution is said to be uni-model and data having two modes, the distribution is said to be bi-model.
Or
Where = Lower class boundary of the model class = Frequency of the model class (maximum frequency) = Frequency preceding the model class frequency = Frequency following the model class frequency = Class interval size of the model class
If x 1 and x2 are the means of the two groups computed from the values n1 and n2 then the mean x is given by the formula x = n1x1+n2x2/ n1+n2 If each observation in the data is replaced by x, the sum total of all the observations remains unchanged. x = x1,x2,x3,x4........,xn/n So x1,x2,x3,x4........,xn = nx Replacing each observation by x, we get x+x+x........+x = nx If every value of the variable x is either increased, decreased, divided or multiplied by a constant, the observations so obtained also increases, decreases, gets multiplied or gets divided respectively by the same constant. Algebraic sum of the deviation of a set of values from their
Quartiles, Percentiles & Deciles: Although quartiles, percentiles and deciles are not averages. They are included here because they are of same nature as of Median.
Quartile The values which divide data into four equal parts, where each part contains 25% observations are called Quartiles. These values are denoted by Q1, Q2 and Q3. Where, Q1 = First Quartile or Lower Quartile Q2 = Second Quartile = Median Q3 = Third Quartile or Upper Quartile
MEASURE OF DISPERSION:
A measure of dispersion is the degree of closeness of the values from its some central value i.e. means median or mode, etc. In other words, it may be concluded that measure of central tendency is a single number to represent a group of data whereas a measure of dispersion is used to measure differences in data. There are two types of measure of dispersion which are: (a) Absolute Measure of Dispersion (b) Relative Measure of Dispersion
RANGE: It is defined as the differences between the largest and smallest values in a set of data. Range = Max. Value Min. Value QUARTILE DEVIATION (Semi-Interquartile Range): Quartile deviation defines as Q.D= Q3-Q1/2 MEAN DEVIATION: It is defined as the mean of the absolute values of the deviation taken from some average (Mean or Median) called Mean Deviation or simply Mean Deviation about Mean and Mean Deviation about Median. Mean Deviation about Mean / Median: Mean Deviation about mean or median is defined as
Grouped Data Mean Deviation about Mean Mean Deviation about Median M.D( )=xi M.D( )=xi /n /n
VARIANCE AND STANDRAD DEVIATION: If are the n-values, then variance is defined as
Ungrouped Data
( ) OR
Standard Deviation
OR
OR
S.D=
Relative Measure of Dispersion: When the two distributions are in different units of measurement, it will be difficult to compare them. In such cases the relative measure of dispersion are used. Relative measures of dispersion are independent of units of measurements. They are generally expressed in the term of coefficients, ratios or percentages.
2. Coefficient of Quartile Deviation: The Coefficient of Quartile Deviation is defined as, C. of Q.D = Q3 Q1/Q3 + Q1
3. Coefficient of Mean Deviation: The Coefficient of Mean Deviation is defined as, C. of MD = MD ( )/ & C. of MD ( )/ ( )
4. Coefficient of Variation: The coefficient of variation is an important and the most commonly used dispersion. C. of V. = Standard deviation/Mean *100 = S.D/( ) * 100
relative measure of