Data Analysis UNIT-III
Data Analysis UNIT-III
1. Descriptive Analysis
Descriptive Analysis looks at data and analyzes past events for insight as to
how to approach future events. It looks at the past performance and
understands the performance by mining historical data to understand the cause
of success or failure in the past. Almost all management reporting such as
sales, marketing, operations, and finance uses this type of analysis.
Example: Let’s take the example of DMart, we can look at the product’s
history and find out which products have been sold more or which products
have large demand by looking at the product sold trends, and based on their
analysis we can further make the decision of putting a stock of that item in
large quantity for the coming year.
Diagnostic Analysis
Diagnostic analysis works hand in hand with Descriptive Analysis. As
descriptive Analysis finds out what happened in the past, diagnostic Analysis,
on the other hand, finds out why did that happen or what measures were taken
at that time, or how frequently it has happened. it basically gives a detailed
explanation of a particular scenario by understanding behavior patterns.
Example: Let’s take the example of Dmart again. Now if we want to find out
why a particular product has a lot of demand, is it because of their brand or is
it because of quality. All this information can easily be identified using
diagnostic Analysis.
3. Predictive Analysis
Information we have received from descriptive and diagnostic analysis, we can
use that information to predict future data. it basically finds out what is likely
to happen in the future. Now when future data doesn’t mean we have become
fortune-tellers, by looking at the past trends and behavioral patterns we are
forecasting that it might happen in the future.
Example: The best example would be Amazon and Netflix recommender
systems. You might have noticed that whenever you buy any product from
Amazon, on the payment side it shows you a recommendation saying the
customer who purchased this has also purchased this product that
recommendation is based on the customer purchase behavior in the past. By
looking at customer past purchase behavior analyst creates an association
between each product and that’s the reason it shows recommendation when
you buy any product.
4. Prescriptive Analysis
This is an advanced method of Predictive Analysis. Now when you predict
something or when you start thinking out of the box you will definitely have a
lot of options, and then we get confused as to which option will actually work.
Prescriptive Analysis helps to find which is the best option to make it happen
or work. As predictive Analysis forecast future data, Prescriptive Analysis on
the other hand helps to make it happen whatever we have forecasted.
Prescriptive Analysis is the highest level of Analysis that is used for choosing
the best optimal solution by looking at descriptive, diagnostic, and predictive
data.
Example: The best example would be Google’s self-driving car, by looking
at the past trends and forecasted data it identifies when to turn or when to slow
down, which works much like a human driver.
Statistical Analysis
Statistical Analysis is a statistical approach or technique for analyzing data
sets in order to summarize their important and main characteristics generally
by using some visual aids. This approach can be used to gather knowledge
about the following aspects of data:
1. Main characteristics or features of the data.
2. The variables and their relationships.
3. Finding out the important variables that can be used in our problem.
2. Collect Data
Find Your Source: Determine what information can be collected from
existing sources, and what you need to find elsewhere.
Standardize Collection: Create file storage and naming system ahead of
time.
Keep Track: Keep data organized in a log with dates and add any source
notes as you go.
Google trends
Keep it Succinct: Organize data in a way that makes it easy for different
audiences to skim through it to find the information most relevant to them.
Make it Visual: Use data visualizations techniques, such as tables and
charts, to communicate the message clearly.
Include an Executive Summary: This allows someone to analyze your
findings upfront and harness your most important points to influence their
decisions.
Statistics simply means numerical data, and is field of math that generally
deals with collection of data, tabulation, and interpretation of numerical data.
It is actually a form of mathematical analysis that uses different quantitative
models to produce a set of experimental data or studies of real life. It is an area
of applied mathematics concern with data collection analysis, interpretation,
and presentation. Statistics deals with how data can be used to solve complex
problems. Some people consider statistics to be a distinct mathematical
science rather than a branch of mathematics.
Statistics makes work easy and simple and provides a clear and clean picture
of work you do on a regular basis.
Basic terminology of Statistics :
Population –
It is actually a collection of set of individuals or objects or events whose
properties are to be analyzed.
Sample –
It is the subset of a population.
Types of Statistics :
1. Descriptive Statistics :
Descriptive statistics uses data that provides a description of the population
either through numerical calculation or graph or table. It provides a graphical
summary of data. It is simply used for summarizing objects, etc. There are two
categories in this as following below.
(a). Measure of central tendency –
Measure of central tendency is also known as summary statistics that is used to
represents the center point or a particular value of a data set or sample set.
In statistics, there are three common measures of central tendency as shown
below:
(i) Mean :
It is measure of average of all value in a sample set.
For example,
ii) Median :
It is measure of central value of a sample set. In these, data set is ordered from
lowest to highest value and then finds exact middle.
For example,
(iii) Mode :
It is value most frequently arrived in sample set. The value repeated most of
time in central set is actually mode.
For example,