Data Analytics Unit-1
Data Analytics Unit-1
K.Koteswari
P. Preetha
Introduction:
sources of revenue.
Better balance risk vs. reward to reduce loss.
future?
How to optimize the delivery system?
Primary Focus Areas for Analytics
Understanding Customer Behavior :
Understanding customer behavior has always been a challenge for
organizations.
Customer behavior analytics is the process of collecting, analyzing, and
Come to know about data sources needed and available for the project.
The team formulates initial hypothesis that can be later tested with
data.
Phase 2: Data Preparation –
Steps to explore, preprocess, and condition data prior to modeling and
analysis.
It requires the presence of an analytic sandbox, the team execute, load,
STASTICA.
Phase 4: Model Building –Team develops datasets for
testing, training, and production purposes.
Team also considers whether its existing tools will suffice for
and it found that out of total return items more than 60%
of products were supplied by two vendors only, where the
vendor provided the wrong specification about products.
Essential tools used in Descriptive
Analytics :
Correlation Analysis : It is a statistical measure
that indicates the strength of the relationship
between two variables.
5 Why Analysis : It is a very structured approach
where we try to dig into a problem and peel it layer
by layer to reach the root cause of the problem.
Cause and Effect Analysis : Here, we identify all
possible reasons for one problem then we pick up all
the reasons as a problem one by one and try to find
other causes for that problem.
3. Predictive Analytics
Predictive analytics is used to make predictions
about future trends or events and answers the
question, “What might happen in the future?”.
Predictive analytics is the heart of business analytics,
it aims to help the organization by predicting
probabilities of occurrence of a future event or future
values of any essential business metrics.
Once organizations have a stable setup for
descriptive analytics, Predictive analytics combines
this historical data with advanced business protocols
(policy and rules) to forecast future values of
business events.
Predictive analytics allows organizations to become
forward–looking, providing an appetite to consume
calculated risk by anticipating customer behavior and
business outcomes.
Ex: sales in the next month/ quarter, employee attrition,
components:
1. Business Context
2. Technology
3. Data Science
Business Context :
Business analytics projects start with the business context and
ability of the organization to ask the right questions.
Another good example of business context driving analytics is
the ‘did you forget feature’ used by the Indian online grocery
store bigbasket.com (Abraham et al., 2016). Many customers
have the tendency to forget items they intended to buy. The
customers may buy the forgotten items from a nearby store
where they live, resulting in reduction in basket size in the
future for online grocery stores such as bigbasket.com.
Alternatively, the customer may place another order for
forgotten items, but this time, the size of the basket is likely to
be small and results in unnecessary logistics cost. Thus, the
ability to predict the items that a customer may have
forgotten to order can have a significant impact on the profits
of online grocers such as bigbasket.com.
Another problem that online grocery customers face while
ordering the items is the time taken to place an order. Unlike
customers of Amazon or Flipkart, online grocery customers
order several items each time; the number of items in an
order may cross 100. Searching for all the items that a
customer would like to order is a time-consuming exercise,
especially when they order using smart phones. Thus, big
basket created a ‘smart basket’ which is a basket consisting
of items that a customer is likely to buy (recommended
basket) reducing the time required to place the order.
The above examples( ‘did you forget’ and smart basket
feature at bigbasket.com) manifest the importance of
business context in business analytics, that is, the ability to
ask the right questions is an important success criteria for
analytics projects.
Technology:
To find out whether a customer has forgotten to place an order for an item,
we need data. In both the cases, the point of sale data has to be captured
consisting of past purchases made by the customer. Information Technology
(IT) is used for data capture, data storage, data preparation, data analysis,
and data share. Today most data are unstructured data; data that is not in
the form of a matrix (rows and columns) is called unstructured data. Images,
texts, voice, video, click stream are few examples of unstructured data. To
analyse data, one may need to use software such as R, Python, SAS, SPSS,
Tableau, etc. for example, in the case of Target, technology can be used to
personalize coupons that can be sent to individual customers.
Data Science :
Data Science is the most important component of analytics, it consists of
determine goals and the end results they are trying to achieve. These
goals can include increased sales, customer satisfaction and brand
awareness.
2. Collecting data:
The second step in web analytics is the collection and storage of data.
analysis, including:
Hypothesis testing , which is a statistical means of testing an
assumption.
Linear regression analysis, which can be used to evaluate the