BI Vimlu
BI Vimlu
Pie Chart:
Consider a dataset of student exam scores and ● Confidence: Confidence measures the
their corresponding pass/fail status. The goal is reliability or strength of the association
to predict whether a student will pass (1) or fail between two itemsets in a rule. It's
(0) the exam based on their exam scores. We can calculated as the ratio of the number of
use logistic regression to build a model that transactions containing both the
predicts the probability of passing the exam antecedent and consequent of a rule to
based on the exam scores. the number of transactions containing the
antecedent.
Let's say we have two predictor variables:
exam1_score (x1) and exam2_score (x2). The Confidence(X→Y)=Support(X)Support(X∪Y)
logistic regression model can be represented as:
Example:
log(1−pp)=β0+β1x1+β2x2
Consider a dataset of supermarket transactions,
Where: and we want to find association rules. Let's say
we have the following rule: {Diapers} → {Beer}.
● p is the probability of the student passing
the exam given their exam scores. ● Support(Diapers → Beer) = 0.2: This means
● x1 and x2 are the exam1_score and that 20% of transactions contain both
exam2_score respectively. diapers and beer.
● β0, β1, and β2 are the coefficients of the ● Confidence(Diapers → Beer) = 0.8: This
model. means that among the transactions
containing diapers, 80% also contain beer.
The logistic regression model estimates the
coefficients β0, β1, and β2from the training data, In this example, a support of 0.2 indicates that
and the predicted probability p is used to make the rule is relevant in 20% of transactions, while a
predictions. If the predicted probability is greater confidence of 0.8 indicates that the rule is
than a certain threshold (e.g., 0.5), the student is accurate in 80% of cases where diapers are
predicted to pass (1); otherwise, they are purchased.
predicted to fail (0).
h) Different Formulae for Evaluation of
g) Association Rules and Evaluation using Classification Models:
Support and Confidence:
● Accuracy:
Association Rules:
Accuracy=TP + TN + FP + FNTP + TN
Association rules are patterns or relationships
discovered in datasets consisting of transactions ● Precision:
or items. They are used in market basket analysis
to identify co-occurrence relationships between Precision=TP + FPTP
different items in a transaction. Association rules
typically take the form of "if-then" statements, ● Recall (Sensitivity):
where antecedents imply consequents.
Recall=TP + FNTP
Evaluating Association Rules using Support
and Confidence: ● F1 Score:
F1 Score=2×Precision + RecallPrecision×Recall Recall=TP + FNTP
● Precision:
● Specificity:
Precision=TP + FPTP
Specificity=TN + FPTN ● Accuracy:
● False Positive Rate (FPR): Accuracy=TP + TN + FP + FNTP + TN
3 8 7
● Cluster 1: Ages 16 to 29.
● Cluster 2: Ages 36 to 66. 4 9 8
Similarities and Differences Between ERP and By analyzing data, retailers can identify trends,
BI forecast demand, and personalize marketing
campaigns to enhance customer satisfaction and
Similarities increase sales.
● Data Integration: Both ERP and BI integrate i) Implementing Business Intelligence Findings
data from various sources to provide a unified
view of business operations. Implementing BI findings involves several steps:
● Decision Support: Both offer tools for data
1. Identify Key Objectives: Determine the
analysis, reporting, and visualization to
specific business goals BI aims to address
support decision-making.
(optimize operations, improve customer
● Improving Efficiency: Both help improve
satisfaction, increase revenue).
operational efficiency, streamline processes,
2. Data Collection and Integration: Gather
and optimize resource allocation by providing
relevant data from various sources
insights into business performance.
(databases, spreadsheets, CRM systems,
external sources). Ensure data is cleaned,
Differences between ERP and BI Systems:
standardized, and integrated for analysis.
1. Scope and Focus: 3. Data Analysis and Insights Generation:
Utilize BI tools and techniques to analyze
● ERP systems: Manage core business processes data and generate actionable insights (data
(finance, HR, inventory, supply chain). visualization, statistical analysis, predictive
● BI systems: Analyze and interpret data to modeling, machine learning algorithms).
support decision-making across functions. 4. Interpretation and Decision-Making:
Interpret insights in the context of business
2. Real-time vs. Historical Data: objectives. Collaborate with stakeholders to
understand implications and devise
● ERP systems: Real-time transactional data implementation strategies.
(day-to-day operations). 5. Implementation Planning: Develop a plan
● BI systems: Historical data to identify trends for implementing changes based on BI
and insights over time. findings (reallocate resources, redesign
processes, launch new initiatives, revise
3. Functionality: existing strategies).
6. Monitoring and Evaluation: Continuously ● Clustering: Includes algorithms for discovering
monitor the implementation of BI-driven hidden patterns and grouping similar data
initiatives and track their impact on KPIs. points together (k-means, hierarchical
Evaluate the effectiveness of strategies and clustering, expectation-maximization (EM)).
make adjustments as needed. ● Association Rule Mining: Supports algorithms
for discovering relationships between variables
j) Business Intelligence (BI) Applications in (Apriori, FP-Growth) for market basket
Logistics analysis, recommendation systems, and
identifying cross-selling opportunities.
BI applications in logistics leverage data ● Visualization: Provides tools for exploring and
analytics to optimize supply chain operations, interpreting data analysis results (scatter plots,
enhance efficiency, and improve histograms, decision tree diagrams).
decision-making. BI in logistics enables Visualizations help users understand patterns,
organizations to: trends, and relationships in the data more
intuitively.
1. Demand Forecasting: Analyze data to
forecast future demand accurately (optimize
inventory levels, reduce stockouts, improve
customer service).
2. Route Optimization: Utilize BI tools to
analyze transportation data (traffic patterns,
delivery routes, vehicle utilization) to
minimize fuel costs, reduce transit times, and
improve delivery efficiency.
3. Warehouse Management: Implement BI
solutions for monitoring warehouse
operations, inventory levels, and order
fulfillment processes. Real-time analytics help
optimize warehouse layouts, reduce picking
times, and enhance overall efficiency.
4. Supplier Management: Analyze supplier
performance metrics (lead times, quality
levels, delivery reliability) to identify top
performers and optimize supplier
relationships. BI insights enable better
decision-making in supplier selection and
contract negotiations.
5. Risk Management: Utilize BI tools to identify
and mitigate risks in the supply chain
(disruptions, delays, inventory shortages).
Predictive analytics models help proactively
manage risks and implement contingency
plans to minimize their impact.