unit1 iba

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 11

1.

4 Role of Analytics for Data-Driven Decision Making

1.4.1 Applications and Uses of Business Analytics

Business analytics has several applications and uses across various domains:

1. Descriptive Analytics:

o Focuses on understanding the past and present using factual data.

o Helps evaluate a company’s current market position and the effectiveness of past
business decisions.

o Example: Assessing sales trends over the last quarter to determine successful
product lines.

2. Predictive Analytics:

o Utilized to analyze past performance data and predict future trends or outcomes.

o This approach aids in strategic planning by anticipating potential market changes or


customer behavior.

o Example: Forecasting demand for seasonal products based on historical data.

3. Prescriptive Analytics:

o Focused on creating strategies for optimization and improving corporate


performance.

o Uses both historical and real-time data to recommend actionable solutions.

o Example: Setting competitive prices in a department store based on pricing trends


and consumer purchase patterns.

1.4.2 Workings of Business Analytics

Business Analytics involves a structured process to ensure the effective analysis of data:

1. Identify Corporate Objectives:

o Clearly define the purpose of the analysis.

o Align analytical goals with the broader business strategy.

2. Select Analytical Strategies:

o Choose appropriate methods or models based on the type of analysis (descriptive,


predictive, or prescriptive).

o Example: Use regression models for predictive analytics.

3. Data Collection:
o Gather data from multiple systems and sources to ensure comprehensive coverage
of relevant information.

o Sources include customer databases, transaction records, and external market


reports.

4. Data Cleansing and Integration:

o Remove inconsistencies, errors, or redundancies in the data.

o Integrate all cleaned data into a centralized storage system like a data warehouse or
data mart for easier access and analysis.

1.4.3 Need/Importance of Business Analytics

Business Analytics is crucial for modern organizations to stay competitive and achieve strategic goals:

1. Facilitates Informed Decisions:

o Helps businesses make evidence-based decisions, reducing reliance on intuition.

o Example: Deciding on investment opportunities based on market analysis.

2. Enhances Organizational Performance:

o Leads to increased profitability, market share, and revenue.

o Boosts shareholder returns through optimized operations and strategies.

3. Improves Data Interpretation:

o Enhances understanding of both primary (internal) and secondary (external) data.

o Increases operational efficiency across various departments, like marketing, sales,


and logistics.

4. Provides a Competitive Advantage:

o Equips organizations to better use available information compared to competitors.

o Combines data with advanced models to improve decision-making processes,


making the company more agile and responsive.

1.4.4 Transforming Data into Insightful Knowledge

Business Analytics plays a pivotal role in converting raw data into actionable insights:

1. Making Informed Decisions:

o Combines large volumes of data with analytical tools to guide strategic choices.

o Helps businesses expand market share, improve profitability, and attract investors.

2. Handling Massive Data Volumes:

o Many businesses face challenges in processing and utilizing large datasets.


o Business Analytics simplifies this by extracting meaningful insights and aligning them
with business goals.

3. Four Key Benefits Across Industries:

o Enhances Performance: Provides a clear picture of successful and unsuccessful


strategies.

o Facilitates Decision-Making: Enables quicker and more precise decisions, especially


in time-sensitive scenarios.

o Reduces Risks: Guides organizations in understanding consumer behavior,


identifying trends, and forecasting performance.

o Promotes Innovation: Uses consumer insights to encourage new product


development and organizational change.

1.5 Types of Business Analytics

Business analytics is classified into four main categories, each increasing in complexity and bringing
insights closer to actionable decision-making. Below are detailed descriptions of these categories:

1. Descriptive Analytics

 Definition:
Summarizes historical and current data to provide insights into what has already happened.

 Key Features:

o Simplest form of analytics; uses data aggregation and mining.

o Makes data accessible to stakeholders like shareholders, marketing teams, and sales
managers.

 Applications:

o Identifies strengths, weaknesses, and patterns in customer behavior.

o Assists in creating targeted marketing strategies by understanding past trends.

o Example: Summarizing quarterly sales to identify the most profitable product line.

2. Diagnostic Analytics

 Definition:
Shifts the focus from past data to present occurrences, helping identify factors influencing
trends.

 Key Features:

o Uses techniques like drill-down analysis and data mining.


o Explores probabilities and likelihoods to determine the root causes of events.

o Employs sensitivity analysis and training algorithms for classification and regression.

 Applications:

o Identifies reasons behind unexpected sales dips or spikes.

o Example: Analyzing why customer satisfaction ratings dropped during a specific


period.

3. Predictive Analytics

 Definition:
Leverages statistical models and machine learning (ML) techniques to forecast future events.

 Key Features:

o Builds on descriptive analytics outcomes to create predictive models.

o Uses existing data, such as social media sentiment, to anticipate trends.

o Often employs sentiment analysis to assess user attitudes (positive, neutral,


negative).

 Applications:

o Forecasting product demand or predicting customer churn.

o Example: Using customer purchase data to predict which products will sell well in the
upcoming quarter.

4. Prescriptive Analytics

 Definition:
Goes beyond predictions to suggest the best course of action and the steps needed to
achieve the desired outcomes.

 Key Features:

o Relies on robust feedback systems and iterative analysis.

o Explores relationships between actions and their outcomes.

o Provides actionable recommendations, ensuring alignment with business objectives.

 Applications:

o Development of recommendation systems (e.g., Netflix or Amazon suggestions).

o Example: Advising optimal inventory levels to prevent stockouts during peak


demand.
1.6 Introduction to the Concepts of Big Data Analytics

Big Data Analytics involves handling massive volumes of data that cannot be processed or stored
using conventional methods. It is categorized based on the data’s structure:

Types of Data

1. Structured Data:

o Follows a defined format with clear rows and columns.

o Stored in databases like RDBMS and spreadsheets.

o Example: Customer data (names, addresses, and purchase history).

2. Semi-Structured Data:

o Partially organized but does not follow strict formal rules.

o Example: XML and JSON files.

3. Unstructured Data:

o Lacks a consistent format or structure.

o Example: Emails, social media posts, and videos.

1.6.1 Large-Scale Data Management Traits

Big data is defined by the following key characteristics:

1. Volume:

o Refers to the immense scale of data generated by sources like social media, IoT
devices, and transactions.

o Measured in terabytes, petabytes, or even zettabytes.

2. Diversity:

o Includes varied formats such as images, videos, and audio streams, in addition to
structured data like phone numbers and addresses.

o Around 80% of data is unstructured.

3. Velocity:

o Describes the rapid rate at which data is generated and processed, often in real-time.

o Example: Streaming analytics for live video feeds.

4. Veracity:

o Ensures the reliability, accuracy, and relevance of data for decision-making.

5. Value:
o Focuses on deriving actionable insights from data to guide business decisions.

1.6.2 Services for Big Data Management

Big data management solutions vary in complexity and functionality. Popular services include:

1. Data Cleansing:

o Identifies and resolves inaccuracies or inconsistencies in data sets.

o Example: Removing duplicate entries from customer databases.

2. Data Integration:

o Combines data from multiple sources into a unified system for analysis.

o Example: Merging sales data from physical stores and e-commerce platforms.

3. Data Preparation:

o Prepares raw data for analysis by organizing and structuring it.

o Example: Formatting transaction logs for statistical analysis.

4. Data Enrichment:

o Enhances datasets by adding new information or correcting errors.

o Example: Using geolocation data to segment customer demographics.

5. Data Migration:

o Transfers data from one environment to another, such as moving from on-premise
systems to the cloud.

6. Data Analytics:

o Applies techniques like statistical modeling and machine learning to extract insights.

o Example: Using sentiment analysis to gauge customer opinions from social media.

1.7 Overview of Machine Learning Algorithms

Machine Learning (ML) refers to the study and development of algorithms that allow computers to
improve their performance and make decisions or predictions based on data without explicit
programming. It is a crucial subset of artificial intelligence (AI) and finds application in numerous
fields, including medicine, email filtering, speech recognition, and computer vision.

1.7.1 Machine Learning Basics

How Does Machine Learning Work?


Machine learning operates through the following steps:

1. Making a Decision:
o ML algorithms predict or classify patterns based on the provided data.

o Data may be labeled (structured) or unlabeled.

2. Error Function:

o The algorithm uses an error function to assess the accuracy of its predictions.

o The function compares predictions against known examples to determine deviation.

3. Model Optimization:

o Weights within the model are adjusted iteratively to reduce the error.

o The algorithm updates itself continuously until it reaches the desired accuracy.

Applications of Machine Learning

 Predictive analytics in business.

 Exploratory data analysis and mining.

 Use of neural networks for tasks mimicking human brain functions.

1.7.2 Machine Learning Methods

Machine learning methods are categorized into three primary types:

1. Supervised Learning:

o Definition:
Utilizes labeled datasets to train models that can classify data or predict outcomes.

o Features:

 Involves adjusting weights to achieve an optimal fit.

 Techniques: Neural networks, linear regression, logistic regression, random


forests, support vector machines (SVM), and naive Bayes.

o Applications:

 Classifying spam emails.

 Predicting house prices based on historical data.

2. Unsupervised Learning:

o Definition:
Analyzes and groups unlabeled datasets to identify hidden patterns and clusters.

o Features:

 Suitable for exploratory data analysis and pattern recognition.

 Techniques: Neural networks, k-means clustering, probabilistic clustering,


Principal Component Analysis (PCA), and Singular Value Decomposition
(SVD).
o Applications:

 Consumer segmentation and cross-selling strategies.

 Reducing the number of features in models through dimensionality


reduction.

3. Semi-Supervised Learning:

o Definition:
Combines elements of supervised and unsupervised learning. A small labeled
dataset guides the analysis of a larger unlabeled dataset.

o Features:

 Useful when labeled data is scarce or expensive to produce.

o Applications:

 Feature extraction in cases with limited labeled data.

1.7.3 Reinforcement Learning

Definition:
Reinforcement learning is a behavioral learning model that learns through trial and error rather than
pre-supplied training data.

Features:

 Continuously improves its strategy by reinforcing successful outcomes.

 Operates autonomously to develop optimal solutions for specific problems.

Applications:

 Autonomous driving systems.

 Interaction between humans and computers.

 Writing sports match reports and identifying security threats.

1.8 Introduction to Relevant Statistical Software Packages

Statistical software packages are collections of programs designed to facilitate statistical analysis and
related tasks such as data management. These tools are critical for organizing, interpreting, and
presenting data to provide scientific insights into patterns and trends.

What is Statistical Software?

Statistical software refers to programs designed to perform complex statistical analyses using
algorithms and mathematical methods such as regression analysis and time series analysis. These
tools streamline the analysis process, ensuring accuracy and efficiency.
Benefits of Statistical Software

1. Enhanced Productivity: Automates data analysis and management, saving time and effort.

2. Accuracy: Reduces human errors, improving the reliability of results.

3. Customization: Offers personalized features for specific analytical needs.

4. Data-Driven Insights: Leverages extensive datasets to support decision-making.

Relevant Statistical Software Packages

1. SPSS (Statistical Package for Social Sciences)

o Popular for analyzing complex statistical data.

o Generates descriptive statistics, parametric/non-parametric analysis, and


presentation-ready reports.

o Primarily used for quantitative data analysis.

2. Stata

o Provides tools for data management, analysis, and visualization.

o Features both command-line and graphical user interfaces.

o Coding expertise not required, making it user-friendly.

3. R

o Free and open-source statistical software with extensive libraries.

o Ideal for both linear and non-linear modeling, offering interactive reports and apps.

o Requires coding expertise. Used extensively for quantitative data analysis.

4. Python

o Open-source with vast libraries and frameworks.

o Widely used for machine learning tasks.

o Notable for simplicity, readability, and versatility.

5. SAS (Statistical Analysis Software)

o Cloud-based platform with pre-built applications for data manipulation and statistical
modeling.

o Preferred by data scientists and business analysts.

o Requires coding knowledge and is used for numerical data analysis.

6. MATLAB (MATrix LABoratory)


o Offers a programming language and analytical platform for matrix operations, data
visualization, and algorithm implementation.

o Frequently used by engineers and scientists for quantitative data analysis.

7. Epi-data

o Free software designed for public health researchers.

o Supports data entry, management, and basic statistical analysis.

o Commonly used in epidemiology and public health.

8. Epi-info

o Public domain software by the CDC for epidemiology.

o Features include data entry, analysis, mapping, and visualization.

o Used in disease surveillance and outbreak investigations.

9. NVivo

o Specializes in qualitative data analysis (e.g., interviews, surveys, social media).

o Allows importing and analyzing unstructured text, audio, video, and images.

o Widely used for mixed-methods research.

10. Mini-tab

o Facilitates fundamental to moderately complex statistical analysis.

o Automates calculations and generates visualizations.

o Ideal for understanding quantitative data insights.

11. Dedoose

o Web-based tool for qualitative and quantitative analysis.

o Cost-effective, user-friendly, and team-oriented, with robust security features.

12. ATLAS.ti

o Leading qualitative analysis software with integrated AI features.

o Common in academic and corporate research for sentiment analysis and auto-
coding.

13. MAXDQA 12

o Advanced software for analyzing data using mixed methods.

o Efficient for literature reviews and categorizing unstructured data.

o Costly and less collaborative for team use.

You might also like