0% found this document useful (0 votes)
3 views

Notes

The document outlines the 4 Vs of Big Data—Volume, Velocity, Variety, and Veracity—which are essential for businesses to manage and analyze data effectively. It also discusses four types of analytics: Descriptive, Diagnostic, Predictive, and Prescriptive, each serving different purposes in data-driven decision-making. Additionally, it categorizes data into structured, unstructured, and semi-structured types, highlighting their characteristics and use cases.

Uploaded by

Adarsh Kr
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Notes

The document outlines the 4 Vs of Big Data—Volume, Velocity, Variety, and Veracity—which are essential for businesses to manage and analyze data effectively. It also discusses four types of analytics: Descriptive, Diagnostic, Predictive, and Prescriptive, each serving different purposes in data-driven decision-making. Additionally, it categorizes data into structured, unstructured, and semi-structured types, highlighting their characteristics and use cases.

Uploaded by

Adarsh Kr
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

The 4 Vs of Big Data in business analytics are key characteristics that

define big data and help businesses understand and manage their data
effectively. They are:

1. Volume

o Definition: Refers to the sheer quantity of data generated


and collected by businesses from various sources like social
media, sensors, transactions, and web traffic.

o Example: E-commerce platforms like Amazon generate


terabytes of data daily from user searches, purchases, and
reviews.

o Importance: Analyzing large volumes of data helps


businesses uncover patterns, predict trends, and make data-
driven decisions.

2. Velocity

o Definition: Refers to the speed at which data is generated,


collected, and processed. Businesses need to process data in
real-time or near-real-time to gain timely insights.

o Example: Streaming platforms like Netflix analyze user


behavior in real-time to recommend content.

o Importance: Fast data processing enables companies to


respond to customer needs quickly, optimize operations, and
gain a competitive advantage.

3. Variety

o Definition: Describes the different types of data available,


including structured, unstructured, and semi-structured data
from various sources.

o Example: Businesses collect data from emails, social media,


videos, financial transactions, and IoT sensors.

o Importance: Managing diverse data sources provides a


comprehensive view of business performance and customer
preferences.

4. Veracity

o Definition: Refers to the accuracy, quality, and reliability of


data. It ensures that decisions made using the data are
trustworthy.
o Example: Financial institutions verify transaction data for
accuracy to detect fraud.

o Importance: Reliable data helps organizations make


informed decisions, minimize risks, and build customer trust.

These 4 Vs help businesses harness the power of big data to derive


actionable insights, predict market trends, and improve overall
performance.

ANALYTICS

In business analytics, there are four primary types of analytics that


organizations use to extract insights from data and make informed
decisions. These are:

1. Descriptive Analytics

 Definition: Descriptive analytics answers the question, "What


happened?" by analyzing historical data to identify patterns and
trends.

 Techniques Used: Data aggregation, data mining, and


visualization tools like dashboards and reports.

 Example: A retail store uses descriptive analytics to track monthly


sales, customer purchases, and product performance.

 Purpose: Provides insights into past performance and helps


businesses understand their current state.

2. Diagnostic Analytics

 Definition: Diagnostic analytics answers the question, "Why did it


happen?" by investigating the causes behind past events using
data.

 Techniques Used: Drill-down analysis, data discovery, and


correlation analysis.

 Example: If a company experiences a sudden drop in sales,


diagnostic analytics helps determine if it was due to pricing
changes, competitor actions, or seasonal demand.

 Purpose: Identifies the root causes of issues, enabling businesses


to take corrective actions.

3. Predictive Analytics
 Definition: Predictive analytics answers the question, "What could
happen in the future?" by using statistical models and machine
learning algorithms to forecast future outcomes.

 Techniques Used: Regression analysis, time series analysis, and


artificial intelligence (AI).

 Example: An e-commerce platform uses predictive analytics to


forecast customer demand for products based on browsing history
and previous purchases.

 Purpose: Helps businesses anticipate trends, manage resources


effectively, and make proactive decisions.

4. Prescriptive Analytics

 Definition: Prescriptive analytics answers the question, "What


should we do?" by recommending actions to achieve desired
outcomes using optimization algorithms and simulations.

 Techniques Used: Optimization models, simulation, and decision


analysis.

 Example: A logistics company uses prescriptive analytics to


determine the most efficient delivery routes, reducing fuel costs and
improving delivery times.

 Purpose: Provides actionable recommendations to maximize


business performance and minimize risks.

Type of Question
Purpose Example
Analytics Answered

Descriptive What Understand past Monthly sales report


Analytics happened? performance of a store

Diagnostic Why did it Analyze reasons for


Identify root causes
Analytics happen? customer churn

Predictive What could Forecast future Predict next quarter’s


Analytics happen? outcomes sales

Prescriptive What should Suggest optimal


Recommend actions
Analytics we do? supply chain routes
By using a combination of these four types of analytics, businesses can
make data-driven decisions, improve operational efficiency, and gain a
competitive advantage.

DATA

In business analytics and data management, data is classified into


three main types based on its organization and structure:

1. Structured Data

 Definition: Structured data is organized in a predefined format,


typically stored in relational databases with rows and columns. It
follows a fixed schema, making it easy to search and analyze using
SQL.

 Characteristics:

o Highly organized and easily searchable

o Stored in databases, spreadsheets, and data warehouses

o Uses tables with defined data types (e.g., integers, text,


dates)

 Examples:

o Customer information (Name, Email, Phone Number) stored in


a CRM system

o Sales transactions in an Excel sheet or SQL database

o Banking records with account details and balances

 Use Cases:

o Financial reporting

o Inventory management

o Employee databases

2. Unstructured Data

 Definition: Unstructured data lacks a fixed format or organization,


making it more complex to store and analyze. It does not fit neatly
into relational databases and requires specialized tools for
processing.

 Characteristics:
o Does not have a predefined data model

o Large and complex (big data)

o Often requires AI, machine learning, or natural language


processing (NLP) for analysis

 Examples:

o Emails, text messages, social media posts

o Videos, images, and audio files

o IoT sensor data and raw logs

 Use Cases:

o Sentiment analysis from customer reviews

o Image and voice recognition

o Analyzing customer feedback from social media

3. Semi-Structured Data

 Definition: Semi-structured data is a mix of structured and


unstructured data. It contains some organization, such as tags or
metadata, but does not fit into a traditional database format.

 Characteristics:

o Has some structure but is not rigidly organized like structured


data

o Often stored in NoSQL databases or XML/JSON formats

o Requires transformation before analysis

 Examples:

o JSON or XML files storing product details

o Emails (structured headers + unstructured body content)

o Sensor data with timestamps and raw readings

 Use Cases:

o Web scraping and data extraction

o Managing real-time analytics from IoT devices

o Storing flexible customer profiles in NoSQL databases


Comparison Table: Structured vs. Unstructured vs. Semi-
Structured Data

Analysis
Type Format Storage Examples
Tools

Customer
Predefined, SQL databases, SQL, Excel,
Structured records, sales
tabular spreadsheets Power BI
data

No Social media
Data lakes, AI, NLP,
Unstructured predefined posts, images,
cloud storage Hadoop
format videos

NoSQL NoSQL,
Semi- Partially Emails, IoT logs,
databases, Python,
Structured organized web data
JSON/XML MongoDB

By understanding these data types, businesses can choose the right tools
and strategies to store, manage, and analyze data effectively.

You might also like