Reviewerku
Reviewerku
Reviewerku
1. Volume: The substantial amount of data processed, requiring significant storage and
processing capacity.
2. Velocity: The speed at which data is generated and processed. High-velocity data
requires quick and efficient processing solutions.
3. Variety: The different formats and types of data (structured, unstructured, semi-
structured).
4. Veracity: The quality or fidelity of data. High-quality data (high signal-to-noise ratio) is
crucial for accurate analysis.
5. Value: The usefulness of data for an enterprise. Value depends on the quality and
timeliness of data processing and analysis.
1. Structured Data: Conforms to a data model or schema and is often stored in tabular
form (e.g., banking transactions, customer records).
2. Unstructured Data: Does not conform to a data model or schema. Examples include text
files (tweets, blog posts) and binary files (images, videos).
3. Semi-structured Data: Has some structure but is not relational. Examples include XML
and JSON files, EDI files, spreadsheets, RSS feeds.
4. Metadata: Provides information about a dataset’s characteristics and structure (e.g.,
XML tags, file attributes).
Big Data: Field focused on analyzing, processing, and storing large datasets from various
sources when traditional methods are inadequate.
Big Data Characteristics: Involves large, diverse, complex, and often unstructured data.
Structured Data: Data that is organized and easily searchable (e.g., databases).
Unstructured Data: Data without a pre-defined format (e.g., text, images).
Semi-structured Data: Contains both structured and unstructured elements (e.g., JSON
files).
Types of Analytics
1. Descriptive Analytics
o Answers questions about past events.
o Example questions: What was the sales volume over the past 12 months?
o Provides historical data insights via reports or dashboards.
2. Diagnostic Analytics
o Determines the cause of past events.
o Example questions: Why were Q2 sales less than Q1 sales?
o Involves complex queries and multi-dimensional data analysis.
3. Predictive Analytics
o Predicts future events based on past data.
o Example questions: What are the chances a customer will default on a loan if they
miss a payment?
o Uses models to forecast outcomes and identify risks/opportunities.
4. Prescriptive Analytics
o Suggests actions based on predictive analytics results.
o Example questions: Which drug provides the best results among three options?
o Involves simulation of scenarios and incorporates both internal and external data.
Operational Optimization
Actionable Intelligence
Identification of New Markets
Accurate Predictions
Fault and Fraud Detection
More Detailed Records
Improved Decision-Making
Scientific Discoveries
Definition: BI helps organizations gain insights into performance by analyzing data from
business processes and information systems.
Purpose: Enables management to correct issues and enhance organizational performance.
Data Consolidation: Typically, data is consolidated into an enterprise data warehouse
for analytical queries.
Dashboards: BI outputs are often displayed on dashboards for easy access and analysis
by managers.
1. Human-Generated Data
o Definition: Data resulting from human interaction with systems.
oExamples: Social media, blog posts, emails, photo sharing.
2. Machine-Generated Data
o Definition: Data generated by software programs and hardware devices.
o Examples: Web logs, sensor data, telemetry data.
Data Types
1. Structured Data
o Definition: Data that conforms to a data model or schema.
o Storage: Often stored in relational databases.
o Examples: Banking transactions, invoices, customer records.
2. Unstructured Data
o Definition: Data that does not conform to a data model or schema.
o Growth: Makes up 80% of enterprise data and has a faster growth rate.
o Examples: Text files, media files (video, image, audio).
3. Semi-Structured Data
o Definition: Data with a defined level of structure but not relational.
o Formats: Hierarchical or graph-based.
o Examples: XML, JSON, sensor data.
4. Metadata
o Definition: Information about a dataset’s characteristics and structure.
o Importance: Crucial for processing, storage, and analysis in Big Data
environments.
o Examples: XML tags, attributes of a digital photograph.
Marketplace Dynamics
The evolving business landscape has forced companies to shift from internal efficiency and cost-
cutting to external focus and innovation. This change, driven by market disruptions and
economic fluctuations, emphasizes the need for companies to leverage external data and
advanced analytics.
Key Points:
Economic crises like the dot-com bubble burst in 2000 and the global recession in 2008
prompted businesses to improve efficiency and cut costs.
Post-recession, companies have focused on innovation to gain competitive advantages
and grow revenue.
Integration of external data with internal data enhances business intelligence and
decision-making.
Business Architecture
Modern enterprise architecture increasingly integrates business architecture with technology
architecture. This holistic approach aligns strategic vision with operational execution through
well-defined linkages between abstract and concrete business elements.
Key Points:
Business architecture includes elements like mission, vision, strategy, and key
performance indicators (KPIs).
A layered system approach divides the organization into strategic, tactical, and
operational layers, each with distinct roles and metrics.
Big Data enriches business architecture by providing context and insights across
organizational layers.
BPM focuses on optimizing business processes to deliver value efficiently. Big Data and
intelligent BPM systems enhance process management by enabling adaptive, goal-driven
execution.
Key Points:
Business processes describe work activities, relationships, and responsible actors within
an organization.
BPM systems (BPMS) integrate process models, organizational roles, business rules, and
user interfaces to create cohesive business applications.
Combining Big Data analytics with BPM allows for adaptive and responsive process
execution.
Advancements in ICT have accelerated Big Data adoption by providing essential tools and
environments for data analytics, digitization, and scalable computing.
Key Points:
Data analytics and data science techniques are crucial for extracting insights from large
datasets.
Digitization replaces physical mediums with digital ones, facilitating data collection and
interaction.
Affordable technology and commodity hardware make Big Data solutions accessible to
businesses of all sizes.
Social media and hyper-connected devices provide rich data sources for analysis.
Cloud computing offers scalable, on-demand resources for Big Data processing.
Key Points:
IoE combines smart connected devices with business processes to create unique value
propositions.
Big Data is central to IoE, enabling real-time analysis and decision-making.
Examples like precision agriculture demonstrate how IoE and Big Data can optimize
workflows and generate value.