CS 329 Lecture One 2025
CS 329 Lecture One 2025
By Dr Christina MURO
5/29/2025 1
Lecture one: Overview of Big Data
Outline
◼ What is Big Data
◼ Characterristics of Big Data
◼ Distinction of Big Data with Other Disciplines
◼ Who is generating Big data
◼ What is Driving Big Data
◼ Value of Big Data
◼ Challenges of in handling Big Data
◼ Challenges of Conventional Systems
2
What is Big Data?
◼ There is not a consensus as to how to define Big Data:
◼ —“A collection of data sets so large and complex that it
becomes difficult to process using on-hand database
management tools or traditional data processing applications.” -
wiki
◼ —“Big data exceeds the reach of commonly used hardware
environments and software tools to capture, manage, and
process it with in a tolerable elapsed time for its user
population.” - Tera- data magazine article, 2011
◼ —“Big data refers to data sets whose size is beyond the ability
of typical database software tools to capture, store, manage and
analyze.” - The McKinsey Global Institute, 2011
5/29/2025 3
What is Big Data?
◼ Big Data refers to datasets grow so large and complex that it is
difficult to capture, store, manage, share, analyze and visualize
within current computational architecture.
◼ Big Data is a phenomenon that is characterized by the rapid
expansion of raw data.
◼ This data is being collected and generated so quickly that is
inundating (overwhelming) government and society.
◼ By Big Data, we mean:
❑ The data is too big to fit in main memory
❑ We need data structures on the data
❑ Words like “index” or “metadata” suggest that there are underlying
data structures
❑ These data structures are also too big to fit in main memory.
5/29/2025 4
Characteristics of Big Data (5Vs)
5/29/2025 5
Types of Big Data
5/29/2025 6
Harnessing Big Data
5/29/2025 7
Who is Generating Big Data
5/29/2025 8
The Model Has Changed
5/29/2025 9
What’s driving Big Data
5/29/2025 10
What’s driving Big Data
◼ Increased data volumes being captured
and store.
◼ Growing variation in types of datasets for
analysis.
◼ Rising demand for real-time integration of
analytical results
5/29/2025 11
Challenges of Conventional Systems
5/29/2025 12
Big Data Analytics
◼ Big data analytics is the process of examining large and
complex datasets (known as "big data") to uncover valuable
insights, patterns, trends, and correlations, ultimately helping
organizations make data-informed decisions.
◼ The main difference between big data analytics and traditional
data analytics is the type of data handled and the tools used to
analyze it.
◼ Traditional data analytics relies on statistical methods and tools
like structured query language (SQL) for querying databases.
◼ Big data analytics involves massive amounts of data in various
formats, including structured, semi-structured and unstructured
data. Big data analytics employs advanced techniques
like machine learning and data mining to extract information
from complex data sets
5/29/2025 13
What are the advantages of Big Data Analytics?
5/29/2025 14
Types of Big Data Analytics
Descriptive Analytics
❑ is the process of analyzing historical data to understand
5/29/2025 15
Types of Big Data Analytics
Diagnostic Analytics
❑ Investigates past data to understand the root causes of
events, behaviors, and outcomes, helping businesses
and organizations identify trends and make more
informed decisions.
5/29/2025 16
Types of Big Data Analytics
Predictive Analytics
❑ uses historical data and statistical algorithms to forecast
future events. It aims to predict what is likely to happen
based on past trends and patterns. Predictive analytics
answers the question, "What could happen?".
5/29/2025 17
Types of Big Data Analytics
Prescriptive Analytics
❑ goes beyond describing the best course of action based on
data recollecting past events or predicting future outcomes
to, using techniques like machine learning and simulation to
optimize decision-making
❑
5/29/2025 18
Types of Big Data Analytics
5/29/2025 19
The Lifecycle Phases of Big Data Analytics
5/29/2025 20
Uses and Examples of Big Data Analytics
◼ There are many different ways that Big Data analytics can be used in
order to improve businesses and organizations. Here are some
examples:
❑ Using analytics to understand customer behaviour in order to
optimize the customer experience.
❑ Predicting future trends to make better business decisions.
5/29/2025 21
Uses and Examples of Big Data Analytics
◼ Risk Management
❑ Use Case: Banco de Oro, a Phillippine banking company,
jet engines for airlines and armed forces across the globe,
uses Big Data analytics to analyze how efficient the engine
designs are and if there is any need for improvements.
5/29/2025 22
Uses and Examples of Big Data Analytics
5/29/2025 23
Uses and Examples of Big Data Analytics
• E-commerce - Predicting customer trends and optimizing
prices are a few of the ways e-commerce uses Big Data
analytics.
5/29/2025 24
Uses and Examples of Big Data Analytics
5/29/2025 25
26
27
End!!!!
5/29/2025 28