0% found this document useful (0 votes)
10 views25 pages

BDA Lec1

The document outlines a course on Big Data Analytics taught by Dr. Nesma Mahmoud, detailing course materials, announcements, and a schedule of topics. It defines Big Data, its characteristics (5Vs), and distinguishes between data analytics, data analysis, and data science. The course aims to equip students with skills in Hadoop and Spark, which are highly sought after in the job market.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views25 pages

BDA Lec1

The document outlines a course on Big Data Analytics taught by Dr. Nesma Mahmoud, detailing course materials, announcements, and a schedule of topics. It defines Big Data, its characteristics (5Vs), and distinguishes between data analytics, data analysis, and data science. The course aims to equip students with skills in Hadoop and Spark, which are highly sought after in the job market.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 25

4th grade

Big Data
Analytics
Dr. Nesma Mahmoud
More Info:
https://nesmaamahmoud.blogspot.com/p/about.html
Email: [email protected]
Course Materials & Announcements
● All course material (lecture notes “slides", assignments, any supplemental notes or
documentation), will be made available (posted)online on weeklybasis, on the webpage:

○ https://nesmaamahmoud.blogspot.com/p/big-data-analytics-2024.html

● All course announcements will be made available (posted)online on the webpage :

○ https://www.facebook.com/groups/859162975610627/
Lecture 1: Course
Info & Big Data
Review
What will we learn in this lecture?
What is Big Data?
01.

Big Data General Architecture


02.

Big data Analytics vs. Others


03.
01. What is Big Data?
Data in Everywhere
Big Data Definition
● No single definition; here is from Wikipedia:

● Big data is the term for a collection of data sets so large and complex that it becomes difficult to
process using on-hand database management tools or traditional data processing applications.

● The challenges include capture, curation(‫)العناية‬, storage, search, sharing, transfer, analysis, and
visualization.
Characteristics of Big Data (5V’s Model)

The definition of big data is data


that contains greater variety,
arriving in increasing volumes and
with more velocity. This is also
known as the 3 Vs.
Characteristics of Big Data (5V’s Model)
● The 5Vs of
Big Data are
Volume(size),
Velocity
(Frequency),
Variety
(Types),
Veracity
(Accuracy),
Value
(Business).
Types of Big Data
02. Big Data General
Architecture
Big Data Architecture

Fig. Components of Big Data Architecture (Source: Software Architecture Academy)


Big data Analytics
03. vs. Others
Big Data Analytics Real-world Example (Netflix)
Big Data Analytics Real-world Example (Netflix)
What is Data ANALYTICS?
Data analytics is the process of examining data sets in order to find trends and draw conclusions about
the information they contain.

For example, the data from consumers of an e-commerce store might indicate the products in which
the customers are interested. This conclusion from the customer data might help the organization to
increase the stock of that product or make an important business decision.

https://medium.com/analytics-vidhya/data-analytics-101-basics-of-data-
analytics-for-beginners-3b8b9ca14185
What is Big Data ANALYTICS?
Big data analytics refers to the systematic processing and analysis of large amounts of data and
complex data sets, known as big data, to extract valuable insights.

Big data analytics allows for the uncovering of trends, patterns and correlations in large amounts of
raw data to help analysts make data-informed decisions.

https://www.ibm.com/topics/big-data-analytics
DATA ANALYTICS VS. DATA ANALYSIS
Data Analysis
When we talk about Data Analysis, we call it a “detailed examination of
data” (which must already exist). Since the data already exists, the data
must pertain to something that happened in the past. As such, data
analysis answers the question, “What happened?”

Data Analytics
On the other hand, Data Analytics is defined as the “systematic
computational analysis of data”. Hence, Data Analytics is further
concerned about conducting logical, systematic, and deductive reasoning
to provide insights for how to act in the future.

More Info: https://adwiteeya.medium.com/data-analysis-vs-data-analytics-a08c0fc4603c


DATA ANALYTICS VS. DATA SCIENCE
“Data science is the discipline of making data useful.”
A data scientist generates questions, whereas a data analyst answers pre-existing queries.
Big DATA ANALYTICS VS. DATA ANALYTICS VS. DATA SCIENCE
Why taking this course?
• Learning Hadoop eco-system is rare in our region.
• You will be among “the few”
• Learning a different way of programming
(distributed)
• Requires different mentality
• Having Hadoop and Spark in your CV is a BIG plus
• Dealing with current real-life data with ease.
• The most required and Top jobs
DATA Jobs

https://medium.com/@lunadoan/data-job-market-2024-insights-you-need-to-boost-your-career-d05c7e18a5c1
Course Schedule
NOTE: This schedule is not final and may change over the course of the semester

Main Topics
● Big Data Review
● Introduction to Big Data Analytics
● Big Data Analytics Lifecycle
● MapReduce for Big Data Analytics
● Introduction to Spark
● Streaming Big Data Analytics
● Data Mining/Machine Learning for Big Data Analytics
● Recommendation Systems
Thanks!
Do you have any questions?

CREDITS: This presentation template was created by Slidesgo, and includes icons by
Flaticon, and infographics & images by Freepik

You might also like