ETB 1 (Big Data)
ETB 1 (Big Data)
Learning Objectives
1. Understand Big Data Concepts: Gain a comprehensive understanding of the
fundamental principles of Big Data, including the 7 Vs and the significance of
Big Data in today’s digital landscape.
2. Explore Big Data Technologies: Familiarize yourself with essential Big Data
technologies and tools such as Hadoop, Spark, NoSQL databases, and data
warehouses used for data storage, processing, and analysis.
3. Analyze Data Lifecycle: Learn the complete data lifecycle, from data ingestion
and processing to storage and analysis, understanding how each phase
contributes to effective decision-making.
4. Apply Data Analytics Techniques: Understand various data analysis
techniques, including descriptive, diagnostic, predictive, and prescriptive
analytics, and how these techniques can be applied to extract actionable
insights.
5. Evaluate Ethical and Privacy Considerations: Discuss the ethical implications
of Big Data analysis, including issues of data privacy, security, and
governance, and how to ensure responsible data handling.
Agenda
► Introduction to Big data
► What is big data
► Characteristics of Big data
► Importance and Benefits
► Working of big data
► Big data challenges
► Business applications
INTRODUCTION
► Big Data refers to large and complex datasets that traditional data processing
applications cannot handle efficiently.
Introduction to Big data
What is Data?
► The quantities, characters, or symbols on which operations are
performed by a computer, which may be stored and
transmitted in the form of electrical signals and recorded on
magnetic, optical, or mechanical recording media.
► Technologies
▪ Data Storage
▪ Data Processing
▪ Data Ingestion
▪ Data Analysis and Machine Learning
▪ Data Visualization
► Frameworks and Ecosystems
▪ Apache Hadoop Ecosystem
▪ Apache Spark Ecosystem
▪ Lambda Architecture
BIG DATA TECHNOLOGIES
► Cloud-Based Big Data Technologies
▪ Amazon Web Services (AWS)
▪ Google Cloud Platform (GCP)
▪ Microsoft Azure