0% found this document useful (0 votes)
2 views16 pages

PPT 1.1.4

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1/ 16

Computer Science & Engineering


CHANDIGARH UNIVERSITY, MOHALI

BIG Data Analytics


21CSH-471

BY : Urvashi

Assistant Professor (Chandigarh


University)
Contents to be covered in UNIT
1
UNIT-1 Overview of Computing Paradigm Contact Hours:15

Chapter-1 Introduction to Big Data – Definition and Characteristics; The 5 V’s of Big Data – Volume: Data at scale,
Understandin Velocity: Real-time data processing, Variety: Structured, semi-structured, unstructured data, Veracity:
g Big Data Uncertainty and trustworthiness in data, Value: Transforming data into insights; Challenges and
and the 5 V’s Opportunities in Big Data; Big Data Use Cases in Real-World Applications

Chapter – 2 Fundamentals of Big Data Architecture: Data ingestion, storage, processing and visualization layers
Big Data Streaming Data in Big Data: Tools such as Spark, Apache Kafka and Flink
Architecture Real-World Big Data Architecture: Lambda and Kappa Architectures, Hybrid Architecture for batch and
real-time processing

Chapter – 3 Introduction to the Hadoop Ecosystem; HDFS (Hadoop Distributed File System): Architecture and
The Hadoop Functionality; MapReduce Programming Model: Workflow and Applications; YARN (Yet Another
Ecosystem Resource Negotiator): Resource Management; Tools in the Ecosystem: Pig, HBase, Flume, and Oozie;
Data Processing with Hadoop: ETL, Analytics and Reporting.
Course Outcomes

CO1 Understand the Fundamentals of Big Data.

CO2 Master Big Data Architecture and Tools

CO3 Explore the Hadoop Ecosystem and Data Processing Models

CO4 Develop Data Science Skills and Tools

CO5 Implement Real-Time Data Analytics and Visualization

3
Challenges and Opportunities in
Big Data
Big Data, defined by the characteristics of volume,
velocity, variety, veracity, and value (the "5Vs"), is
transforming industries by enabling organizations to
extract meaningful insights from massive datasets.
However, its adoption presents both significant
challenges and immense opportunities. This report
explores these aspects in detail.
Challenges in Big Data:

1. Data Management and Storage


1. The rapid growth of data makes storage a critical issue. Traditional database systems
struggle to handle the volume and velocity of big data.
2. High storage costs and the need for efficient retrieval mechanisms are persistent hurdles.
2. Data Quality and Integration
1. Data comes from diverse sources and formats, leading to inconsistencies, errors, and
duplication.
2. Integrating structured and unstructured data into a coherent framework is complex and
time-intensive.
Challenges in Big Data:
3. Processing and Analysis
o Big data requires advanced computational capabilities for real-time or near-real-time
processing.
o Existing analytical tools may not scale effectively, leading to delays in deriving actionable
insights.
4. Privacy and Security
o Big data often contains sensitive information, raising concerns about data breaches and
misuse.
o Ensuring compliance with privacy regulations like GDPR and CCPA adds another layer of
complexity.
Challenges in Big Data:
5. Skill Shortages
o There is a scarcity of professionals skilled in big data technologies like
Hadoop, Spark, and machine learning algorithms.
o This skills gap increases the cost and slows down the adoption of big data
solutions.
6. Ethical Concerns
o The use of predictive analytics raises ethical questions about bias,
discrimination, and the unintended consequences of decision-making based
on data insights.

Opportunities in Big Data:
1. Enhanced Decision-Making
o Advanced analytics enables data-driven decision-making,
improving efficiency and effectiveness across industries like
healthcare, finance, and manufacturing.
2. Personalization and Customer Insights
o Businesses can leverage big data to understand consumer
behavior and preferences, enabling personalized marketing and
improved customer experience.
Opportunities in Big Data:
3. Innovation and Product Development
o Analyzing trends and consumer feedback fosters innovation,
guiding companies to develop products that meet market
demands.
4. Operational Efficiency
o Big data analytics optimizes processes, reduces waste, and
predicts maintenance needs, particularly in industries like logistics
and energy.
Opportunities in Big Data:
5. Improved Healthcare Outcomes
o Big data plays a critical role in personalized medicine, epidemic
tracking, and operational management in hospitals, leading to better
patient care.
6. Smart Cities and IoT
o Integrating big data with IoT devices supports the development of
smart cities, improving urban planning, traffic management, and
sustainability.
7. Risk Management
o Predictive analytics helps in identifying risks and fraudulent activities,
particularly in the financial sector.
Strategies to Address Challenges
1. Investing in Scalable Infrastructure
o Organizations should adopt cloud-based solutions for cost-
effective and scalable storage and processing.
2. Standardization and Governance
o Implementing robust data governance policies ensures data
quality and facilitates integration.
3. Upskilling the Workforce
o Continuous training programs for employees in big data tools and
techniques can bridge the skills gap.
Strategies to Address Challenges
4. Advanced Security Measures
oEncryption, access controls, and regular audits can
mitigate security risks.
5. Collaborative Ecosystems
oPartnerships between academia, industry, and
government can foster innovation and address
regulatory and ethical challenges.
Future Trends in Big Data

Emerging Trends:

• AI Integration: Enhanced analytics with AI.


• Edge Computing: Decentralized data processing.
• Data Democratization: Wider access across
organizations.
• Focus on Real-Time Analytics: Faster decision-making.

Visuals: Trend graphs or futuristic imagery.


13
Reference Books
TEXT BOOKS

1. Mohammed Guller, Big Data Analytics with Spark, Apress,2015


2. Tom Mitchell, “Machine Learning”, McGraw Hill, 3rdEdition,1997
3. Michael Minelli, Michehe Chambers, “Big Data, Big Analytics: Emerging Business
Intelligence and Analytic Trends for Today’s Business”, 1stEdition, Ambiga Dhiraj, Wiely
CIO Series, 2013.
4. Arvind Sathi, “Big Data Analytics: Disruptive Technologies for Changing the Game”,1st
Edition, IBM Corporation, 2012.

REFERENCE BOOKS
5. Chris Eaton, Dirk deroos et al., “Understanding Big data”, McGraw Hill, 2012.
6. Vignesh Prajapati, “Big Data Analytics with R and Hadoop”, Packet Publishing 2013.
7. JyLiebowitz, “Big Data and Business Analytics”, CRC press, 2013.
For more insight
Web sources 
1. https://www.alliant.edu/blog/4-top-
online-resources-data-analytics?
utm_source=chatgpt.com
2. https://www.alliant.edu/blog/4-top-
online-resources-data-analytics?
utm_source=chatgpt.com
3. https://www.coursera.org/articles/
big-data-technologies?
utm_source=chatgpt.com
4. https://careerfoundry.com/en/ Big Data Big Big Data and
Analytics Analytics
blog/data-analytics/where-to-find- Wiley
free-datasets/?
utm_source=chatgpt.com
THANK YOU

For queries
Email: [email protected]

You might also like