0% found this document useful (0 votes)
30 views7 pages

BIG Data Analytics 21CSH-471: Computer Science & Engineering

Uploaded by

ballurvsh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views7 pages

BIG Data Analytics 21CSH-471: Computer Science & Engineering

Uploaded by

ballurvsh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Computer Science & Engineering


CHANDIGARH UNIVERSITY, MOHALI

BIG Data Analytics


21CSH-471

BY : Urvashi

Assistant Professor (Chandigarh


University)
Contents to be covered in UNIT
3
Unit-3 Data Science in Big Data Contact Hours:15
Chapter-1
The Iterative Introduction to Data Science Projects: Stages and Lifecycle; Iterative process in Data Science: Problem
Nature of Definition, Data collection and exploration, Model development and evaluation; Refinement and
Data Science deployment; Importance of Iteration: Continuous improvement and error correction; Tools supporting
Projects Iteration: Notebooks, Version Control and CI/CD.

Introduction to Data Science Notebooks: Characteristics – Interactive, reproducible and modular


Chapter – 2 workflow, Key benefits – Visualization, documentation and collaboration;
Notebooks in Programming Languages for Data Science: Python – Libraries like pandas, NumPy and Matplotlib, R –
Data Science Strengths in statistical analysis and visualization; Mechanisms and Tolls in Notebooks: Code cells,
markdown, widgets, and extensions, Integration with Git and other data tools.

Chapter – 3 Major Data Science Notebooks: Jupyter Notebook, Google Colab and Zeppelin, Comparing features:
Notebooks Offline vs. cloud, extensions and performance;
and Data Getting started with Jupyter Notebook: Installation, environment setup, and basic usage, Working with
Science tools Python and R in Jupyter; Introduction to Tableau: Key features and use-cases, Data connection,
in Big Data visualization building and dashboard creation; Collaboration and Presentation tools for Data Insights.
Course Outcomes

CO1 Understand the Fundamentals of Big Data.

CO2 Master Big Data Architecture and Tools

CO3 Explore the Hadoop Ecosystem and Data Processing Models

CO4 Develop Data Science Skills and Tools

CO5 Implement Real-Time Data Analytics and Visualization

3
Questions?
• What is the primary business or research question we are trying to answer,
and how will success be measured?

• What data sources are available, and what are the key characteristics,
patterns, or anomalies in the collected data?

• Which algorithms or techniques are best suited for solving the problem,
and how are hyperparameters tuned during the development process?

• How do the evaluation metrics (e.g., accuracy, precision, recall, RMSE)


indicate the model's performance, and is it generalizable to unseen data?

4
Reference Books
TEXT BOOKS

1. Mohammed Guller, Big Data Analytics with Spark, Apress,2015


2. Tom Mitchell, “Machine Learning”, McGraw Hill, 3rdEdition,1997
3. Michael Minelli, Michehe Chambers, “Big Data, Big Analytics: Emerging Business
Intelligence and Analytic Trends for Today’s Business”, 1stEdition, Ambiga Dhiraj, Wiely CIO
Series, 2013.
4. Arvind Sathi, “Big Data Analytics: Disruptive Technologies for Changing the Game”,1st
Edition, IBM Corporation, 2012.

REFERENCE BOOKS
5. Chris Eaton, Dirk deroos et al., “Understanding Big data”, McGraw Hill, 2012.
6. Vignesh Prajapati, “Big Data Analytics with R and Hadoop”, Packet Publishing 2013.
7. JyLiebowitz, “Big Data and Business Analytics”, CRC press, 2013.
For more insight
Web sources 
1. https://www.alliant.edu/blog/4-top-
online-resources-data-analytics?
utm_source=chatgpt.com
2. https://www.alliant.edu/blog/4-top-
online-resources-data-analytics?
utm_source=chatgpt.com
3. https://www.coursera.org/articles/
big-data-technologies?
utm_source=chatgpt.com
4. https://careerfoundry.com/en/ Big Data Big Big Data and
Analytics Analytics
blog/data-analytics/where-to-find- Wiley
free-datasets/?
utm_source=chatgpt.com
THANK YOU

For queries
Email: [email protected]

You might also like