Department of Computer Science and Engineering
COURSE DELIVERY PLAN
(This document is to be presented to the students during the first week of commencement of
classes, and a soft copy is to be sent to every student of the class by the course instructor)
ACADEMIC YEAR: 2022-2023 DEPARTMENT: CSE
PROGRAMME: B.E. SEMESTER: VII
SUBJECT TITLE: Big Data Analytics SUBJECT CODE: 18CS72
TOTAL DURATION (HRS): 50 CREDITS: 4
PREREQUISITE: Data Mining and Data Warehousing, Machine Learning
INSTRUCTOR(S): Dr. S. Usha, Dr. D. Kirubha, Prof. Deepa Konnur
COURSEWARE HOME PAGE:
TEXTBOOKS
1. Raj Kamal and Preeti Saxena, "Big Data Analytics: Introduction to Hadoop, Spark,
and Machine-Learning", McGraw Hill Education, 2018. ISBN: 9789353164966,
9353164966
2. Douglas Eadline, "Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data
Computing in the Apache Hadoop 2 Ecosystem", 1st Edition, Pearson Education,
2016. ISBN-13: 978-9332570351
REFERENCE BOOKS
1. Tom White, "Hadoop: The Definitive Guide", 4th Edition, O'Reilly Media,
2015. ISBN-13: 978-9352130672
2. Boris Lublinsky, Kevin T. Smith, Alexey Yakubovich, "Professional Hadoop
Solutions", 1st Edition, Wrox Press, 2014. ISBN-13: 978-8126551071
3. Eric Sammer, "Hadoop Operations: A Guide for Developers and
Administrators", 1st Edition, O'Reilly Media, 2012. ISBN-13: 978-9350239261
4. Arshdeep Bahga, Vijay Madisetti, "Big Data Analytics: A Hands-On Approach", 1st
Edition, VPT Publications, 2018. ISBN-13: 978-0996025577
MOOC
1. https://www.edx.org/search?q=Big+data+analytics&tab=course
2. https://nptel.ac.in/courses/106104189
Journal Reference Paper Details
J1 https://journalofbigdata.springeropen.com/
J2 https://bdataanalytics.biomedcentral.com/
J3 https://www.springer.com/journal/42421
J4 https://www.techscience.com/journal/jbd
Web Resources (W1, W2, W3, ...)
Web Reference Website Link
W1 https://www.coursera.org/articles/big-data-analytics
W2 https://www.techtarget.com/searchbusinessanalytics/definition/big-data-analytics
W3 https://www.ibm.com/in-en/analytics/big-data-analytics
PROGRAMME EDUCATIONAL OBJECTIVES (PEO)
(to be achieved by the graduate after 4 years of graduation)
PEO No. Program Educational Objectives Statements
PEO 1 Graduates of the program will possess a strong educational foundation in mathematics, science, and computer science & engineering necessary for pursuing a career and/or higher studies in computing as well as other allied professions.
PEO 2 Graduates with an interest in, and aptitude for, lifelong learning will be engaged in learning, understanding, and applying new ideas and technologies as the field evolves to solve engineering problems, design appropriate computing systems that are technically sound, economically viable and socially acceptable, and be responsible engineering and computing professionals.
PEO 3 Graduates will be informed leaders and effective communicators who work efficiently with diverse teams, and who promote and practice appropriate ethical and moral codes.
PROGRAMME OUTCOMES (PO COMMON TO ALL PROGRAMMES)
(To be achieved by the student after every semester/year/and at the time of graduation)
PO1 Apply knowledge of mathematics and science, with fundamentals of Computer Science & Engineering, to be able to solve complex engineering problems related to CSE.
PO2 Apply mathematical foundations, algorithmic principles, and computer science theory in the modelling and design of computer-based systems in a way that demonstrates comprehension of tradeoffs involved in design choices.
PO3 Analyze a problem and identify and define the computing requirements appropriate to its solution.
PO4 Apply design and development principles in the construction of software systems of varying complexity.
PO5 Design, implement, and evaluate a software or a software/hardware system, component, or process to meet desired needs within realistic constraints such as memory, runtime efficiency, as well as appropriate constraints related to economic, environmental, social, political, ethical, health and safety, manufacturability, and sustainability considerations.
PO6 Use the techniques, skills, and modern engineering tools necessary for practice as a CSE professional.
PO7 Work effectively as an individual, and as a member or leader in diverse teams and in multidisciplinary environments.
PO8 Demonstrate knowledge of contemporary issues and understand professional, ethical, legal, security and social issues and responsibilities.
PO9 Analyze the local and global impact of computing on individuals, organizations, and society.
PO10 Demonstrate knowledge and understanding of the engineering and management principles, including financial implications, and apply these to his/her work, as a member and leader in a team, and to manage project work as part of a multidisciplinary team.
PO11 Communicate effectively in both verbal and written forms.
PO12 Recognize the need for and be motivated to engage in life-long learning and continuing professional development.
PROGRAM SPECIFIC OUTCOMES (PSO)
PSO1 Understand, apply, and demonstrate advanced technical skills in problem solving and leadership, as well as an understanding of system integration and the practical technological problems of end users.
PSO2 An ability to design, implement, and evaluate a software or a software/hardware system, component, or process to meet desired needs within realistic constraints such as memory, runtime efficiency, as well as other socio-economic constraints.
GRADUATE ATTRIBUTES (NBA): All graduates of any higher education programme are
expected to have identified technical/functional, generic and managerial competencies. The
competencies that a graduate of a programme should have are called Graduate Attributes. The
attributes a graduating engineer should have are generally identified by the accreditation agency
for engineering and technical education, namely the National Board of Accreditation (NBA) in
India. The Graduate Attributes of Engineering Programmes as identified by NBA are:
1. Engineering knowledge: Apply the knowledge of mathematics, science, engineering
fundamentals, and an engineering specialization to the solution of complex engineering
problems.
2. Problem analysis: Identify, formulate, research literature, and analyze complex
engineering problems reaching substantiated conclusions using first principles of
mathematics, natural sciences, and engineering sciences.
3. Design/development of solutions: Design solutions for complex engineering
problems and design system components or processes that meet the specified needs with
appropriate consideration for the public health and safety, and the cultural, societal, and
environmental considerations.
4. Conduct investigations of complex problems: Use research-based knowledge and
research methods including design of experiments, analysis and interpretation of data,
and synthesis of the information to provide valid conclusions.
5. Modern tool usage: Create, select, and apply appropriate techniques, resources, and
modern engineering and IT tools including prediction and modeling to complex
engineering activities with an understanding of the limitations.
6. The engineer and society: Apply reasoning informed by the contextual knowledge to
assess societal, health, safety, legal and cultural issues and the consequent responsibilities
relevant to the professional engineering practice.
7. Environment and sustainability: Understand the impact of the professional
engineering solutions in societal and environmental contexts, and demonstrate the
knowledge of, and need for sustainable development.
8. Ethics: Apply ethical principles and commit to professional ethics and responsibilities
and norms of the engineering practice.
9. Individual and team work: Function effectively as an individual, and as a member or
leader in diverse teams, and in multidisciplinary settings.
10. Communication: Communicate effectively on complex engineering activities with
the engineering community and with society at large, such as, being able to comprehend
and write effective reports and design documentation, make effective presentations, and
give and receive clear instructions.
11. Project management and finance: Demonstrate knowledge and understanding of
the engineering and management principles and apply these to one’s own work, as a
member and leader in a team, to manage projects and in multidisciplinary environments.
12. Life-long learning: Recognize the need for, and have the preparation and ability to
engage in independent and life-long learning in the broadest context of technological
change.
Bloom's Taxonomy is to be followed in curriculum development, courseware
development, planning and delivery of contents, assessment, mapping, data analysis
and CQI (Continuous Quality Improvement).
COURSE OUTCOMES (CO)
CO No. | Course Outcome | Attainment Threshold (%)
CO-1 | Understand fundamentals of Big Data analytics. | 60
CO-2 | Understand Hadoop framework and Hadoop Distributed File System. | 60
CO-3 | Illustrate the concepts of NoSQL using MongoDB and Cassandra for Big Data. | 60
CO-4 | Demonstrate the MapReduce programming model to process the big data along with Hadoop tools. | 60
CO-5 | Apply machine learning algorithms for real world big data and analyze web contents and social networks to provide analytics with relevant visualization tools. | 60
PLAN FOR MAPPING OF COs TO POs
S. No  Course Outcome (CO)  PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
1 CO-1 2 - - 1 - - - - - - - 1 - -
2 CO-2 2 - 1 1 - - - - - - - 1 - -
3 CO-3 2 - 2 1 - - - - - - - 1 1 -
4 CO-4 2 1 2 1 - - - - - - - 1 1 -
5 CO-5 2 2 2 1 - - - - - - - 1 1 -
Note: 1: Slight (Low) 2: Moderate (Medium) 3: Substantial (High)
Bridging the Curriculum Gap (Additional Topics beyond syllabus/Seminars/Assignments):
Big data analytics in the cloud
Predictive analytics
Deep learning
Related Website URLs:
https://journalofcloudcomputing.springeropen.com/articles/10.1186/s13677-022-00301-w
https://www.ibm.com/in-en/analytics/predictive-analytics
https://journalofbigdata.springeropen.com/articles/10.1186/s40537-014-0007-7
Related Video Course Materials (min. 3 nos.):
https://nptel.ac.in/courses/106104189
https://www.youtube.com/watch?v=OP8BsGnqi9c
https://www.youtube.com/watch?v=KcecJfxbd-4
COURSE DELIVERY PLAN
Period # | Topic / Session topic | Pertaining CO/CLOs | Topic Learning Outcome (TLO) | Instructional Methods / Activities | Cumulative Hours
UNIT I Introduction to Big Data Analytics (Text Book 1: Chapter 1: 1.2-1.7)
1 | Introduction to Big Data | CO1 | To know about the basic concept of Big Data | Black Board, Interaction | 1
2 | Scalability | CO1 | To become familiar with Scalability | Black Board, Interaction | 2
3 | Parallel Processing | CO1 | To know about Parallel Processing | Black Board, Interaction | 3
4 | Designing Data Architecture | CO1 | To become familiar with the concepts of Designing Data Architecture | Black Board, Interaction | 4
5 | Data Sources | CO1 | To become familiar with the concepts of Data Sources | Black Board, Interaction | 5
6 | Quality | CO1 | To know the basic concepts of Quality | Black Board, Interaction | 6
7 | Preprocessing and storing | CO1 | To become familiar with Preprocessing and storing | Black Board, Interaction | 7
8 | Data storage and Analysis | CO1 | To become familiar with Data storage and Analysis | Black Board, Interaction | 8
9 | Big data Analytics applications | CO1 | To become familiar with big data Analytics applications | Black Board, Interaction | 9
10 | Case Studies | CO1 | To know the case studies about big data | Group Activities | 10
UNIT II Introduction to Hadoop, Hadoop Distributed File System Basics, Essential Hadoop Tools
(Text Book 1: Chapter 2: 2.1-2.6, Text Book 2: Chapter 3, Text Book 2: Chapter 7 (except walk throughs))
11 | Introduction to Hadoop | CO2 | To know about the basic concept of Hadoop | Black Board, Interaction | 11
12 | Hadoop and its Ecosystem | CO2 | To know about Hadoop and its Ecosystem | Black Board, Interaction | 12
13 | Hadoop Distributed File System | CO2 | To know about Hadoop Distributed File System | Black Board, Interaction | 13
14 | MapReduce Framework and Programming Model | CO2 | To know about MapReduce Framework and Programming Model | Black Board, Interaction | 14
15 | Hadoop YARN | CO2 | To know about Hadoop YARN | Black Board, Interaction | 15
16 | Hadoop Ecosystem Tools | CO2 | To become familiar with Hadoop Ecosystem Tools | Black Board, Interaction | 16
17 | HDFS Design Features | CO2 | To know about HDFS Design Features | Black Board, Interaction | 17
18 | Components, HDFS User Commands | CO2 | To become familiar with Components and HDFS User Commands | Power Point Presentation | 18
19 | Using Apache Pig, Hive, Sqoop | CO2 | To become familiar with using Apache Pig, Hive, Sqoop | Video presentation | 19
20 | Flume, Oozie, HBase | CO2 | To become familiar with Flume, Oozie, HBase | Video presentation | 20
UNIT III NoSQL Big Data Management, MongoDB, and Cassandra (Text Book 1: Chapter 3: 3.1-3.7)
21 | Introduction | CO3 | To know about the basic concept of NoSQL | Black Board, Interaction | 21
22 | NoSQL Data Store | CO3 | To become familiar with NoSQL Data Store | Black Board, Interaction | 22
23 | NoSQL Data Architecture Patterns | CO3 | To become familiar with the NoSQL Data Architecture Patterns | Black Board, Interaction | 23
24 | NoSQL to Manage Big Data | CO3 | To become familiar with NoSQL to Manage Big Data | Black Board, Interaction | 24
25 | NoSQL to Manage Big Data | CO3 | To execute the concepts of NoSQL to Manage Big Data | Black Board, Interaction | 25
26 | Shared-Nothing Architecture for Big Data Tasks | CO3 | To execute the concepts of Shared-Nothing Architecture for Big Data Tasks | Black Board, Interaction | 26
27 | MongoDB Databases | CO3 | To know the procedure of MongoDB Databases | Power Point Presentation | 27
28 | MongoDB Databases | CO3 | To become familiar with MongoDB Databases | Video presentation | 28
29 | Cassandra Databases | CO3 | To know the procedure of Cassandra Databases | Power Point Presentation | 29
30 | Cassandra Databases | CO3 | To become familiar with Cassandra Databases | Video presentation | 30
UNIT IV MapReduce, Hive and Pig (Text Book 1: Chapter 4: 4.1-4.6)
31 | Introduction | CO4 | To know about the basic concept of MapReduce | Black Board, Interaction | 31
32 | MapReduce Map Tasks | CO4 | To know about MapReduce Map Tasks | Black Board, Interaction | 32
33 | MapReduce Map Tasks | CO4 | To become familiar with MapReduce Map Tasks | Black Board, Interaction | 33
34 | Reduce Tasks and MapReduce Execution | CO4 | To know about Reduce Tasks and MapReduce Execution | Black Board, Interaction | 34
35 | Reduce Tasks and MapReduce Execution | CO4 | To become familiar with Reduce Tasks and MapReduce Execution | Black Board, Interaction | 35
36 | Composing MapReduce for Calculations and Algorithms | CO4 | To know about Composing MapReduce for Calculations and Algorithms | Black Board, Interaction | 36
37 | Composing MapReduce for Calculations and Algorithms | CO4 | To become familiar with Composing MapReduce for Calculations and Algorithms | Power Point Presentation | 37
38 | Hive | CO4 | To know about Hive | Video presentation | 38
39 | HiveQL | CO4 | To become familiar with HiveQL | Power Point Presentation | 39
40 | Pig | CO4 | To know about Pig | Video presentation | 40
UNIT V Machine Learning Algorithms for Big Data Analytics, Text, Web Content, Link, and Social Network Analytics
(Text Book 1: Chapter 6: 6.1 to 6.5, Text Book 1: Chapter 9: 9.1 to 9.5)
41 | Introduction, Estimating the relationships | CO5 | To know about the basic concept of Estimating the relationships | Black Board, Interaction | 41
42 | Outliers, Variances, Probability Distributions, and Correlations | CO5 | To know about Outliers, Variances, Probability Distributions, and Correlations | Black Board, Interaction | 42
43 | Regression analysis | CO5 | To become familiar with Regression analysis | Black Board, Interaction | 43
44 | Finding Similar Items, Similarity of Sets and Collaborative Filtering | CO5 | To know about finding Similar Items, Similarity of Sets and Collaborative Filtering | Black Board, Interaction | 44
45 | Frequent Itemsets and Association Rule Mining | CO5 | To become familiar with Frequent Itemsets and Association Rule Mining | Black Board, Interaction | 45
46 | Introduction, Text mining | CO5 | To know about Text mining | Black Board, Interaction | 46
47 | Web Mining, Web Content and Web Usage Analytics | CO5 | To become familiar with Web Mining, Web Content and Web Usage Analytics | Power Point Presentation | 47
48 | Page Rank, Structure of Web and analyzing a Web Graph | CO5 | To know about the Page Rank, Structure of Web and analyzing a Web Graph | Video presentation | 48
49 | Social Network as Graphs | CO5 | To become familiar with Social Network as Graphs | Power Point Presentation | 49
50 | Social Network Analytics | CO5 | To know about Social Network Analytics | Video presentation | 50
METHODOLOGY TO MAP OBJECTIVE WITH OUTCOME
Course outcomes are achieved through
a. Suitable Analogies
b. Classroom teaching
c. Assignments
d. Tutorials
e. Internal Assessment test and Model Exam
f. Brainstorming
g. Group discussion and role play
h. Seminars
FACULTY SIGNATURE HOD