0% found this document useful (0 votes)
38 views5 pages

BD Course Handout

The document provides details about the Big Data course offered at Kalinga Institute of Industrial Technology, including the course code, title, credits, instructor, timings, objectives, outcomes, contents, textbooks, and lesson plan. The course aims to help students understand big data concepts and technologies like Hadoop, analyze large and streaming data, and apply techniques using tools such as MapReduce, Pig, and Hive. It covers topics ranging from big data overview and characteristics to frameworks, visualization, and applications. The lesson plan outlines 30 lectures across 6 units to teach concepts and skills related to big data analytics.

Uploaded by

honeymodder
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views5 pages

BD Course Handout

The document provides details about the Big Data course offered at Kalinga Institute of Industrial Technology, including the course code, title, credits, instructor, timings, objectives, outcomes, contents, textbooks, and lesson plan. The course aims to help students understand big data concepts and technologies like Hadoop, analyze large and streaming data, and apply techniques using tools such as MapReduce, Pig, and Hive. It covers topics ranging from big data overview and characteristics to frameworks, visualization, and applications. The lesson plan outlines 30 lectures across 6 units to teach concepts and skills related to big data analytics.

Uploaded by

honeymodder
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

KALINGA INSTITUTE OF INDUSTRIAL TECHNOLOGY

Deemed to be University
BHUBANESWAR-751024

School of Computer Engineering


Autumn Semester 2023-24

Course Handout

1. Course code : CS 3032


2. Course Title : Big Data
3. LTP Structure :
L T P Total Credit
3 0 0 3 3
4. Course Faculty : Dr. Subhranshu Sekhar Tripathy
Contact Address and Time : Faculty Cabin No:- 401 Campus 14 , Block B.
Timings for Meeting:- 6:00-6:30 P.M
5. Timings :- 6:00-6:30 P.M.
6. Course offered to the School : Computer Engineering
7. Course Objective:

● To understand the concept and principles of big data.

● To explore the big data stacks and the technologies associated with it.

● To evaluate the different NoSQL databases and frameworks required to handle the big data.

● To formulate the concepts, principles and techniques focusing on the applications to industry
and real world experience.
● To contextually integrate and correlate large amounts of information to gain faster insights for
real time scenarios.
8. Course Outcome:
CO # Detail
CO1 Understand the concept of big data and its analytics in the real world
CO2 Analyse various big data technology foundations
CO3 Apply filtering technique to stream data
CO4 Apply Hadoop ecosystem paradigm using MapReduce, YARN, Pig, Hive, Scoop,
HBase to solve data intensive problems
CO5 Analyse big data framework like Hadoop and NoSQL to efficiently store and process
big data to generate analytics
CO6 Present appropriate solutions to big data analytics frameworks and visualization.
9. Course Contents
The course focuses on basic and essential topics in Big Data.
Unit # Unit Detailed Area
1 Overview of Importance of Data, Characteristics of Data, Analysis of
Big Data unstructured data, Introduction to Big Data, Challenges of
conventional systems, Data analytic, Evolution of analytic
scalability, Big Data Analytics, Key Big Data terminologies, Big
1
Data analytics lifecycle, Cloud Computing and Big Data.
2 Big Data Exploring the Big Data Stack, Data Sources Layer, Ingestion Layer,
Technology Storage Layer, Physical Infrastructure Layer, Platform Management
Foundations Layer, Security Layer, Monitoring Layer, Analytics Engine,
Visualization Layer, Big Data Applications, Virtualization.
3 Streaming Introduction to Streams Concepts – Stream data model and
architecture – Stream Computing, Sampling data in a stream –
Filtering streams, Counting distinct elements in a stream.
4 Hadoop Introduction to Hadoop, Hadoop Ecosystem, Hadoop Distributed
Ecosystem File System, MapReduce, YARN, Pig and PigLatin, Hive, Scoop,
HBase
5 Storing Data Data Models, RDBMS and Hadoop, Non-Relational Database,
in Big Data Introduction to NoSQL, Types of NoSQL, Polyglot Persistence,
Sharding
context.
6 Frameworks Distributed and Parallel Computing for Big Data, Big Data
And Visualizations – Visual data analysis techniques, interaction
Visualization techniques, applications

10. Text Book:


TB1. Big Data, Black Book, DT Editorial Services, Dreamtech Press, 2016
11. Reference Books:
RB1. Big Data and Analytics, Seema Acharya, Subhashini Chellappan, Infosys Limited,
Publication: Wiley India Private Limited,1st Edition 2015
RB2. Discovering, Analyzing, Visualizing and Presenting Data by EMC Education
Services (Editor), Wiley, 2014
RB3. Stephan Kudyba, Thomas H. Davenport, Big Data, Mining, and Analytics, Components of
Strategic Decision Making, CRC Press, Taylor & Francis Group. 2014
RB4. Norman Matloff , THE ART OF R PROGRAMMING, No Starch Press, Inc.2011
RB5. Big Data For Dummies, Judith Hurwitz et al. Wiley 2013.
RB6. Glenn J. Myatt, Making Sense of Data, John Wiley & Sons, 2007 Pete Warden,Big
Data Glossary, O’Reilly, 2011.
12. Pre-requisites:

● DBMS

13. Lesson Plan:


Lecture No. Unit Topics Lesson #
1-6 Overview of 1
● Importance of Data
Big Data
● Characteristics of Data, Analysis of Unstructured Data

● Combining Structured and Unstructured Sources


2
● Introduction to Big Data

● Challenges of conventional systems


3
● Data analytic

● Evolution of Analytic scalability


4
● Big Data Analytics

● Key Big Data terminologies


5
● Big Data analytics lifecycle

2
Lecture No. Unit Topics Lesson #
6
● Cloud Computing and Big Data

● Discussion
7-11 Big Data 7
● Exploring the Big Data Stack
Technology
Foundations ● Data Sources Layer

● Ingestion Layer
8
● Storage Layer

● Physical Infrastructure Layer

● Platform Management Layer


9
● Security Layer

● Monitoring Layer
10
● Analytics Engine

● Visualization Layer
11
● Big Data Applications, Virtualization.
12-14 Streaming 12
● Introduction to Streams Concepts

● Stream data model and architecture


13
● Stream Computing

● Sampling data in a stream


14
● Filtering streams

● Counting distinct elements in a stream.


15-22 Hadoop 15
● Introduction to Hadoop
Ecosystem
● Hadoop Ecosystem
16
● Hadoop Distributed File System

● MapReduce
17
● YARN
18
● Hive
19
● Pig and PigLatin
20
● HBase
21
● Scoop
22
● Discussion
23-30 Storing Data 23
● Data Models
in Big Data
context 24
● RDBMS and Hadoop
25
● Non-Relational Database

3
Lecture No. Unit Topics Lesson #
26
● Introduction to NoSQL
27
● Types of NoSQL
28
● Types of NoSQL cont...
29
● Polyglot Persistence
30
● Sharding

● Discussion
31-36 Framework 31
● Distributed and Parallel Computing for Big Data
&
visualization 32
● Big Data Visualizations – Visual data analysis
techniques
33
● Interaction techniques and applications
34
● Big Data Visualizations – Visual data analysis
techniques cont...
35
● Big Data Visualizations – Visual data analysis
techniques cont...
36
● Interaction techniques and applications

● Discussions

14. Assessment Components:


Sr # Assessment Time Weightage/ Course Lecture No. Mode
Component Marks
From To
1 Mid-Semester 1.5 Hrs 20 1 18 Closed Book
Examination
2 Activity based Through 30 1 36 Open Book,
Teaching and out Closed Book
15. Learning semester
3 End-Semester 3 Hrs 50 1 36 Closed Book
Examination
Assessment plan for activity based learning:

Considering the guidelines circulated and after discussing with the faculty members, following
activity based teaching and learning is proposed and Component wise distributions of the
activities are listed below.

4
1 2 3 4
5 6 7 8
9 10 11 12
13 14 15 16
17 18 19 20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38

Sl.No. Activity Date of Submission Marks

1 Assignment 10-08-2023 5

2 Assignment 24-08-2023 5

3 Class Test 06-09-2023 5

4 Group Activity 10-10-2023 5

5 Assignment 20-10-2023 5

6 Quiz 08-11-2023 5

You might also like