Big Data Syllabus
Course Category: Programme Elective-II
Credits: 3
Course Type: Theory
Lecture-Tutorial-Practice: 3-0-3
Prerequisites: 17IT3502 - Data Warehousing and Mining
Continuous Evaluation: 30
Semester End Evaluation: 70
Total Marks: 100
Course Outcomes
Upon successful completion of the course, the student will be able to:
CO1 Analyze the Hadoop architecture (NameNode) and the Big Data lifecycle.
CO2 Master the concepts of the Hadoop Distributed File System.
CO3 Acquire knowledge of the MapReduce framework.
CO4 Apply Pig and Hive concepts for data processing.
Contribution of Course Outcomes towards achievement of Program Outcomes
(L-Low, M-Medium, H-High)
Columns: PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
CO1: M L M
CO2: L M
CO3: M M
CO4: M L H
Course Content
UNIT I
Introduction to Big Data:
Big Data definition, Characteristics of Big Data (Volume, Variety,
Velocity), Data in the Warehouse and Data in Hadoop, Why is Big Data
Important? Patterns for Big Data Development.
Introduction to Hadoop:
Data, Data Storage and Analysis, Comparison with Other Systems:
RDBMS, Grid Computing, Volunteer Computing, A Brief History of
Hadoop, Apache Hadoop and the Hadoop Ecosystem, Hadoop Releases.
UNIT II
Hadoop Distributed File System: The Design of HDFS, HDFS Concepts,
Blocks, Namenodes and Datanodes, Basic Filesystem Operations, Hadoop
Filesystems, Interfaces, The Java Interface, Reading Data from a
Hadoop URL, Data Flow, Anatomy of a File Read, Anatomy of a File Write,
Coherency Model.
UNIT III
MapReduce: A Weather Dataset, Data Format, Analyzing the Data with
Unix Tools, Analyzing the Data with Hadoop, Map and Reduce, Java
MapReduce, Scaling Out, Hadoop Streaming, Hadoop Pipes.
Pig: Installing and Running Pig, Execution Types, Running Pig
Programs, Pig Latin Editors, Comparison with Databases, Pig Latin,
Functions, Data Processing Operators.
UNIT IV
Hive: Installing Hive, An Example, Running Hive, Comparison with
Traditional Databases, HiveQL, Tables, Querying Data.
Textbooks and Reference Books
Text Book(s):
[1] Dirk deRoos, Chris Eaton, George Lapis, Paul Zikopoulos, Tom
Deutsch, "Understanding Big Data: Analytics for Enterprise Class Hadoop and
Streaming Data", 1st Edition, TMH, 2012.
[2] Tom White, "Hadoop: The Definitive Guide", 3rd Edition, O'Reilly
Publications, 2012.
Reference Books: