Big Data Syllabus

This document describes the 17IT4604A BIGDATA course, a 3-credit Programme Elective-II course. The course content covers an introduction to big data and Hadoop, the Hadoop Distributed File System (HDFS), MapReduce, Pig, and Hive. Upon completing the course, students will be able to analyze Hadoop architecture, master HDFS concepts, acquire knowledge of the MapReduce framework, and apply Pig and Hive for data processing.

Uploaded by Avinash Bommina

17IT4604A BIGDATA

Course Category: Programme Elective-II
Credits: 3
Course Type: Theory
Lecture-Tutorial-Practice: 3-0-3
Prerequisites: 17IT3502 - Data Warehousing and Mining
Continuous Evaluation: 30
Semester End Evaluation: 70
Total Marks: 100

Course Outcomes
Upon successful completion of the course, the student will be able to:
CO1 Analyze Hadoop architecture: NameNode, Big Data lifecycle.
CO2 Master the concepts of the Hadoop Distributed File System.
CO3 Acquire knowledge of the MapReduce framework.
CO4 Apply Pig and Hive concepts for data processing.
Contribution of Course Outcomes towards achievement of Program Outcomes (L-Low, …)
Columns: PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
CO1: M L M
CO2: L M
CO3: M M
CO4: M L H
Course Content
UNIT I
Introduction to Big Data:
Big Data-definition, Characteristics of Big Data (Volume, Variety,
Velocity), Data in the Warehouse and Data in Hadoop, Why is Big Data
Important? Patterns for Big Data Development.
Introduction to Hadoop:
Data, Data Storage and Analysis, Comparison with Other Systems:
RDBMS, Grid Computing, Volunteer Computing, A Brief History of
Hadoop, Apache Hadoop and the Hadoop Ecosystem, Hadoop Releases.
UNIT II
Hadoop Distributed File System: The Design of HDFS, HDFS Concepts,
Blocks, Namenodes and Datanodes, Basic Filesystem Operations, Hadoop
Filesystems, Interfaces, The Java Interface, Reading Data from a
Hadoop URL, Data Flow, Anatomy of a File Read, Anatomy of a File Write,
Coherency Model.
UNIT III
MapReduce: A Weather Dataset, Data Format, Analyzing the Data with
Unix Tools, Analyzing the Data with Hadoop, Map and Reduce, Java Map
Reduce, Scaling Out, Hadoop Streaming, Hadoop Pipes.
Pig: Installing and Running Pig, Execution Types, Running Pig
Programs, Pig Latin Editors, Comparison with Databases, Pig Latin,
Functions, Data Processing Operators.
UNIT IV
Hive: Installing Hive, An Example, Running Hive, Comparison with
Traditional Databases, HiveQL, Tables, Querying Data.
Textbooks and Reference Books
Text Book(s):
[1]. Dirk deRoos, Chris Eaton, George Lapis, Paul Zikopoulos, Tom Deutsch, "Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data", 1st Edition, TMH, 2012.
[2]. Tom White, "Hadoop: The Definitive Guide", 3rd Edition, O'Reilly Publications, 2012.

Reference Books:

[1]. Michael Berthold, David J. Hand, "Intelligent Data Analysis", Springer, 2007.
[2]. David Loshin, "Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools, Techniques, NoSQL, and Graph", Morgan Kaufmann Publishers, 2013.
[3]. Alex Holmes, "Hadoop in Practice", Manning.
[4]. Chuck Lam, "Hadoop in Action", Manning.
E-resources and Other Digital Materials
[1]. Big Data Use Cases for Beginners | Real Life Case Studies | Success Stories, https://www.youtube.com/watch?v=HHR0-iJp2sM
[2]. Alexey Grishchenko, Hadoop vs MPP, https://0x0fff.com/hadoop-vs-mpp/
[3]. Random notes on big data, SlideShare. Available: www.slideshare.net/yiranpang/random-notes-on-big-data-26439474
