Big Data Lecture
20CS2005
The student will be able to:
1. understand the importance and challenges of Big Data
2. design applications using Hadoop and RHadoop
3. identify the appropriate function of the Pig data model to be used in development
4. model Big Data application schemas and use HiveQL
5. develop applications with Cassandra
6. build applications with HDFS and MapReduce
Module 2: Data Analysis using R and Hadoop
Features of R language - Hadoop features - HDFS and MapReduce architecture - R and Hadoop
Integrated Programming Environment (RHIPE) introduction - Architecture of RHIPE - RHIPE function
reference - RHadoop introduction - Architecture of RHadoop - RHadoop function reference - SQL on
Hadoop.
Big Data
Hadoop and Spark
History of Spark
2009: Started at UC Berkeley AMPLab by Matei Zaharia
2010
2013: Given to the Apache Software Foundation, and the license was changed to Apache 2.0
2014
Present: Exists as a next-generation real-time and batch processing framework
THANK YOU