
Course Code: 21VMT1S403

Course Title: Big Data Analytics


Credits: 4
Semester / Academic Year: 4th Semester/2024-2025
Semester Dates: 13th July 2024 to 20th December 2024
Faculty: Prof. Kavitha S N

Teaching-Learning-Evaluation-Plan (TLEP)
Course Outcomes
CO1: Comprehend the core concepts of Big Data and the Analytics process and techniques.

CO2: Analyze MapReduce Programming Model.

CO3: Identify the need for the distributed file system to analyze Big Data.

CO4: Explore the usage of various tools of Hadoop.

CO5: Implement the concepts of NoSQL and Machine learning in Big Data analytics.

Course Topics & Coverage


Unit 1: Big Data and Analytics
Introduction to Big Data, Types of Digital Data, Evolution of Big Data, 3Vs of Big Data,
Challenges with Big Data, Applications of Big Data, Analytics Process, Model Analytics,
Analytical Model Requirements, Types of Data Sources, Sampling, Types of Data Elements,
Visual Data Exploration, and Exploratory Statistical Analysis
Learning Outcome: Comprehend the analytics process and learn to explore data visually.

Unit 2: Predictive Analytics


Target Definition, Linear Regression, Logistic Regression, Decision Trees, Neural Networks,
Support Vector Machines
Learning Outcome: Determine the application of Predictive Analytics and Neural Networks.

Unit 3: Descriptive Analytics and Survival Analysis


Association Rules, Sequence Rules, Segmentation, Survival Analysis Measurements,
Kaplan-Meier Analysis, Parametric Survival Analysis, Proportional Hazards Regression
Learning Outcome: Learn to apply descriptive and survival analysis.

Unit 4: Meet Hadoop


Data, Data Storage and Analysis, Comparison with Other Systems,
RDBMS, Grid Computing, A brief history of Hadoop, Apache Hadoop, and
Hadoop Ecosystem
Learning Outcome: Explore the relevance and application of Hadoop.

Unit 5: MapReduce
Analyzing Data with Unix Tools, Analyzing Data with Hadoop, Scaling Out, Hadoop Streaming
Learning Outcome: Comprehend and analyze the MapReduce programming model.

Unit 6: The Hadoop Distributed File System


The Design of HDFS, HDFS Concepts, Blocks, NameNodes and DataNodes, The Command
Line Interface, Hadoop File System Interface, The Java Interface, Data Flow
Learning Outcome: Apply a distributed file system to analyze big data.

Unit 7: YARN
Anatomy of a YARN Application Run, YARN Compared to MapReduce 1, Scheduling in YARN,
Framework for Processing Data in Hadoop: YARN and MapReduce, Running Applications
before Hadoop 2, Tracking JobTracker, Tracking TaskTracker, Launching a MapReduce
Application, Scouting Out the YARN Architecture, and Launching a YARN-Based Application
Learning Outcome: Determine the YARN framework for processing Big Data.

Unit 8: Pig: Hadoop Programming


Admiring the Pig Architecture, Going with the Pig Latin Application Flow, Working
through the ABCs of Pig Latin, Evaluating Local and Distributed Modes of Running Pig scripts,
Checking out the Pig Script Interfaces, Scripting with Pig Latin.
Learning Outcome: Implement the concepts of Hadoop programming and Pig.

Unit 9: Hadoop and the Data Warehouse


Comparing and Contrasting Hadoop with Relational Databases, NoSQL data stores, ACID
versus BASE data stores, Structured data storage and Processing in Hadoop
Learning Outcome: Compare and evaluate Hadoop with Relational Databases.

Unit 10: Native SQL Access to Hadoop Data


SQL’s Importance for Hadoop, Looking at What SQL Access Actually Means, SQL Access and
Apache Hive, IBM Big SQL
Learning Outcome: Assess the importance of SQL in Big Data Analysis.

Unit 11: Applying Structure to Hadoop Data with Hive


Saying Hello to Hive, Getting Started with Apache Hive, Examining the Hive Clients, Working
with Hive Data Types, Creating and Managing Databases and Tables, Seeing How the Hive
Data Manipulation Language Works.
Learning Outcome: Infer and implement the various features of Hive.

Unit 12: Deploying Hadoop


Working with Hadoop Cluster Components, Hadoop Cluster Configurations, Alternate
Deployment Form Factors, Virtualized servers, and Cloud deployments
Learning Outcome: Deploy Hadoop cluster components and summarize cloud deployments.

Unit 13: NoSQL Databases
MongoDB, CouchDB, Cassandra, Redis, Bigtable, HBase, Hypertable
Learning Outcome: Determine the working of NoSQL databases.

Unit 14: Servers and Processing


EC2, Google App Engine, Elastic Beanstalk, Processing: R, Yahoo Pipes, Elasticsearch,
BigSheets
Learning Outcome: Explore various cloud servers and processing.

Unit 15: NLP and Machine Learning


Natural Language Toolkit, OpenNLP, Boilerpipe, Machine Learning: WEKA, Mahout, scikits
Learning Outcome: Assess the relevance of NLP and Machine Learning in terms of Big Data
Analysis.
Weekly TLEP
The TLEP for online courses will be articulated as a weekly plan instead of the topic-wise plan. This
facilitates the specification of the learning goals and tasks for the learner on a weekly timeline while
mapping the various learning elements/tasks as belonging to one of the Four Quadrants of Online
Learning.

Quadrant 1: eTutorial

Quadrant 2: eContent

Quadrant 3: Discussions & Collaborative Learning

Quadrant 4: eAssessment

Important Note on Following the Plan: Please go through the tasks indicated in the four quadrants,
preferably, in the order of the numbers stated against each task.

Theory Application Practice

Week 1 Big Data and Analytics

Quadrant 1: eTutorial
6. Attend the live session #1 on “Introduction to Big Data”.
7. Attempt solving the questions for Practice.

Quadrant 2: eContent
3. Before the live session, watch the eLearning content on “Introduction to Big Data”.
4. Read the eLM on “Unit 1: Introduction to Big Data”.
9. Watch this lecture on “Introduction to Big Data” from IIT Kanpur - NPTEL by Prof. Rajiv Mishra:
https://www.youtube.com/watch?v=rvJgArru8dI&t=1702s

Quadrant 3: Discussions
10. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
1. Take a pre-assessment on “Introduction to Big Data”.
2. Follow the additional learning plan to improve your understanding of “Introduction to Big Data”, based on the report of the pre-assessment.
5. Take the formative assessment for “Introduction to Big Data”.
8. After the live session, repeat the formative assessment for “Introduction to Big Data” for self-assessment.

Week 2 Predictive Analytics

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Introduction to Big Data”.
5. Attend the live session #2 on “Predictive Analytics”.

Quadrant 2: eContent
2. Watch the eLearning content on “Predictive Analytics”.
3. Read the eLM on “Predictive Analytics”.
8. Watch this lecture on “Big Data Predictive Analytics” from IIT Kanpur - NPTEL by Prof. Rajiv Mishra:
https://www.youtube.com/watch?v=zHENiOM6aAs&list=PLFW6lRTa1g813IyYHLRP_bWJEKQDeEcSP&index=29

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Predictive Analytics”.
6. Repeat the formative assessment for “Predictive Analytics” for self-assessment.
7. Attempt solving the questions for Practice on “Predictive Analytics”.

Week 3 Descriptive Analytics and Survival Analysis

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Predictive Analytics”.
5. Attend the live session #3 on “Descriptive Analytics and Survival Analysis”.

Quadrant 2: eContent
2. Watch the eLearning content on “Descriptive Analytics and Survival Analysis”.
3. Read the eLM on “Descriptive Analytics and Survival Analysis”.
8. Read this resource: https://www.intechopen.com/chapters/64244

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Descriptive Analytics and Survival Analysis”.
6. Repeat the formative assessment for “Descriptive Analytics and Survival Analysis” for self-assessment.
7. Attempt solving the questions for Practice on “Descriptive Analytics and Survival Analysis”.

Week 4 Meet Hadoop, MapReduce

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Descriptive Analytics and Survival Analysis”.
5. Attend the live session #4 on “Hadoop” and “MapReduce”.

Quadrant 2: eContent
2. Watch the eLearning content on “Hadoop” and “MapReduce”.
3. Read the eLM on “Unit 4: Meet Hadoop” and “Unit 5: MapReduce”.
8. Read these resources: https://en.wikipedia.org/wiki/Apache_Hadoop and
https://www.databricks.com/glossary/mapreduce

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Hadoop” and “MapReduce”.
6. After the live session, repeat the formative assessment for “Hadoop” and “MapReduce” for self-assessment.
7. Attempt solving the questions for Practice on “Hadoop” and “MapReduce”.

Week 5 The Hadoop Distributed File System

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Meet Hadoop” and “MapReduce”.
5. Attend the live session #5 on “The Hadoop Distributed File System”.

Quadrant 2: eContent
2. Watch the eLearning content on “The Hadoop Distributed File System”.
3. Read the eLM on “Unit 6: The Hadoop Distributed File System”.
8. Read this resource: https://www.databricks.com/glossary/hadoop-distributed-file-system-hdfs

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “The Hadoop Distributed File System”.
6. After the live session, repeat the formative assessment for “The Hadoop Distributed File System” for self-assessment.
7. Attempt solving the questions for Practice on “The Hadoop Distributed File System”.

Week 6 YARN

Quadrant 1: eTutorial
1. Revise the recording of the live session on “The Hadoop Distributed File System”.
5. Attend the live session #6 on “YARN”.

Quadrant 2: eContent
2. Watch the eLearning content on “YARN”.
3. Read the eLM on “Unit 7: YARN”.
8. Read this resource: https://techmonitor.ai/what-is/what-is-yarn

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “YARN”.
6. After the live session, repeat the formative assessment for “YARN” for self-assessment.
7. Attempt solving the questions for Practice on “YARN”.

Week 7 Pig: Hadoop Programming, Hadoop and the Data Warehouse

Quadrant 1: eTutorial
1. Revise the recording of the live session on “YARN”.
5. Attend the live session #7 on “Pig: Hadoop Programming” and “Hadoop and the Data Warehouse”.

Quadrant 2: eContent
2. Watch the eLearning content on “Pig: Hadoop Programming” and “Hadoop and the Data Warehouse”.
3. Read the eLM on “Unit 8: Pig: Hadoop Programming” and “Unit 9: Hadoop and the Data Warehouse”.
8. Read these resources: https://pig.apache.org/ and
https://www.qubole.com/blog/hadoop-and-data-warehouse

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Pig: Hadoop Programming”.
6. After the live session, repeat the formative assessment for “Pig: Hadoop Programming” for self-assessment.
7. Attempt solving the questions for Practice on “Pig: Hadoop Programming” and “Hadoop and the Data Warehouse”.

Week 8 Native SQL Access to Hadoop Data

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Pig: Hadoop Programming” and “Hadoop and the Data Warehouse”.
5. Attend the live session #8 on “Native SQL Access to Hadoop Data”.

Quadrant 2: eContent
2. Watch the eLearning content on “Native SQL Access to Hadoop Data”.
3. Read the eLM on “Unit 10: Native SQL Access to Hadoop Data”.
8. Read this resource: https://jethro.io/sql-on-hadoop

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Native SQL Access to Hadoop Data”.
6. After the live session, repeat the formative assessment for “Native SQL Access to Hadoop Data” for self-assessment.
7. Attempt solving the questions for Practice on “Native SQL Access to Hadoop Data”.

Week 9 Applying Structure to Hadoop Data with Hive

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Native SQL Access to Hadoop Data”.
5. Attend the live session #9 on “Applying Structure to Hadoop Data with Hive”.

Quadrant 2: eContent
2. Watch the eLearning content on “Applying Structure to Hadoop Data with Hive”.
3. Read the eLM on “Unit 11: Applying Structure to Hadoop Data with Hive”.
8. Read this resource: https://hive.apache.org/

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Applying Structure to Hadoop Data with Hive”.
6. After the live session, repeat the formative assessment for “Applying Structure to Hadoop Data with Hive” for self-assessment.
7. Attempt solving the questions for Practice on “Applying Structure to Hadoop Data with Hive”.

Week 10 Deploying Hadoop

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Applying Structure to Hadoop Data with Hive”.
5. Attend the live session #10 on “Deploying Hadoop”.

Quadrant 2: eContent
2. Watch the eLearning content on “Deploying Hadoop”.
3. Read the eLM on “Unit 12: Deploying Hadoop”.
8. Read this resource: https://hadoop.apache.org/docs/r1.2.1/deployment_layout.html

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “Deploying Hadoop”.
6. After the live session, repeat the formative assessment for “Deploying Hadoop” for self-assessment.
7. Attempt solving the questions for Practice on “Deploying Hadoop”.
10. Attempt Continuous Internal Assessment 3.

Week 11 NoSQL Databases, Servers and Processing

Quadrant 1: eTutorial
1. Revise the recording of the live session on “Deploying Hadoop”.
5. Attend the live session #11 on “NoSQL Databases” and “Servers and Processing”.

Quadrant 2: eContent
2. Watch the eLearning content on “NoSQL Databases” and “Servers and Processing”.
3. Read the eLM on “Unit 13: NoSQL Databases” and “Unit 14: Servers and Processing”.
8. Read these resources: https://www.mongodb.com/resources/basics/databases/nosql-explained and
https://www.broadberry.com/markets/big-data-server-storage-solutions

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “NoSQL Databases” and “Servers and Processing”.
6. After the live session, repeat the formative assessment for “NoSQL Databases” and “Servers and Processing” for self-assessment.
7. Attempt solving the questions for Practice on “NoSQL Databases” and “Servers and Processing”.

Week 12 NLP and Machine Learning

Quadrant 1: eTutorial
1. Revise the recording of the live session on “NoSQL Databases” and “Servers and Processing”.
5. Attend the live session #12 on “NLP and Machine Learning”.

Quadrant 2: eContent
2. Watch the eLearning content on “NLP and Machine Learning”.
3. Read the eLM on “Unit 15: NLP and Machine Learning”.
8. Read this resource: https://www.lexalytics.com/blog/machine-learning-natural-language-processing/

Quadrant 3: Discussions
9. Participate in collaborative learning by discussing the questions for Practice.

Quadrant 4: eAssessment
4. Take the formative assessment for “NLP and Machine Learning”.
6. After the live session, repeat the formative assessment for “NLP and Machine Learning” for self-assessment.
7. Attempt solving the questions for Practice on “NLP and Machine Learning”.

Week 13 Self-Study for the Final Exam

Quadrant 1: eTutorial
2. Live session, if required, to address specific queries of learners.

Quadrant 2: eContent
1. Revise the eLMs and eLearning materials.

Quadrant 3: Discussions
4. Participate in collaborative learning on preparing for the summative assessment (final exam).

Quadrant 4: eAssessment
3. Take a mock exam of the final examination.

Week 14 Self-Study for the Final Exam

Quadrant 1: eTutorial
2. Live session, if required, to address specific queries of learners.

Quadrant 2: eContent
1. Revise the eLMs and eLearning materials.

Quadrant 3: Discussions
4. Participate in collaborative learning on preparing for the summative assessment (final exam).

Quadrant 4: eAssessment
3. Take a mock exam of the final examination.

Week 15-16 <<6th December 2024>>

Evaluation Plan

SNO  Assessment Item          Date  Weightage  Remarks
1    Continuous Assessment 1        15%$       Computer-Based Assignment
2    Continuous Assessment 2        15%$       Computer-Based Assignment
3    Continuous Assessment 3        15%$       Computer-Based Assignment
4    Final Examination              70%        Computer-Based, Proctored

$ The best scores in 2 out of the 3 assessments, for a total of 30%, will be taken as part of
the final assessment for grading.
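
The best-2-of-3 weighting can be illustrated with a short Python sketch. It is not an official grading tool: it assumes every score is expressed as a percentage (0 to 100), that each of the two best continuous assessments contributes 15%, and that the final examination contributes 70%. The function name `final_grade` and the sample scores are hypothetical.

```python
def final_grade(ca_scores, final_exam_score):
    """Combine continuous assessments and the final exam into one percentage.

    Assumption: all scores are percentages (0-100). The two highest of the
    three continuous assessments count 15% each; the final exam counts 70%.
    """
    if len(ca_scores) != 3:
        raise ValueError("expected exactly three continuous assessment scores")

    # Keep the two highest continuous assessment scores.
    best_two = sorted(ca_scores, reverse=True)[:2]

    # Apply the weights as whole-number percentages, then scale once.
    weighted_total = 15 * sum(best_two) + 70 * final_exam_score
    return weighted_total / 100


# Hypothetical learner: CA scores of 60, 80 and 90, final exam score of 70.
print(final_grade([60, 80, 90], 70))  # prints 74.5
```

In this example the lowest score (60) is dropped, so the continuous assessments contribute 12 + 13.5 = 25.5 points and the final examination contributes 49 points.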
