Big Data Technologies: Course Code Level Program

This document provides details about a course on Big Data Technologies including: 1) The course code, level, department, duration, examination scheme and proposed instructors. 2) An overview of the course description which introduces the challenges of large data sets and technologies to address storage, analysis and search. 3) The objective is to introduce students to current big data scenarios and provide knowledge of key technologies to solve big data problems.

Uploaded by

Subodh dhungel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

97 views3 pages

Big Data Technologies: Course Code Level Program

Uploaded by

Subodh dhungel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Big Data Technologies

Course Code Level B.E.

Lecture 3 Program Computer Engineering/Software
Engineering/Computer Science and
Information Technology
Tutorial 1 Year IV
Practical Course Proposed by Sumit Shrestha, Kushum Sharma

Examination
Scheme
Internal Final Examination Remarks
Duration
20 80 3 Hrs Internal Assessment may include minor
tests,
assignments and small project works

Course Description: - The growth of information systems has given rise to large
amount of data which do not qualify as traditional definition of data. This scenario
has given us new possibilities but at same time pose serious challenges. Such
challenges lies in effective storage, analysis and search of such large set of data.
Fortunately, a number of technologies have been developed that answer such
challenges. This course introduces this scenario along with technologies and how
they answer these challenges.

Objective of the Course:- To introduce student to current scenarios of big data

and provide various facets of big data. It also provides them with technologies
playing key role in it and equips them with necessary knowledge to use them for
solving various big data problems in different domains.

Course Contents
1 Introduction to Big Data 8 hrs
1.1 Big Data Overview
1.2 Background of Data Analytics
1.3 Role of Distributed System in Big Data
1.4 Role of Data Scientist
1.5 Current Trend in Big Data Analytics

2 Google File System 7 hrs

2.1 Architecture
2.2 Availability
2.3 Fault tolerance
2.4 Optimization for large scale data

3 Map-Reduce Framework 10 hrs

3.1 Basics of functional programming
3.1.1 Fundamentals of functional programming
3.1.2 Real world problems modeling in functional style
3.2 Map reduce fundamentals
3.3 Data flow (Architecture)
3.4 Real world problems
3.5 Scalability goal
3.6 Fault tolerance
3.7 Optimization and data locality
3.8 Parallel Efficiency of Map-Reduce

4 NoSQL 6 Hrs
4.1 Structured and Unstructured Data
4.2 Taxonomy of NoSQL Implementation
4.3 Discussion of basic architecture of Hbase, Cassandra and MongoDb

5 Searching and Indexing Big Data 7 Hrs

5.1 Full text Indexing and Searching
5.2 Indexing with Lucene
5.3 Distributed Searching with elasticsearch

6 Case Study: Hadoop 5 Hrs

6.1 Introduction to Hadoop Environment
6.2 Data Flow
6.3 Hadoop I/O
6.4 Query languages for Hadoop
6.5 Hadoop and Amazon Cloud

Practical
Student will get opportunity to work in big data technologies using various
dummy as well as real world problems that will cover all the aspects discussed in
course. It will help them gain practical insights in knowing about problems faced
and how to tackle them using knowledge of tools learned in course.
1. HDFS: Setup a hdfs in a single node to multi node cluster, perform basic
file system operation on it using commands provided, monitor cluster
performance
2. Map-Reduce: Write various MR programs dealing with different aspects of it
as studied in course
3. Hbase: Setup of Hbase in single node and distributed mode, write program
to write into hbase and query it
4. Elastic Search: Setup elastic search in single mode and distributed mode,
Define template, Write data in it and finally query it
5. Final Assignment: A final assignment covering all aspect studied in order to
demonstrate problem solving capability of students in big data scenario.

References
1. Jeffrey Dean, Sanjay Ghemawat, MapReduce:Simplified Data Processing on
Large Clusters
2. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, The Google File
System
3. Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah
A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E.
Gruber, Bigtable: A Distributed Storage System for Structured Data
4. http://hadoop.apache.org/
5. http://hbase.apache.org/
6. http://www.elasticsearch.org/guide/
7. Tom White, Hadoop: The Definitive Guide
8. Lars George, Hbase: The Definitive Guide
9. Jason Rutherglen, Ryan Tabora, Jack Krupansky, Lucene and Solr: The
Definitive Guide

Digital Notes of Big Data Analytics Dated 5.1.2024
No ratings yet
Digital Notes of Big Data Analytics Dated 5.1.2024
175 pages
Big Data SV Publication
No ratings yet
Big Data SV Publication
142 pages
Mastering Data Structures and Algorithms with Python: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering Data Structures and Algorithms with Python: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
Bda Sem 7 Book
No ratings yet
Bda Sem 7 Book
188 pages
Bda U1
No ratings yet
Bda U1
80 pages
BDA Techmax (Searchable)
No ratings yet
BDA Techmax (Searchable)
150 pages
Free Oracle 1Z0-071 Certification Sample Questions and Answers - DBExam
0% (2)
Free Oracle 1Z0-071 Certification Sample Questions and Answers - DBExam
4 pages
Big Data Analytics (R18a0529)
No ratings yet
Big Data Analytics (R18a0529)
134 pages
MS Access 2010 Tutorial PDF
100% (2)
MS Access 2010 Tutorial PDF
141 pages
Data Science and Big Data Analytics - Unit - 1
No ratings yet
Data Science and Big Data Analytics - Unit - 1
47 pages
Sap Hana Tutorial
93% (14)
Sap Hana Tutorial
160 pages
Sybca Bigdata
No ratings yet
Sybca Bigdata
97 pages
Big Data Analytics Digital Notes
No ratings yet
Big Data Analytics Digital Notes
119 pages
Introduction of Subject
No ratings yet
Introduction of Subject
28 pages
Data Dictionary
No ratings yet
Data Dictionary
24 pages
Data Base Management Systems Laboratory: Department of Computer Science Engineering
No ratings yet
Data Base Management Systems Laboratory: Department of Computer Science Engineering
74 pages
Big Daa R18 Manual
No ratings yet
Big Daa R18 Manual
84 pages
Syllabus
No ratings yet
Syllabus
7 pages
Oracle ASM Stuff
100% (1)
Oracle ASM Stuff
6 pages
CS8091 BDA Unit1
No ratings yet
CS8091 BDA Unit1
63 pages
COMP9313: Big Data Management
No ratings yet
COMP9313: Big Data Management
79 pages
Ashish Presentation Stage1 Modify LR
No ratings yet
Ashish Presentation Stage1 Modify LR
24 pages
Zero Lecture: Big Data Analytics Lab BCA04206 From: Megha Garg
No ratings yet
Zero Lecture: Big Data Analytics Lab BCA04206 From: Megha Garg
19 pages
L8 Big Data Management en
No ratings yet
L8 Big Data Management en
58 pages
Unit 1
No ratings yet
Unit 1
19 pages
IV Yr II Sem Lesson Plans
No ratings yet
IV Yr II Sem Lesson Plans
19 pages
Big Data Analytics - Sem 7 CVMU
No ratings yet
Big Data Analytics - Sem 7 CVMU
4 pages
Tuning PostgreSQL With Pgbench
No ratings yet
Tuning PostgreSQL With Pgbench
11 pages
Ais Chapter 09
No ratings yet
Ais Chapter 09
14 pages
Seminar Report 5th Sem
No ratings yet
Seminar Report 5th Sem
7 pages
Big Data Syllabus
No ratings yet
Big Data Syllabus
2 pages
CS8091 Bigdata QB 2022-2023 Final
No ratings yet
CS8091 Bigdata QB 2022-2023 Final
6 pages
Big Data Processing: Jiaul Paik
No ratings yet
Big Data Processing: Jiaul Paik
47 pages
BDA - Unit-1
No ratings yet
BDA - Unit-1
24 pages
Siddharth Big Data Report 1000016431
No ratings yet
Siddharth Big Data Report 1000016431
6 pages
Coursera Report Divyansh Sahai CSF443
No ratings yet
Coursera Report Divyansh Sahai CSF443
7 pages
Addendum (Accessing Big Data) Summer 2022 - Somayeh Alizadeh
No ratings yet
Addendum (Accessing Big Data) Summer 2022 - Somayeh Alizadeh
5 pages
Big Data
No ratings yet
Big Data
25 pages
SEM VII BDA Syllabus Theory
No ratings yet
SEM VII BDA Syllabus Theory
4 pages
CC ZG522 Course Handout
No ratings yet
CC ZG522 Course Handout
6 pages
Big Data
No ratings yet
Big Data
41 pages
BD Course Handout
No ratings yet
BD Course Handout
5 pages
Coursera Report Ishaan Taneja 1000016551
No ratings yet
Coursera Report Ishaan Taneja 1000016551
7 pages
Prepared by Richa Btech (Cse) 6 Sem Dav University Jalandhar
No ratings yet
Prepared by Richa Btech (Cse) 6 Sem Dav University Jalandhar
30 pages
Topic 1 Big Data Technologies
No ratings yet
Topic 1 Big Data Technologies
5 pages
Gujarat Technological University: Sr. No. Content Total Hrs % Weightage 1 13
No ratings yet
Gujarat Technological University: Sr. No. Content Total Hrs % Weightage 1 13
3 pages
BDA Syllabus - Sem VII - Mumbai University
No ratings yet
BDA Syllabus - Sem VII - Mumbai University
3 pages
Course Outline of CSE 761 Big Data Analytics
No ratings yet
Course Outline of CSE 761 Big Data Analytics
3 pages
43 - InfyTQ Interview Experience Batch
No ratings yet
43 - InfyTQ Interview Experience Batch
4 pages
Deep Learning with Fast.ai: Definitive Reference for Developers and Engineers
From Everand
Deep Learning with Fast.ai: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
IOT Analytics - AI361
No ratings yet
IOT Analytics - AI361
3 pages
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
From Everand
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
Manish Soni
No ratings yet
Syllabus
No ratings yet
Syllabus
3 pages
Big Data and Analytics Syllabus 2021
No ratings yet
Big Data and Analytics Syllabus 2021
3 pages
4.7.1 Bda-Mba
No ratings yet
4.7.1 Bda-Mba
2 pages
Big Data Technologies Course Outline
No ratings yet
Big Data Technologies Course Outline
2 pages
PROG
No ratings yet
PROG
11 pages
Gujarat Technological University: Prerequisite: Rationale
No ratings yet
Gujarat Technological University: Prerequisite: Rationale
4 pages
r18 - Big Data Analytics - Cse (DS)
0% (1)
r18 - Big Data Analytics - Cse (DS)
1 page
Cap456-Introduction To Big Data
No ratings yet
Cap456-Introduction To Big Data
1 page
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
No ratings yet
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
2 pages
Bigdata
No ratings yet
Bigdata
2 pages
Comprehensive Guide to Glue for Scientific Data Exploration: Definitive Reference for Developers and Engineers
From Everand
Comprehensive Guide to Glue for Scientific Data Exploration: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Big Data Analytics
No ratings yet
Big Data Analytics
3 pages
Create Procedure AS Select From Where AND: 'London' 'WA1 1DP'
No ratings yet
Create Procedure AS Select From Where AND: 'London' 'WA1 1DP'
3 pages
Big Data Analytics Comp Syllabus Sem7
No ratings yet
Big Data Analytics Comp Syllabus Sem7
4 pages
Big Data Analytics With Lab
No ratings yet
Big Data Analytics With Lab
3 pages
Training For Bigdata and Hadoop: #I Background and Introduction
No ratings yet
Training For Bigdata and Hadoop: #I Background and Introduction
9 pages
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
No ratings yet
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
3 pages
Big Data Analytics Syllabus
No ratings yet
Big Data Analytics Syllabus
2 pages
3sem DBMS Manual
No ratings yet
3sem DBMS Manual
41 pages
Few-Shot Machine Learning: Doing More with Less Data
From Everand
Few-Shot Machine Learning: Doing More with Less Data
Robert Johnson
No ratings yet
OAS: Cheat Sheet: File / Directories
No ratings yet
OAS: Cheat Sheet: File / Directories
8 pages
SQL - 03
100% (1)
SQL - 03
29 pages
Open Roles-GN India
No ratings yet
Open Roles-GN India
20 pages
Final Exam Semester 2 - Part I
0% (1)
Final Exam Semester 2 - Part I
19 pages
Data Block Based On Procedures
No ratings yet
Data Block Based On Procedures
5 pages
Assignment 1: A) Create The Tables With The Appropriate Integrity Constraints
No ratings yet
Assignment 1: A) Create The Tables With The Appropriate Integrity Constraints
16 pages
Maintenance of Electronic Records
No ratings yet
Maintenance of Electronic Records
4 pages
Performance Tunning
No ratings yet
Performance Tunning
7 pages
Evaluation Measures For Text Summarization
No ratings yet
Evaluation Measures For Text Summarization
26 pages
Unit 1 - Watermark
No ratings yet
Unit 1 - Watermark
50 pages
Akhil Chelikani - LinkedIn
No ratings yet
Akhil Chelikani - LinkedIn
9 pages
Database Fundamentals: Robert J. Robbins Johns Hopkins University
No ratings yet
Database Fundamentals: Robert J. Robbins Johns Hopkins University
31 pages
SAS Big Data Analytics Expanded
No ratings yet
SAS Big Data Analytics Expanded
4 pages
Answer Key For SQL Questions
No ratings yet
Answer Key For SQL Questions
2 pages
Praktikum Sistem Basis Data: Nama: Wanson Bernando Silalahi NPM: 217510021
No ratings yet
Praktikum Sistem Basis Data: Nama: Wanson Bernando Silalahi NPM: 217510021
7 pages
Test1 1617
No ratings yet
Test1 1617
4 pages
SQL 02 MySQL Design Handout
No ratings yet
SQL 02 MySQL Design Handout
3 pages
Vijaya Bharathi
No ratings yet
Vijaya Bharathi
2 pages

Big Data Technologies: Course Code Level Program

Uploaded by

Big Data Technologies: Course Code Level Program

Uploaded by

Big Data Technologies

Course Code Level B.E.

Objective of the Course:- To introduce student to current scenarios of big data

2 Google File System 7 hrs

3 Map-Reduce Framework 10 hrs

5 Searching and Indexing Big Data 7 Hrs

6 Case Study: Hadoop 5 Hrs

You might also like