Big Data Syllabus

The course on Big Data Technologies aims to equip students with knowledge and skills to address challenges in storing, analyzing, and searching large datasets. It covers topics such as the Google File System, Map-Reduce Framework, NoSQL databases, and practical applications using Hadoop and Elasticsearch. Students will engage in hands-on projects to apply their learning to real-world big data problems.


BIG DATA TECHNOLOGIES

CT 765 07
Course Objectives:
The growth of information systems has given rise to enormous volumes of data that no longer fit the traditional definition of data. This scenario opens up new possibilities but at the same time poses serious challenges, which lie in the effective storage, analysis, and search of such large data sets. Fortunately, a number of technologies have been developed to answer these challenges. This course introduces the big data scenario along with these technologies and how they address the challenges.
In this context, the specific objective of the course is to introduce students to the current big data landscape and its various facets. It also gives them the opportunity to become familiar with the technologies that play a key role in this field and equips them with the knowledge necessary to apply them to big data problems in different domains.

1. Introduction to Big Data [7 hours]


1. Big Data Overview
2. Background of Data Analytics
3. Role of Distributed System in Big Data
4. Role of Data Scientist
5. Current Trend in Big Data Analytics

2. Google File System [7 hours]


1. Architecture
2. Availability
3. Fault tolerance
4. Optimization for large-scale data

3. Map-Reduce Framework [10 hours]


1. Basics of functional programming
2. Fundamentals of functional programming
3. Modeling real-world problems in a functional style
4. Map-Reduce fundamentals (see the word-count sketch after this list)
5. Data flow (architecture)
6. Real-world problems
7. Scalability goal
8. Fault tolerance
9. Optimization and data locality
10. Parallel Efficiency of Map-Reduce
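
To make the map and reduce roles above concrete, here is a minimal word-count sketch in Java against the Hadoop MapReduce API (the classic introductory example; it assumes a Hadoop 2.x/3.x installation, and the input and output paths are supplied on the command line). The mapper emits a (word, 1) pair for every token, the reducer sums the counts per word, and the same reducer is reused as a combiner for local aggregation before the shuffle, which ties into the optimization and data-locality topics listed above.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Mapper: emits (word, 1) for every token in the input line.
  public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
    private final static IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context) throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE);
      }
    }
  }

  // Reducer: sums the counts for each word; also reused as a combiner.
  public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class);   // local aggregation before the shuffle
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));    // HDFS input directory
    FileOutputFormat.setOutputPath(job, new Path(args[1]));  // HDFS output directory (must not exist)
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

A sketch like this would typically be packaged as a jar and submitted to the cluster with hadoop jar wordcount.jar WordCount <input> <output>.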

4. NoSQL [6 hours]
1. Structured and Unstructured Data
2. Taxonomy of NoSQL Implementation
3. Discussion of the basic architecture of HBase, Cassandra, and MongoDB (see the HBase client sketch after this list)
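
As a small illustration of the HBase column-family data model discussed in item 3 above, the following Java sketch (assuming the HBase 1.x/2.x client API and a pre-created table named "students" with a column family "info"; both names are hypothetical) writes a single cell and reads it back. Cassandra and MongoDB expose different client APIs, but the basic read/write pattern is conceptually similar.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.util.Bytes;

public class HBaseHello {
  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();   // reads hbase-site.xml from the classpath
    try (Connection connection = ConnectionFactory.createConnection(conf);
         Table table = connection.getTable(TableName.valueOf("students"))) {   // hypothetical table

      // Write one cell: row key "s001", column family "info", qualifier "name".
      Put put = new Put(Bytes.toBytes("s001"));
      put.addColumn(Bytes.toBytes("info"), Bytes.toBytes("name"), Bytes.toBytes("Asha"));
      table.put(put);

      // Read the same cell back.
      Result result = table.get(new Get(Bytes.toBytes("s001")));
      byte[] name = result.getValue(Bytes.toBytes("info"), Bytes.toBytes("name"));
      System.out.println("name = " + Bytes.toString(name));
    }
  }
}

The table itself would be created beforehand, for example from the HBase shell with: create 'students', 'info'.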

5. Searching and Indexing Big Data [7 hours]


1. Full-text indexing and searching
2. Indexing with Lucene (see the Lucene sketch after this list)
3. Distributed searching with Elasticsearch
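
To ground the full-text indexing and searching topics above, here is a small Lucene sketch (assuming a Lucene 5+ style API; the index directory path and the sample text are illustrative). It indexes one document into an on-disk inverted index and then runs a parsed query against it; Elasticsearch distributes the same kind of Lucene indices across a cluster as shards.

import java.nio.file.Paths;

import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.TextField;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.queryparser.classic.QueryParser;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.TopDocs;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;

public class LuceneHello {
  public static void main(String[] args) throws Exception {
    StandardAnalyzer analyzer = new StandardAnalyzer();
    Directory dir = FSDirectory.open(Paths.get("lucene-index"));   // hypothetical on-disk index path

    // Index a single document with one analyzed, stored full-text field.
    try (IndexWriter writer = new IndexWriter(dir, new IndexWriterConfig(analyzer))) {
      Document doc = new Document();
      doc.add(new TextField("body", "the google file system stores large files in chunks", Field.Store.YES));
      writer.addDocument(doc);
    }

    // Search the index with a parsed full-text query.
    try (DirectoryReader reader = DirectoryReader.open(dir)) {
      IndexSearcher searcher = new IndexSearcher(reader);
      Query query = new QueryParser("body", analyzer).parse("file system");
      TopDocs hits = searcher.search(query, 10);
      for (ScoreDoc hit : hits.scoreDocs) {
        System.out.println(searcher.doc(hit.doc).get("body") + "  score=" + hit.score);
      }
    }
  }
}
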
6. Case Study: Hadoop [8 hours]
1. Introduction to Hadoop Environment
2. Data Flow
3. Hadoop I/O
4. Query languages for Hadoop
5. Hadoop and Amazon Cloud

Practical
Students will have the opportunity to work with big data technologies on both dummy and real-world problems, covering all the aspects discussed in the course. This will give them practical insight into the problems encountered in practice and how to tackle them using the tools learned in the course.
1. HDFS: Set up HDFS, from a single-node to a multi-node cluster, perform basic file system operations on it using the provided commands, and monitor cluster performance (see the HDFS sketch after this list)
2. Map-Reduce: Write various MR programs dealing with the different aspects studied in the course
3. HBase: Set up HBase in single-node and distributed mode, and write a program that writes into HBase and queries it
4. Elasticsearch: Set up Elasticsearch in single-node and distributed mode, define a template, write data into it, and finally query it
5. Final Assignment: A final assignment covering all aspects studied, in order to demonstrate the students' problem-solving capability in a big data scenario.
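
For practical 1, the following sketch shows basic HDFS file operations through the Hadoop FileSystem Java API (assuming fs.defaultFS points to the cluster in core-site.xml; the directory and file names are hypothetical). The same operations are available from the command line via hdfs dfs.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsHello {
  public static void main(String[] args) throws Exception {
    // Picks up fs.defaultFS from core-site.xml on the classpath,
    // e.g. an hdfs:// URI for a single-node cluster (assumed here).
    Configuration conf = new Configuration();
    try (FileSystem fs = FileSystem.get(conf)) {

      Path dir = new Path("/user/student/demo");        // hypothetical HDFS directory
      fs.mkdirs(dir);

      // Copy a local file into HDFS, then list the directory contents.
      fs.copyFromLocalFile(new Path("notes.txt"), dir); // hypothetical local file
      for (FileStatus status : fs.listStatus(dir)) {
        System.out.println(status.getPath() + "  " + status.getLen() + " bytes");
      }
    }
  }
}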

References
1. Jeffrey Dean and Sanjay Ghemawat, MapReduce: Simplified Data Processing on Large Clusters
2. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, The Google File System
3. http://wiki.apache.org/hadoop/

Evaluation Scheme:
The questions will cover all the chapters of the syllabus. The evaluation scheme will be as indicated in the
table below:

Chapters    Hours    Marks Distribution*

1           7        12
2           7        13
3           10       18
4           6        11
5           7        13
6           8        13

Total       45       80
*There could be a minor deviation in Marks distribution
