0% found this document useful (0 votes)
247 views7 pages

Seminar Report 5th Sem

This document is a seminar report submitted by Mirtunjaya Goswami on the topic of "Big-Data Computing". It provides an overview of the 8 week MOOC course conducted under the guidance of Prof. Rajiv Misra. The course covered introductions to big data concepts, enabling technologies like Hadoop and Spark, storage platforms, streaming platforms, and applications of big data in machine learning, graph processing, and other domains. It aimed to provide understanding of big data problems, systems, and techniques used in today's technologies. The report acknowledges the support and guidance of the external supervisor and others involved in helping complete the course.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
247 views7 pages

Seminar Report 5th Sem

This document is a seminar report submitted by Mirtunjaya Goswami on the topic of "Big-Data Computing". It provides an overview of the 8 week MOOC course conducted under the guidance of Prof. Rajiv Misra. The course covered introductions to big data concepts, enabling technologies like Hadoop and Spark, storage platforms, streaming platforms, and applications of big data in machine learning, graph processing, and other domains. It aimed to provide understanding of big data problems, systems, and techniques used in today's technologies. The report acknowledges the support and guidance of the external supervisor and others involved in helping complete the course.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

MOOC Seminar Report

on

Big-Data Computing

(CSE V Semester MOOC Seminar) 2020-2021

Submitted to: Submitted by:

Faculty Name: Name: Mirtunjaya Goswami

Mr. Akash Chauhan Roll. No: 35

Course /Branch:
B.tech/C.S E (DS AI)

Section: K

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING


GRAPHIC ERA HILL UNVERSITY ,DEHRADUN
CERTIFICATE

Certifies that Mr. Mirtunjaya Goswami (Roll No.- 1918471) has completed
MOOC Seminar on the topic “Big-Data Computing” from NPTEL under the
guidance of Prof. Rajiv Misra for fulfillment of CSE V Semester MOOC
Seminar (SCS-501) in Graphic Era Hill University, Dehradun. Students have
successfully Completed this Course as best of my knowledge.

Date: 24/10/21

Faculty Name Mr. Akash Chauhan

CC-CSE-k-V- Sem

CSE Department
GEHU, Dehradun
CERTIFICATE
ACKNOWLEDGMENT

I would like to thank particularly my External Supervisor Prof. Rajiv


Misra for his patience, support and encouragement throughout the completion
of this Course.
At last but not the least I greatly indebted to all other people who directly

or indirectly helped me during this course.

Mr. MIRTUNJAYA GOSWAMI


Roll No- 1918471 CSE-K- V-Sem
Session: 2020-2021 GEHU,
Dehradun
TABLE OF CONTENTS

COURSE LAYOUT
Week 1 : Introduction to Big Data
Week 2 : Introduction to Enabling Technologies
for Big Data
Week 3 : Introduction to Big Data Platforms
Week 4 : Introduction to Big Data Storage
Platforms for Large Scale Data Storage
Week 5 : Introduction to Big Data Streaming
Platforms for Fast Data
Week 6 : Introduction to Big Data Applications
(Machine Learning)
Week 7 : Introduction of Big data Machine
learning with Spark
Week 8 : Introduction to Big Data Applications
(Graph Processing)
Big Data Computing
In today's fast-paced digital world , the incredible amount of data
being generated every minute has grown tremendously from sensors
used to gather climate information, posts to social media sites, digital
pictures and videos, purchase transaction records, and GPS signals
from cell phone to name a few. This amount of large data with
different velocities and varieties is termed as big data and its
analytics enables professionals to convert extensive data through
statistical and quantitative analysis into powerful insights that can
drive efficient decisions. This course provides an in-depth
understanding of terminologies and the core concepts behind big
data problems, applications, systems and the techniques, that
underlie today's big data computing technologies. It provides an
introduction to some of the most common frameworks such as
Apache Spark, Hadoop, MapReduce, Large scale data storage
technologies such as in-memory key/value storage systems, NoSQL
distributed databases, Apache Cassandra, HBase and Big Data
Streaming Platforms such as Apache Spark Streaming, Apache
Kafka Streams that has made big data analysis easier and more
accessible. And while discussing the concepts and techniques, we
will also look at various applications of Big Data Analytics using
Machine Learning, Deep Learning, Graph Processing and many
others. The course is suitable for all UG/PG students and practicing
engineers/ scientists from the diverse fields and interested in learning
about the novel cutting edge techniques and applications of Big Data
Computing.

PREREQUISITES : Data Structure & Algorithms, Computer


Architecture, Operating System, Database Management Systems

INDUSTRY SUPPORT : Companies like Amazon, Microsoft,


Google, IBM, Facebook
REFERENCES
Text Book:

Bart Baesens, Analytics in a Big Data World: The Essential


Guide to Data Science and its Applications, Wiley, 2014

Reference Books:

1. Dirk Deroos et al., Hadoop for Dummies, Dreamtech


Press, 2014.
2. Chuck Lam, Hadoop in Action, December, 2010.
3. Leskovec, Rajaraman, Ullman, Mining of Massive
Datasets, Cambridge University Press.
4. I.H. Witten and E. Frank, Data Mining: Practical Machine
learning tools and techniques.
5. Erik Brynjolfsson et al., The Second Machine Age: Work,
Progress, and Prosperity in a Time of Brilliant
Technologies, W. W. Norton & Company, 2014.

You might also like