CSE704 Data Analytics Syllabus Theory

This document outlines a course on data analytics. The course aims to teach the fundamental concepts of big data and analytics, tools for working with big data, stream computing, and research that requires integrating large amounts of data. It is divided into four modules covering an introduction to big data, clustering and classification algorithms, association and recommendation systems, and NoSQL data management and visualization. Students will learn to work with big data tools, analyze data using clustering and classification, apply mining algorithms and recommendation systems, perform analytics on data streams, and manage NoSQL databases. Assessment includes attendance, class tests, seminars/vivas/quizzes, assignments, and an end semester exam.

B.Tech. (CSE) 2020-24 (Based on AICTE)


DATA ANALYTICS
Course Code: CSE 704 Credit Units: 03
Total Hours: 30

Course objectives:
• To know the fundamental concepts of big data and analytics.
• To explore tools and practices for working with big data.
• To learn about stream computing.
• To know about the research that requires the integration of large amounts of data.

Course Contents:

Module I: Introduction to Big Data: (8 Hours)


Evolution of Big Data – Best Practices for Big Data Analytics – Big Data Characteristics – Validating – The
Promotion of the Value of Big Data – Big Data Use Cases – Characteristics of Big Data Applications –
Perception and Quantification of Value – Understanding Big Data Storage – A General Overview of
High-Performance Architecture – HDFS – MapReduce and YARN – MapReduce Programming Model

Module II: Clustering and Classification: (6 Hours)


Analytical Theory and Methods: Overview of Clustering – K-means – Use Cases – Overview of the Method –
Determining the Number of Clusters – Diagnostics – Reasons to Choose and Cautions – Classification:
Decision Trees – Overview of a Decision Tree – The General Algorithm – Decision Tree Algorithms –
Evaluating a Decision Tree

Module III: Association and Recommendation System: (8 Hours)


Analytical Theory and Methods: Association Rules – Overview – Apriori Algorithm – Evaluation of Candidate
Rules – Applications of Association Rules – Finding Association and Finding Similarity – Introduction to Streams
Concepts – Stream Data Model and Architecture – Stream Computing – Sampling Data in a Stream – Filtering
Streams – Counting Distinct Elements in a Stream – Estimating Moments – Case Studies – Real-Time Sentiment
Analysis – Using Graph Analytics for Big Data: Graph Analytics

Module IV: NoSQL Data Management for Big Data and Visualization: (8 Hours)
NoSQL Databases: Schema-less Models: Increasing Flexibility for Data Manipulation – Key-Value Stores –
Document Stores – Tabular Stores – Object Data Stores – Graph Databases – Hive – Sharding – HBase –
Analyzing Big Data with Twitter – Big Data for E-Commerce – Big Data for Blogs – Review of Basic Data Analytic
Methods using R.

Course Outcomes:
Upon completion of the course, the students will be able to:
• Work with big data tools and their analysis techniques.
• Analyze data by utilizing clustering and classification algorithms.
• Learn and apply different mining algorithms and recommendation systems for large volumes of data.
• Perform analytics on data streams.
• Learn NoSQL databases and their management.

Examination Scheme:

Components       A    CT    S/V/Q/HA    ESE
Weightage (%)    5    15    10          70

A: Attendance, CT: Class Test, S/V/Q/HA: Seminar/Viva/Quiz/Home Assignment, ESE: End Semester
Examination.

Text & References:
Text:
• Anand Rajaraman and Jeffrey David Ullman, “Mining of Massive Datasets”, Cambridge University
Press, 2012.
• David Loshin, “Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools,
Techniques, NoSQL, and Graph”, Morgan Kaufmann/Elsevier Publishers, 2013.
References:
• EMC Education Services, “Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing
and Presenting Data”, Wiley Publishers, 2015.
• Bart Baesens, “Analytics in a Big Data World: The Essential Guide to Data Science and its
Applications”, Wiley Publishers, 2015.
• Dietmar Jannach and Markus Zanker, “Recommender Systems: An Introduction”, Cambridge
University Press, 2010.
• Kim H. Pries and Robert Dunnigan, “Big Data Analytics: A Practical Guide for Managers”, CRC
Press, 2015.
• Jimmy Lin and Chris Dyer, “Data-Intensive Text Processing with MapReduce”, Synthesis Lectures on
Human Language Technologies, Vol. 3, No. 1, Pages 1-177, Morgan & Claypool Publishers, 2010.
