0% found this document useful (0 votes)
14 views3 pages

CourseCurriculum (8) - 1

The document outlines the curriculum for the undergraduate course CSIT363, titled 'Big Data and Data Analytics', which focuses on real-time data analytics, heterogeneous data processing, and knowledge discovery for machine learning and AI. It includes course objectives, detailed syllabus modules, learning outcomes, assessment methods, practical lab activities, and recommended readings. The course aims to equip students with essential skills in big data technologies and analytics tools.

Uploaded by

adeebkhan072004
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views3 pages

CourseCurriculum (8) - 1

The document outlines the curriculum for the undergraduate course CSIT363, titled 'Big Data and Data Analytics', which focuses on real-time data analytics, heterogeneous data processing, and knowledge discovery for machine learning and AI. It includes course objectives, detailed syllabus modules, learning outcomes, assessment methods, practical lab activities, and recommended readings. The course aims to equip students with essential skills in big data technologies and analytics tools.

Uploaded by

adeebkhan072004
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Course Curriculam

Course Code: CSIT363 Credit Units L T P/S SW AS/DS FW No. of PSDA Total Credit Unit
Course Level UG 3 0 2 2 0 0 0 5
Course Title Big Data and Data Analytics

Course
Description :

Course Objectives :

SN. Objectives
1 Targeting the futuristic requirement of Realtime Data Analytics.
2 Matching with the pace of availability of heterogeneous data in the format of structured and unstructured.
3 Generating the Knowledgebase discovery that will be useful for next generation of Machine Learning and Artificial Intelligence.
4 Introducing the core concepts of Big Data and Data mining, its techniques, implementation, challenges, and benefits.

Pre-Requisites : General

SN. Course Code Course Name

Course Contents / Syllabus :

SN. Module Descriptors / Topics Weightage


Module I Introduction to Big Data, Characteristics of Big Data and its scalability, Types of Digital Data, Big Data Analytics,
1 Introduction to The Design of HDFS, HDFS Concepts, Command Line Interface, Hadoop file system interfaces, Data flow, Data 20.00
Big Data Ingest with Flume and Scoop and Hadoop archives.
Large Scale Data Processing, ETL and Data Ingestion, NoSQL Databases, Hive and Querying. History of
Module II
Hadoop, Apache Hadoop, Analyzing Data with Hadoop, Hadoop Streaming, Hadoop Echo System, IBM Big
Advanced
2 Data Strategy, Introduction to Infosphere Big insights and Big Sheets. Anatomy of a Map Reduce Job Run, 20.00
Concepts of Big
Failures, Job Scheduling, Shuffle and Sort, Task Execution, Map Reduce Types and Formats, Map Reduce
Data
Features.
Objectives of Data Mining, Knowledge Discovery Process, Tools of Data Mining, Type of DM, Text Mining,
Module III Data
Spatial Databases, Web Mining. Case studies and Applications in telecommunications industry, retail, target
3 Mining and 18.00
marketing, fraud detection and protection, Traffic Surveillance, Health Care, Drug-Discovery, Science,
Applications
ecommerce, Banking and Finance.
Data preprocessing, Data Mining Techniques: Statistical techniques, Characterization and discrimination,
Module IV
Association and market basket analysis, Classification and Prediction, Decision trees, Neural Networks,
4 Algorithms and 22.00
Bayesian Classification, Association rules, Apriori, FP Tree, Introduction to Genetic Algorithm, Cluster analysis,
Implementations
Automatic Cluster Detection, Outlier analysis
Realtime Data Analytics, Framework and applications of Hadoop and MapReduce:. Pig : Introduction to PIG,
Module V Data
Execution Modes of Pig, Comparison of Pig with Databases, Grunt, Pig Latin, User Defined Functions, Data
Analytics and
5 Processing operators. Hive : Hive Shell, Hive Services, Hive Metastore, Comparison with Traditional Databases, 20.00
Applications of
HiveQL, Tables, Querying Data and User Defined Functions. Hbase: HBasics, Concepts, Clients, Example,
Big Data
Hbase Versus RDBMS. Big SQL: Introduction and applications

Course Learning Outcomes :

SN. Course Learning Outcomes


Pedagogy for Course Delivery :

SN. Pedagogy Methods


1 Course content will be delivered online using power point presentation.
2 Assignment and tutorial will be discussed and evaluated using online mode.
3 Reviewing relevant, previously learned topics.
4 Presenting the new information by linking it to previous case studies.
5 Providing learning guidance and assignments.
6 Providing time for practice, problem solving sessions and feedback.
7 Taking tests and quiz on a regular basis.

Theory /VAC / Architecture Assessment (L,T & Self Work): 80.00 Max : 100

Attendance+CE+EE : 5+35+60

SN. Type Component Name Marks


1 Attendance 5.00
2 End Term Examination (OMR) 60.00
3 Internal CLASS TEST 10.00
4 Internal HOME ASSIGNMENT 20.00
5 Internal Viva 5.00

Lab/ Practical/ Studio/Arch. Studio/ Field Work Assessment : 20.00 Max : 100

Attendance+CE+EE : 5+35+60

SN. Type Component Name Marks


1 Attendance 5.00
2 External PRACTICAL 40.00
3 External Viva 20.00
4 Internal CLASS TEST (PRACTICAL BASED) 10.00
5 Internal PERFORMANCE 10.00
6 Internal Viva 5.00
7 Internal PRACTICAL / LAB RECORDS 10.00

Lab/ Practical details, if applicable :

SN
Lab / Practical Details
.
1 1. Implement the following Data structures in Java i) Linked Lists ii) Stacks iii) Queues iv) Set v) Map.
2 2. Perform setting up and Installing Hadoop in its three operating modes: a) Standalone, Pseudo distributed, Fully distributed.
3. Implement the following file management tasks in Hadoop: • Adding files and directories • Retrieving files • Deleting files Hint: A typical Hadoop
3
workflow creates data files (such as log files) elsewhere and copies them into HDFS using one of the above command line utilities.

4 4. Write a Map Reduce program that mines weather data. Weather sensors collecting data every hour at many locations across the globe gather a
large volume of log data, which is a good candidate for analysis with MapReduce, since it is semi structured and record oriented.
5 5. Implement Matrix Multiplication with Hadoop Map Reduce.
6 6. Install and Run Pig then write Pig Latin scripts to sort, group, join, project, and filter your data.
7 7. Install and Run Hive then use Hive to create, alter, and drop databases, tables, views, functions, and indexes.
8. Data Preprocessing Using Weka: You are expected to explore, observe and understand the purpose of each button under the preprocess panel
8
after loading the ARFF file you prepared in this lab.
9 9. Try to interpret what you observe using a different ARFF file, weather.arff, provided with WEKA Tool (Open Software).
10 10. Demonstrate and analyze the result of following Data mining techniques using Weka on the data sets provided with WEKA
11 11. Classification (e.g., BayesNet, KNN, C4.5 Decision Tree, Neural Networks, SVM),
12 12. Regression (e.g., Linear Regression, Isotonic Regression, SVM for Regression),
13 13. Clustering (e.g., Simple K-means, Expectation Maximization (EM)),
14 14. Association rules (e.g., Apriori Algorithm, Predictive Accuracy, Confirmation Guided),
15 15. Feature Selection (e.g., Cfs Subset Evaluation, Information Gain, Chi-squared Statistic), and
16 16. Visualization (e.g., View different two-dimensional plots of the data).

List of Professional skill development activities :

No.of PSDA : 3
SN. PSDA Point
1 Practice and develop skills on Microsoft Azure.
2 Practice and develop data analytics skills on Weka Tool.
3 Practice and develop skills on AWS framework.

Text & References :

SN. Type Title/Name Description ISBN/ URL


Tom White “Hadoop: The Definitive Guide”
1 Book
Third Edit on, O’reily Media, 2012.
Seema Acharya, Subhasini Chellappan,
2 Book
"Big Data Analytics" Wiley 2015.
“Mastering Data Mining: The Art and
3 Book Science of Customer Relationship
Management”, by Berry and Lin o
“Data Mining: Concepts and Techniques”,
4 Book J. Han, M. Kamber, Academic Press,
Morgan Kaufmann Publisher
Jay Liebowitz, “Big Data and Business
5 Book Analytics” Auerbach Publications, CRC
press (2013)
Anand Rajaraman and Jef rey David
6 Book Ulman, “Mining of Massive Datasets”,
Cambridge University Press, 2
Michael Mineli, Michele Chambers, Ambiga
7 Book Dhiraj, "Big Data, Big Analytics: Emerging
Business Intelli
Bill Franks, “Taming the Big Data Tidal
8 Book Wave: Finding Opportunities in Huge Data
Streams with Advanc

You might also like