M.SC - BigData Syllabus
M.SC - BigData Syllabus
DR VISHWANATH KARAD
MIT - WORLD PEACE UNIVERSITY
FACULTY OF SCIENCE
BATCH – 2018-19
Dr.SudhirGavhane
Dean, LASC
PROGRAMME STRUCTURE
Preamble:
Big Data Analytics is required to deal with the problems faced by industry today. The techniques
and tools are used to solve problems from a wide variety of Industries such as manufacturing,
services, retail, banking and finance, sports, pharmaceuticals, and aerospace etc.
Big Data Analytics is interdisciplinary and is required to analyse ever growing large data ( growing
by volumn, velocity and variety) applying techniques like data mining, machine learning, and deep
learning from computer science, statistics and maths.
Big Data Analytics is required to cope up with rapid changes in both, domain knowledge and
technology. It is one of the fastest growing and most promising technologies
First year Provides foundation of Big Data Technology, Maths and Statistics including
programming languages. Programme includes technology such as Hadoop, techniques such data
mining, and computer programing, maths and statistics subjects that will provide the foundation for
students.
Second year will include subjects belonging to the chosen track in his/her own interest relevant to
Big data Analytics. It will also include advanced topics and technologies in Big Data. There will be
mini project and Internship to get industrial exposure to the students.
Dr.SudhirGavhane
Dean, LASC
Vision and Mission of the Programme
Vision:
To contribute to the society through excellence in scientific and knowledge-based education
utilizing the potential of computer science with a deep passion for wisdom, culture and
values.
Mission:
Big Data Analytics is aimed to offer a thorough professional training which prepare
students to embark on Big Data Analytics careers which is one of the fastest growing
technologies. They are also provided a very good foundation for further study at PhD level.
Prepare and equip students for opportunities in ever changing technology with hands-on
industrial training.
Transform the students to become globally competent professionals through international
training/internship.
Nurture the creativity and inculcate entrepreneurial skills among the students.
Dr.SudhirGavhane
Dean, LASC
Programme Specific Outcomes
A graduate with a M.Sc. in Big Data Analytics will have the ability to communicate
computer science concepts, designs, and solutions effectively and professionally
This course is aimed to offer training which prepare students to embark on Big Data
Analytics careers which is one of the fastest growing technologies. They are also provided
a very good foundation for further study at PhD level.
Prepare and equip students for opportunities in ever changing technology with hands-on
industrial training.
Transform the students to become globally competent professionals through internship.
Nurture the creativity and inculcate entrepreneurial skills among the students.
Project work gives students hands on experience in solving a real world problem.
The Syllabus also develops requisite professional skills and problem solving abilities for
pursuing a career in Software Industry.
Dr.SudhirGavhane
Dean, LASC
Programme Structure:
(a) Programme duration: 2 years full time.
(b) System followed: Trimester
(c) Credits System:
(i) Per term or per year, as applicable
(ii) Total in the programme, as applicable
(d) Credits for activities other than academics: NA
(e) Internship: Full time three months Industrial training should be completed.
(f) Assessment Criteria: Minimum 50% credits of first year are required to take
admission in second year.
(g) Branches or Specialisations: NA
(h) Mandatory Attendance to appear for examination:
It is expected on the part of the student to attend each and every Lecture,
Tutorial, and Laboratory practical sessions in a course for the academic
excellence. However, due to any contingencies, the attendance requirement will
be a minimum of 90% of the classes scheduled/ held.
(j) Medium of Instruction and Examination: English
(k) Eligibility criteria for admission to the programme: B.Sc.(CS), BCS, B.Sc.(IT),
BCA, BE-IT, Comp., E&TC with 50% of Marks (45% marks aggregate in case of
candidate backward class categories and persons with disability belonging to
Maharashtra state only)
Dr.SudhirGavhane
Dean, LASC
M.Sc. Big Data Analytics
2017-18
A. DefinitionofCredit:-
B. Credits:-
Total number of credits for two years Post Graduate M.Sc. Programme would be 120.
S. Category SuggestedBreakupof
No. Credits(Total175)
Humanities andSocialSciences and Peace Programmes
1 10
includingManagementcourses
Professionalcorecourses including
2 84
Laboratory/Mini Project Work
ProfessionalElectivecourses
3 06
Total 120
Coursecode Definitions
L Lecture
T Tutorial
WP Humanities andSocialSciences and Peace Programs
MBD includingManagementcourses
M.Sc.(Big Data Analytics)
E. Grading Scheme:
Dr.SudhirGavhane
Dean, LASC
M. Sc. Big Data Analytics (First Year) (Batch 2017-18)
Trimester – I
Type: Core **Assessment Marks are valid only if Attendance criteria are met
Weekly Teaching Hours:25 *CCA: Class Continuous Assessment
Total Credits: First Year M.Sc. Big Data Analytics Trimester I:20 *LCA: Laboratory Continuous Assessment
Dr.SudhirGavhane
Dean, LASC
M. Sc. Big Data Analytics (First Year) (Batch 2017-18)
Trimester – II
Type: Core **Assessment Marks are valid only if Attendance criteria are met
Dr.SudhirGavhane
Dean, LASC
M. Sc. Big Data Analytics (First Year) (Batch 2017-18)
Trimester – III
Type: Core **Assessment Marks are valid only if Attendance criteria are met
Weekly Teaching Hours: 25 *CCA: Class Continuous Assessment
Total Credits: First Year M.Sc. Big Data Analytics Trimester III:20 *LCA: Laboratory Continuous Assessment
Total First Year M.Sc. Big Data Analytics Credits: 60
Dr.SudhirGavhane
Dean, LASC
M. Sc. Big Data Analytics(Second Year) (Batch 2017-18)
Trimester – I
Type: Core/ Elective **Assessment Marks are valid only if Attendance criteria are met
Dr.SudhirGavhane
Dean, LASC
M. Sc. Big Data Analytics (Second Year) (Batch 2017-18)
Trimester – II
Type: Core/ Elective **Assessment Marks are valid only if Attendance criteria are met
Dr.SudhirGavhane
Dean, LASC
M. Sc. Big Data Analytics (Second Year) (Batch 2017-18)
Trimester – III
MIT- Core
WPU-
1 Full Time Industrial Training 4 3 50 50 100
MS-
2301
Total : 4 3 50 50 100
Type: Core **Assessment Marks are valid only if Attendance criteria are met
Weekly Teaching Hours: 15 *CCA: Class Continuous Assessment
Total Credits: Second Year M.Sc. Big Data Analytics Trimester III:20 *LCA: Laboratory Continuous Assessment
Dr.SudhirGavhane
Dean, LASC
ElectiveCourses:
Dr.SudhirGavhane
Dean, LASC
`
COURSE STRUCTURE
Physical design of a database, Concepts of algorithm design and analysis, Basic understanding of:
Software engineering principles and techniques, Probability and statistics – Bayesian theory,
regression, hypothesis testing
Course Objectives:
1. To understand the structure of Data Warehouse
2. To understand different data pre-processing techniques.
3. To understand basic descriptive and predictive data mining techniques.
4. To use data mining tool on different data sets
5. To understand Classification algorithms
6. To understand Prediction algorithms.
7. To understand Clustering algorithms.
8. To use data mining tool on different data sets
CourseOutcomes:
The student will get knowledge of:
Data processing and data quality.
Modelling and design of data warehouses.
Basic and advanced concepts of algorithms for data mining.
Data mining tool and practical experience of applying data mining algorithms
CourseContents:
Introduction to Data Mining
Basic concepts of data mining, Types of Data to be mined.
Introduction to Data Warehouse
Data Warehouse and DBMS, Architecture of Data Warehouse
Data pre-processing
Need Data pre-processing, Attributes and Data types
Data Mining Techniques: Association Rule Mining
Basic idea: item sets, Frequent Item-sets
Data Mining Techniques: Classification
Definition of Classification, Decision tree Induction: Information gain, gain ratio, Gini Index
Data Mining Techniques: Prediction
Definition of Prediction, Linear regression
Data Mining Techniques: Clustering
Definition of Clustering, Partitioning Methods
Performance Measures
Precision, recall, F-measure
Problem solving with R or Weka: filters, Discretization, mining association rules, decision trees,
Prediction, k-means
LearningResources:
Reference Books:
1. Data Mining: Concepts and Techniques, Han, Elsevier ISBN:9789380931913/
9788131205358
2. Margaret H. Dunham, S. Sridhar, Data Mining – Introductory and Advanced Topics, Pearson
Education
3. Data warehousing: fundamentals fot IT professionals 3rd edition , Kimball, Wiley Publication
4. Ian H.Witten, Eibe Frank Data Mining: Practical Machine Learning Tools and Techniques,
Elsevier/(Morgan Kauffman), ISBN:9789380501864
5. Introduction to Data Mining (2005) By Pang-Ning Tan, Michael Steinbach, Vipin Kumar
Addison Wesley ISBN: 0-321-32136-7
6. [Research-Papers]: Some of the relevant research papers that contain recent results and
developments in data mining field
Syllabus:
Module Workload in Hrs
Contents
No. Theory Lab Assess
Introduction to Data Mining: Basic concepts of data mining,
Types of Data to be mined, Stages of the Data Mining
1 4 - -
Process, Data Mining Techniques, Knowledge Discovery in
Databases, Data Mining Issues, Applications of Data Mining
Introduction to Data Warehouse: Data Warehouse and DBMS
Architecture of Data Warehouse, Multidimensional data model,
2 Concepts of OLAP and Data Cube, OLAP 5 - -
operations, Dimensional Data Modelling- Star, Snow flake
schemas
Data pre-processing: Need Data pre-processing, Attributes and
Data types, Statistical descriptions of Data, Handling missing
3 Data, Data sampling, Data cleaning, Data Integration and 6 - -
transformation, Data reduction, Discretization and generating
concept hierarchies
Data Mining Techniques: Association Rule Mining: Basic
4 idea: item sets, Frequent Item-sets, Association Rule Mining, 4 - -
Generating item sets and rules efficiently, FP growth algorithm
Data Mining Techniques: Classification: Definition of
Classification, Decision tree Induction: Information gain, gain
ratio, Gini Index, Issues: Over-fitting, tree pruning methods,
missing values, continuous classes, Classification and Regression
5 9 - -
Trees (CART), Bayesian Classification: Bayes Theorem, Naïve
Bayes classifier, Bayesian Networks, Linear classifiers,
Least squares, SVM classifiers, Lazy Learners (or Learning from
Your Neighbors)
Data Mining Techniques: Prediction: Definition of Prediction
6 Linear regression, Non-linear regression, Logistic regression 3 - -
COURSE STRUCTURE
Course Outcomes:
Course Contents:
Learning Resources:
Reference Books:
1. Jacek Błażewicz, et al., “Handbook on parallel and distributed processing”, Springer
Science & Business Media, 2013.
2. Andrew S. Tanenbaum, and Maarten Van Steen, “Distributed Systems: Principles and
Paradigms”. Prentice-Hall, 2007.
3. George F.Coulouris, Jean Dollimore, and Tim Kindberg, “Distributed systems: concepts
and design”, Pearson Education, 2005.
4. Gregor Kosec and Roman Trobec, “Parallel Scientific Computing: Theory, Algorithms, and
Applications of Mesh Based and Meshless Methods”, Springer, 2015.
Supplementary Reading:
1. Quinn, M. J., Parallel Computing: Theory and Practice (McGraw-Hill Inc.).
2. Gibbons, A., W. Rytter, Efficient Parallel Algorithms (Cambridge Uni. Press).
3. Shameem A and Jason, Multicore Programming, Intel Press, 2006
Weblinks:
1 https://www.tutorialspoint.com/parallel_algorithm/parallel_algorithm_introduction.htm
Syllabus:
COURSE STRUCTURE
Some basic knowledge and experience of Java (Jars, Array, Classes, Objects, etc.)
Course Objectives:
Course Outcomes:
Course Contents:
1. Introduction to big data: Introduction, distributed file system, Big Data and its importance,
Drivers, Big data analytics, Big data applications. Algorithms, Matrix-Vector, Multiplication by
Map Reduce.
2. Introduction to HADOOP: Big Data, Apache Hadoop & Hadoop Ecosystem, MapReduce,
Data Serialization.
4. HADOOP ecosystem and yarn: Hadoop ecosystem components, Hadoop 2.0 New Features
NameNode High Availability, HDFS Federation, MRv2, YARN, Running MRv1 in YARN.
Learning Resources:
Reference Books:
1. Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, “Professional Hadoop Solutions”,
Wiley, ISBN: 9788126551071, 2015.
2. Chris Eaton, Dirk deroos et al. “Understanding Big data ”, McGraw Hill, 2012.
3. Tom White, “HADOOP: The definitive Guide”, O Reilly 2012.
4. MapReduce Design Patterns (Building Effective Algorithms & Analytics for Hadoop) by
Donald Miner & Adam Shook
Supplementary Reading:
Weblinks:
https://cloudthat.in/course/processing-bigdata-with-apache-hadoop/
Assessment Scheme:
Syllabus:
HADOOP Architecture
Hadoop Architecture, Hadoop Storage: HDFS, Common Hadoop
Shell commands, Anatomy of
File Write and Read, NameNode, Secondary NameNode, and
DataNode, Hadoop MapReduce
3 12 - -
Paradigm, Map and Reduce tasks, Job, Task trackers - Cluster
Setup – SSH &Hadoop
Configuration – HDFS Administering –Monitoring &
Maintenance.
COURSE STRUCTURE
Course Objectives:
1. To understand why Python is a useful scripting language for developers.
2. To learn how to design and program Python applications.
3. To learn how to use lists, tuples, and dictionaries in Python programs.
4. To learn how to identify Python object types.
5. To define the structure and components of a Python program.
6. To learn how to write loops and decision statements in Python.
Course Outcomes:
1. Students will demonstrate the ability to solve problems using system approaches, critical
and innovative thinking, and technology to create solutions.
2. Students will design, develop, and present their final project.
3. Students will understand the purpose and the process of code reviews.
4. Students will be able to create scripts in Python for Autodesk's Maya.
5. Students will understand and will be able to articulate and apply the principles of 3D
graphics
Course Contents:
Introduction to Python
Introduction to python language.
String Manipulation
Introduction to various operations on strings.
Functions
Introduction to functions in python.
Modules
Introduction to module, package in python.
Input-Output
Handling of inputs in python
Regular expressions
Use of regular expression in python
CGI
Introduction to CGI and cookies.
Database
Handling of database in python.
Learning Resources:
Reference Books:
1. Dive into Python by Mark Pilgrim
2. Programming Python by Mark Lutz, O’Reilly Media
3. Python Programming: An Introduction to Computer Science” by John Zelle
Supplementary Reading:
1. Python Testing Cookbook by Greg L. Turnquist
Web Resources:
1. www.tutorialspoint.com/python/
2. docs.python.org/3/tutorial/
3. www.learnpython.org
4. www.guru99.com/python-tutorials.html
5. www.tutorialspoint.com/cprogramming/
6. www.learn-c.org/
7. www.w3schools.in/c-tutorial/
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
Assignments Test Presentations Case study MCQ Oral Attendance
20 10 10 - - 10
Term End Examination : 50 Marks
Syllabus:
String Manipulation
Accessing Strings
Basic Operations
3 String slices 5 - -
Function and Methods
`
Functions
Defining a function
calling a function
5 Types of functions 4 - -
Function Arguments
Anonymous functions
Global and local variables
Modules
Importing module
Math module
6 Random module 4 - -
Packages
Composition
Input-Output
Printing on screen
Reading data from keyboard
7 Opening and closing file 4 - -
Reading and writing files
Functions
Regular expressions
Match function
Search function
8 4 - -
Matching VS Searching
Modifiers
Patterns
CGI
Introduction
9 Architecture 5 - -
CGI environment variable
GET and POST methods
Cookies
File upload
Database
Introduction
Connections
10 5 - -
Executing queries
Transactions
Handling error
COURSE STRUCTURE
Pre-requisites:
Knowledge of any scripting language, XML.
Course Objectives:
1. To understand why Python is a useful scripting language for developers.
2. To learn how to design and program Python applications.
3. To learn how to use lists, tuples, and dictionaries in Python programs.
4. To learn how to identify Python object types.
5. To define the structure and components of a Python program.
6. To learn how to write loops and decision statements in Python.
Course Outcomes:
1. Students will demonstrate the ability to solve problems using system approaches, critical
and innovative thinking, and technology to create solutions.
2. Students will design, develop, and present their final project.
3. Students will understand the purpose and the process of code reviews.
4. Students will be able to create scripts in Python for Autodesk's Maya.
5. Students will understand and will be able to articulate and apply the principles of 3D
graphics
Course Contents:
Introduction to Python
Introduction to python language.
String Manipulation
Introduction to various operations on strings.
Functions
Introduction to functions in python.
Modules
Introduction to module, package in python.
Input-Output
Handling of inputs in python
Regular expressions
Use of regular expression in python
CGI
Introduction to CGI and cookies.
Database
Handling of database in python.
`
Learning Resources:
Reference Books:
1. Dive into Python by Mark Pilgrim
2. Programming Python by Mark Lutz, O’Reilly Media
3. Python Programming: An Introduction to Computer Science” by John Zelle
Supplementary Reading:
1. Python Testing Cookbook by Greg L. Turnquist
Web Resources:
1. www.tutorialspoint.com/python/
2. docs.python.org/3/tutorial/
3. www.learnpython.org
4. www.guru99.com/python-tutorials.html
5. www.tutorialspoint.com/cprogramming/
6. www.learn-c.org/
7. www.w3schools.in/c-tutorial/
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
Assignments Test Presentations Case study MCQ Oral Attendance
20 10 10 - - 10
Term End Examination : 50 Marks
Syllabus:
`
String Manipulation
Accessing Strings
Basic Operations
3 - 2 -
String slices
Function and Methods
`
Random module
Packages
Composition
Input-Output
Printing on screen
Reading data from keyboard
7 Opening and closing file - 3 -
Reading and writing files
Functions
Regular expressions
Match function, Search function
8 Matching VS Searching - 3 -
Modifiers
Patterns
CGI
Introduction
Architecture
9 CGI environment variable - 2 -
GET and POST methods
Cookies
File upload
Database
Introduction Connections
10 Executing queries - 2 -
Transactions
Handling error
ggest the below items:
Prepared By Checked By Approved By
COURSE STRUCTURE
Some basic knowledge and experience of Java (Jars, Array, Classes, Objects, etc.)
Course Objectives:
1. Learn tips and tricks for Big Data use cases and solutions.
2. Learn to build and maintain reliable, scalable, distributed systems with Apache
3. Able to apply Hadoop ecosystem components.
Course Outcomes:
1. Students will learn tips and tricks for Big Data use cases and solutions.
2. They will able to build distributed systems with Apache Hadoop.
3. They will able to apply Hadoop ecosystem components.
Course Contents:
1. Introduction to big data: Introduction, distributed file system, Big Data and its importance,
Drivers, Big data analytics, Big data applications. Algorithms, Matrix-Vector, Multiplication by
Map Reduce.
2. Introduction to HADOOP: Big Data, Apache Hadoop & Hadoop Ecosystem, MapReduce,
Data Serialization.
4. HADOOP ecosystem and yarn: Hadoop ecosystem components, Hadoop 2.0 New Features
NameNode High Availability, HDFS Federation, MRv2, YARN, Running MRv1 in YARN.
Lab Assignments
1. Lab on Install and configure Hadoop cluster
Learning Resources:
Reference Books:
1. Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, “Professional Hadoop Solutions”,
Wiley, ISBN: 9788126551071, 2015.
2. Chris Eaton, Dirk deroos et al. “Understanding Big data ”, McGraw Hill, 2012.
3. Tom White, “HADOOP: The definitive Guide”, O Reilly 2012.
4. MapReduce Design Patterns (Building Effective Algorithms & Analytics for Hadoop) by
Donald Miner & Adam Shook
Supplementary Reading:
Weblinks:
https://cloudthat.in/course/processing-bigdata-with-apache-hadoop/
Assessment Scheme:
Laboratory Continuous Assessment (LCA) 50 Marks
Practical Oral based on Site Visit Mini Project Problem based Any other
practical Learning
10 20 - - 20 -
Term End Examination : 50 Marks
Syllabus:
Course Objectives:
Course Outcomes:
Students will be able to:
1. Recognize and make appropriate use of different types of data structures
2. Use R to create sophisticated figures and graphs
3. Identify and implement appropriate control structures to solve a particular programming
problem
4. Design and write functions in R and implement simple iterative algorithms.
Course Contents:
Introduction to R
Overview of R programming, Evolution of R, Applications of R programming, Basic syntax
Basic Concepts of R
Reserved Words, Variables & Constants
Data structures in R
Vectors, Matrix
Control flow
If...else,If else() Function
Functions
R Functions, Function Return Value
Strings
String construction rules
R packages
Study of different packages in R
R Data Reshaping
Joining Columns and Rows in a Data Frame
Reference Books:
1. The Art of R Programming-a tour of statistical software design by Norman Matloff
2. R Cookbook: Proven Recipes for Data Analysis, Statistics, and Graphics (O'Reilly
Cookbooks) by Paul Teetor
3. R in Action Book by Rob Kabacoff
4. Practical Data Science with R by Nina Zumel , John Mount , Jim Porzak
5. Learning R: A Step-by-Step Function Guide to Data Analysis by Richard Cotton
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation.
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
Assignments Test Presentations Case study MCQ Oral Attendance
10 10 - - 10 10 10
Syllabus:
Pradnya Mahadik
Preeti Adhav Dr. Sudhir Gavhane
BOS Chairman Dean
Lecturer
COURSE STRUCTURE
Some basic knowledge and experience of Java (Jars, Array, Classes, Objects, etc.)
Course Objectives:
What is Hadoop and how can it help process large data sets.
How to write MapReduce programs using Hadoop API.
How to use HDFS (the Hadoop Distributed Filesytem), from the command line and API,
for effectively loading and processing data in Hadoop.
How to ingest data from a RDBMS or a data warehouse to Hadoop.
Best practices for building, debugging and optimizing Hadoop solutions.
Get introduced to tools like Pig, Hive, HBase, Elastic MapReduce etc. and understand how
they can help in BigData projects.
Course Outcomes:
Understand Sqoop architecture and uses Able to load real-time data from an RDBMS
table/Query on to HDFS Able to write sqoop scripts for exporting data from HDFS onto
RDMS tables.
Understand Apache PIG , PIG Data Flow Engine Understand data types, data model, and
modes of execution.
Able to store the data from a Pig relation on to HDFS.
Able to load data into Pig Relation with or without schema.
Able to split, join, filter, and transform the data using pig operators Able to write pig scripts
and work with UDFs.
Understand the importance of Hive, Hive Architecture Able to create Managed, External,
Partitioned and Bucketed Tables Able to Query the data, perform joins between tables
Understand storage formats of Hive Understand Vectorization in Hive
Data Storage
What is Hadoop Distributed File System (HDFS). Architecture of HDFS.Architectural assumptions
and goals.How data is stored in HDFS.How data is read from HDFS
Namenodes and Datanodes
Data Processing
What is use of MapReduce.Architecture of the MapReduce framework.what are Phases of a
MapReduce Job.what are MapReduce Design Patterns.what is YARN Architecture
Data Integration
How to Integrate Hadoop into your existing enterprise.Introduction to Sqoop
Learning Resources:
Reference Books:
Weblinks:
https://cloudthat.in/course/processing-bigdata-with-apache-hadoop/
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Class Continuous Assessment (CCA) 50 marks
To introduce the students to the use of basic methodology for the solution of liner programs
and integer programs.
To introduce the students to the advanced methods for large-scale transportation and
assignment problems.
Course Outcomes:
Define and formulate linear programming problems and appreciate their limitations.
Solve linear programming problems using appropriate techniques and optimization solvers,
interpret the results obtained and translate solutions into directives for action.
Conduct and interpret post-optimal and sensitivity analysis and explain the primal-dual
relationship.
Identify the special features of the transportation problem, and assignment problem.
Course Contents
Non-Linear programming
Kuhn-Tucker conditions. Convex functions and convex regions. Convex programming
problems. Algorithms for solving convex programming problems.
Calculation on CPM network, Various floats for activities. Critical path updating projects.
Operation time cost trade off Curve project. Time cost – trade off Curve- Selection of schedule
based on Cost.
Learning Resources:
Reference Books:
1. Hadley G. (1969): Linear Programming, Addision Wesley.
2. Taha H. A. (1971): Operations Research an Introduction, Macmillan N. Y.
3. KantiSwaroop, Gupta and Manmohan (1985): Operations Research, Sultan
Chand and Co.
4. Sharma J. K. (2003): Operations Research Theory and Applications, 2
Nd Ed. Macmillan India ltd.
5. Sharma J. K. (1986): Mathematical Models Operations Research, McGraw Hill.
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Non-Linear programming
Kuhn-Tucker conditions. Convex functions and convex regions.
3 10 - -
Convex programming problems. Algorithms for solving convex
programming problems.
PERT and CPM
Basic differences between PERT and CPM. Arrow Networks,
time estimates, Earliest expected time. Latest – allowable
occurrences time. Forward Pass Computation, Backward Pass
Computation. Representation in Tabular Form, Critical Path.
4 Probability of meeting scheduled date of completion. Calculation 10 - -
on CPM network, Various floats for activities. Critical path
updating projects. Operation time cost trade off Curve project.
Time cost – trade off Curve- Selection of schedule based on Cost.
COURSE STRUCTURE
Course Objectives:
1. To study the usage and applications of Object Oriented database
2. To acquire knowledge on variety of NoSQL databases
3. To attain inquisitive attitude towards research topics in NoSQL databases
Course Outcomes:
1: Master the basics of SQL and construct queries using Pl/SQL efficiently and apply object
oriented features for developing database applications.
2: Compare and Contrast NoSQL databases with each other and Relational Database
Systems
3: Critically analyse and evaluate variety of NoSQL databases.
4: Demonstrate the knowledge of Key-Value databases, Document based Databases,
Column based Databases and Graph Databases.
Course Contents:
1. Introduction to NOSQL
Definition of NOSQL, History of NOSQL and Different NOSQL products, Exploring MondoDB
Java/Ruby/Python, Interfacing and Interacting with NOSQL
2. NOSQL Basics
NOSQL Storage Architecture, CRUD operations with MongoDB, Querying, Modifying and
Managing NOSQL Data stores, Indexing and ordering datasets (MongoDB/CouchDB/Cassandra)
3. Advanced NOSQL
NOSQL in CLOUD, Parallel Processing with Map Reduce, Big Data with Hive
4. Working with NOSQL
Surveying Database Internals, Migrating from RDBMS to NOSQL, Web Frameworksand NOSQL,
using MySQL as a NOSQL
Learning Resources:
Reference Books:
Dan Sullivan,"NoSQL for Mere Mortals",1 stEdition, Pearson Education, 2015. (ISBN-13:
978-9332557338)
Supplementary Reading:
Pramod J. Sadalage, Martin Fowler,"NoSQL Distilled: A Brief Guide to the Emerging
World of Polyglot Persistence", 1 stEdition, Pearson Education, 2012. (ISBN-13: 978-
8131775691
Syllabus:
NOSQL Basics
NOSQL Storage Architecture, CRUD operations with MongoDB,
2 Querying, Modifying and Managing NOSQL Data stores, 12 - 1
Indexing and ordering datasets (MongoDB/CouchDB/Cassandra)
Advanced NOSQL
NOSQL in CLOUD, Parallel Processing with Map Reduce, Big
3 8 - 1
Data with Hive
Course Objectives:
Course Outcomes:
Students will be able to:
1. Recognize and make appropriate use of different types of data structures
2. Use R to create sophisticated figures and graphs
3. Identify and implement appropriate control structures to solve a particular programming
problem
4. Design and write functions in R and implement simple iterative algorithms.
Course Contents:
Data structures in R:
Vectors, Matrix, List, Data Frame/ Factor
Control flow:
Decision making, Repeat, while, for
R packages, R Data Reshaping: Joining Columns and Rows in a Data Frame, Merging Data
Frames
Working with files, R object and Class: csv, excel, S3 and S4 Class, reference
Data visualization in R and Data Management: Bar Chart, Dot Plot, Scatter Plot (3D),Spinning
Scatter Plots, Pie Chart, Histogram (3D) [including colorful ones], Overlapping Histograms,
Boxplot, Plotting with Base and Lattice Graphics, Missing Value Treatment, Outlier Treatment,
Sorting Datasets, Merging Datasets, Binning variables
Reference Books:
1. The Art of R Programming-a tour of statistical software design by Norman Matloff
2. R Cookbook: Proven Recipes for Data Analysis, Statistics, and Graphics (O'Reilly
Cookbooks) by Paul Teetor
3. R in Action Book by Rob Kabacoff
4. Practical Data Science with R by Nina Zumel , John Mount , Jim Porzak
5. Learning R: A Step-by-Step Function Guide to Data Analysis by Richard Cotton
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation.
Assessment Scheme:
Class Continuous Assessment (CCA) 50
Laboratory Continuous Assessment (LCA)50
Syllabus:
Course Objectives:
What is Hadoop and how can it help process large data sets.
How to write MapReduce programs using Hadoop API.
How to use HDFS (the Hadoop Distributed Filesytem), from the command line and API,
for effectively loading and processing data in Hadoop.
How to ingest data from a RDBMS or a data warehouse to Hadoop.
Best practices for building, debugging and optimizing Hadoop solutions.
Get introduced to tools like Pig, Hive, HBase, Elastic MapReduce etc. and understand how
they can help in BigData projects.
Course Outcomes:
Understand Sqoop architecture and uses Able to load real-time data from an RDBMS
table/Query on to HDFS Able to write sqoop scripts for exporting data from HDFS onto
RDMS tables.
Understand Apache PIG , PIG Data Flow Engine Understand data types, data model, and
modes of execution.
Able to store the data from a Pig relation on to HDFS.
Able to load data into Pig Relation with or without schema.
Able to split, join, filter, and transform the data using pig operators Able to write pig scripts
and work with UDFs.
Understand the importance of Hive, Hive Architecture Able to create Managed, External,
Partitioned and Bucketed Tables Able to Query the data, perform joins between tables
Understand storage formats of Hive Understand Vectorization in Hive
Course Contents
Data Storage
What is Hadoop Distributed File System (HDFS). Architecture of HDFS.Architectural assumptions
and goals.How data is stored in HDFS.How data is read from HDFS
Data Processing
What is use of MapReduce.Architecture of the MapReduce framework.what are Phases of a
MapReduce Job.what are MapReduce Design Patterns.what is YARN Architecture
Data Integration
How to Integrate Hadoop into your existing enterprise.Introduction to Sqoop
Lab Assignments
1. Lab on Manipulating files in HDFS pragmatically using the FileSystem API.Alternative
Hadoop File Systems: IBM GPFS, MapR-FS, Lustre, Amazon S3 etc.
2. Lab on Write an Inverted Index MapReduce Application with custom Partitioner and
Combiner Custom types and Composite Keys Custom Comparators InputFormats and
OutputFormats Distributed Cache MapReduce Design Patterns Sorting Joins.
3. Lab on Writing a streaming MapReduce job in Python YARN and Hadoop 2.0.
4. Lab on Importing data from an RDBMS to HDFS using Sqoop.
5. Lab on Exporting data from HDFS to an Other data integration tools: Flume, Kafka,
Informatica, Talend etc.
Learning Resources:
Weblinks:
https://cloudthat.in/course/processing-bigdata-with-apache-hadoop/
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Syllabus:
Module Workload in Hrs
Contents
No. Theory Lab Assess
Data Storage
File System Abstraction
Big Data and Distributed File Systems
Hadoop Distributed File System (HDFS)
HDFS Architecture,Architectural assumptions and goals
How data is stored in HDFS
How data is read from HDFS
Namenodes and Datanodes
- 13 -
Blocks,Data Replication
Fault Tolerance
Data Integrity Namespaces
Federation in Hadoop 2.0
High Availability in Hadoop 2.0
Security and Encryption
HDFS Interfaces: FileSystem API, FSShell, WebHDFS,
Fuse etc.
Course Outcomes:
1. The student should be able to formulate and solve problems related to topics covered in this course.
2. The student should be able to solve the problems using Python or R
3. Perform statistical analysis on variety of data.
Course Contents:
4. Interval Estimation
5. Fundamentals of Hypothesis Testing
6. Two-Sample Tests
7. Inferences about Population Variances
8. Tests of Goodness of Fit and Independence
9. Experimental Design and ANOVA
10. Simple Linear Regression
Learning Resources:
Reference Books:
Text Book: David R Anderson, Dennis J Sweeney, Thomas A Williams, Jeffrey D. Camm and
James J. Cochran, Statistics for Business and Economics. 12th Edition. Cengage Learning. 2014
(note that a new edition, 13e, has recently come up, but mostly unavailable)
Assessment Scheme:
Syllabus:
. Two-Sample Tests 4
Comparing Means of Two Independent
Populations
10
Comparing Means of Two Related Populations
Comparing Two Population Proportions
Experimental Design and ANOVA 4
An Introduction
ANOVA and the Completely Randomized
Design
13
Multiple Comparison Procedure
Randomized Block Design and Factorial
Experiment (Self Study)
BOS Chairman
`
COURSE STRUCTURE
Course Objectives:
1. To provide an understanding of principal concepts, major issues, technologies and basic
approaches in information security.
2. Develop a basic understanding of cryptography, how it has evolved and some key encryption
techniques used today
CourseOutcomes:
The students will have firm understanding of:
1. Basic concepts related to network and system level security.
2. Basics of cryptography, security management and network security techniques.
3. Information security governance, and related legal and regulatory issues
4. How threats to an organization are discovered, analyzed, and dealt with.
CourseContents:
UNIT - I
Security Attacks (Interruption, Interception, Modification and Fabrication), Security Services
UNIT - II
Conventional Encryption Principles, Conventional encryption algorithms
UNIT - III
Public key cryptography principles, public key cryptography algorithms
UNIT - IV
Email privacy: Pretty Good Privacy (PGP) and S/MIME.
UNIT - V
IP Security Overview, IP Security Architecture
UNIT - VI
Web Security Requirements, Secure Socket Layer (SSL) and Transport Layer Security (TLS)
UNIT - VII
Basic concepts of SNMP, SNMPv1 Community facility and SNMPv3.
UNIT - VIII Firewall Design principles, Trusted Systems. Intrusion Detection Systems.
LearningResources:
TEXT BOOKS:
1. Network Security Essentials (Applications and Standards) by William Stallings Pearson
Education.
2. Hack Proofing your network by Ryan Russell, Dan Kaminsky, Rain Forest Puppy, Joe Grand,
David Ahmad, Hal Flynn Ido Dubrawsky, Steve W.Manzuik and Ryan Permeh, Wiley Dreamtech
REFERENCES:
1. Fundamentals of Network Security by Eric Maiwald (Dreamtech press)
2. Network Security - Private Communication in a Public World by Charlie Kaufman, Radia
Perlman and Mike Speciner, Pearson/PHI.
3. Cryptography and network Security, Third edition, Stallings, PHI/Pearson
4. Principles of Information Security, Whitman, Thomson.
5. Network Security: The complete reference, Robert Bragg, Mark Rhodes, TMH
6. Introduction to Cryptography, Buchmann, Springer.
Syllabus:
Module Workload in Hrs
Contents
No. Theory Lab Assess
UNIT - I
Security Attacks (Interruption, Interception, Modification and
1 9 - -
Fabrication), Security Services (Confidentiality, Authentication,
Integrity, Non-repudiation, access Control and Availability) and
COURSE STRUCTURE
CourseOutcomes:
The student will get knowledge of:
1. Concepts of Scala and its implementation.
2. Concepts of Spark and how it is used along with Spark.
CourseContents:
Introduction: Introduction to Scala, History of Scala
LearningResources:
1. Programming Scala by Dean Wampler, Alex Payne
6. Learning Spark by Matei Zaharia, Patrick Wendell, Andy Konwinski, Holden Karau
7. Advanced Analytics with Spark by Sandy Ryza, Uri Laserson, Sean Owen and Josh Wills
Syllabus:
Module Workload in Hrs
Contents
No. Theory Lab Assess
Introduction: Introduction to Scala, History of Scala, Features
1 3 - -
Basic Syntax, Scala Comments, Variables, Data types, Operators.
Conditional Expressions: If-else, While, do-while, for, Pattern
2 5 - -
matching, break statement.
Scala Function: Function declaration, function definition,
Function calling, Functions-Call by name, Functions with named
3 arguments, Functions with variable arguments, Default parameter 7 - -
values, Nested functions, Recursion, Higher order functions,
Scala Closures.
Scala Classes and Objects: Object, Class, Singleton Object,
Companion Object, access modifiers, constructors, method
4 overloading, inheritance, method overriding, this keyword, 4 - -
inheritance, method overriding, field overriding, final, Scala
Abstract class, Scala Trait, Apply and Unapply.
Array and Strings: Single dimensional, Passing array into the
5 function, Multidimensional Array, Strings, String methods, String 5 - -
Interpolation
Scala Collections: Sequence, List, Set, Map, Tuples, Options,
6 5 - -
Iterators
7 File Input-Output: Reading and Writing of files 1 - -
Course Outcomes:
1. Select real-world applications that needs machine learning based solutions.
2. Implement and apply machine learning algorithms.
3. Select appropriate algorithms for solving a particular group of real-world problems.
4. Recognize the characteristics of machine learning techniques that are useful to solve
real-world problems.
Course Contents
Introduction to learning
What is Supervised, Unsupervised and Reinforcement Learning? visualization of algebraic
concepts
Linear Regression
What is Regression? What is simple one variable regression line and coefficients of the line? What
are assumptions of linear regression? What is Gradient descent algorithm, cost function to find
'beta' values and concept
Gradient Descent
How to represent matrix of problem? How to use Gradient descent for multiple features and
scaling techniques in gradient descent? What are types of feature scaling, finding coefficients
analytically?
Learning Resources:
Assessment Scheme:
Course Objectives:
Course Outcomes:
1. The student should be able to formulate and solve problems related to topics covered in this course.
2. The student should be able to solve the problems using Python or R
3. Perform statistical analysis on variety of data.
Course Contents:
Laboratory Exercises / Practical:
Learning Resources:
Reference Books:
Text Book: David R Anderson, Dennis J Sweeney, Thomas A Williams, Jeffrey D. Camm and James J.
Cochran, Statistics for Business and Economics. 12th Edition. Cengage Learning. 2014 (note that a new
edition, 13e, has recently come up, but mostly unavailable)
Assessment Scheme:
Syllabus:
Workload in Hrs
Module
Contents Ass
No. Theory Lab
ess
1 Data and Statistics: 3
2 Descriptive Statistics:, 3
3 Descriptive Statistics: 3
Probability 3
4
Discrete Probability Distributions 3
5
Continuous Probability Distribution 3
6
Sampling and Sampling Distributions 3
7
Interval Estimation 3
8
Fundamentals of Hypothesis Testing 3
9
. Two-Sample Tests 3
10
. Inferences about Population Variances 3
11
Tests of Goodness of Fit and Independence 3
12
Experimental Design and ANOVA 3
13
Simple Linear Regression 3
14
BOS Chairman
COURSE STR UCTURE
Reference Books
1. Peter Flach: Machine Learning: The Art and Science of Algorithms that Make
Sense of Data, Cambridge University Press, Edition 2012.
2. Hastie, Tibshirani, Friedman: Introduction to Statistical Machine Learning with
Applications in R, Springer, 2nd Edition-2012.
3. C. M. Bishop : Pattern Recognition and Machine Learning, Springer 1st Edition-
2013.
4. Ethem Alpaydin : Introduction to Machine Learning, PHI 2nd Edition-2013.
5. Parag Kulkarni : Reinforcement and Systematic Machine Learning for Decision
Making, Wiley-IEEE Press, Edition July 2012.
Supplementary Reading:
Web Resources:
Weblinks: -
MOOCs: -
Pedagogy:
Mini Project development, Problem solving approach, Participative learning, discussions, algorithm,
Program writing, experiential learning through practical problem-solving, assignment, PowerPoint
presentation
Assessment Scheme:
Syllabus:
Course Objectives:
Learning in neural networks output vs hidden layers; linear vs nonlinear networks
Course Outcomes: Understand Deep Learning
Course Contents
Course overview What is deep learning? DL successes; syllabus & course logistics;
Intro to neural networks cost functions, hypotheses and tasks; training data; maximum likelihood
based cost, cross entropy, MSE cost; feed-forward networks; MLP, sigmoid units; neuroscience
inspiration; Learning in neural networks output vs hidden layers; linear vs nonlinear networks;
Backpropagation learning via gradient descent; recursive chain rule (backpropagation); if time:
bias-variance tradeoff, regularization; output units: linear, softmax; hidden units: tanh,
Deep learning strategies I (e.g., GPU training, regularization,etc); project proposals
Deep learning strategies II (e.g., RLUs, dropout, etc) SCC/TensorFlow overview How to use the
SCC cluster; introduction to Tensorflow. CNNs I Convolutional neural networks
Deep Belief Nets I probabilistic methods RNNs I Recurrent neural networks Other DNN variants
(e.g. attention, memory networks, etc.)
Neural Turing Machines(Kate) Unsupervised deep learning I(e.g. autoencoders etc.)
Unsupervised deep learning II (e.g. deep generative models etc.)
Deep reinforcement learning Vision applications I NLP applications I Laboratory Exercises /
Practical: NA
Reference Books
1. Ian Goodfellow, Yoshua Bengio, Aaron Courville. Deep Learning.
Supplementary Reading:
1. Duda, R.O., Hart, P.E., and Stork, D.G. Pattern Classi cation . Wiley-Interscience.
2nd Edition. 2001.
2. Theodoridis, S. and Koutroumbas, K. Pattern Recognition. Edition 4 . Academic
Press, 2008.
3. Russell, S. and Norvig, N. Artificial Intelligence: A Modern Approach . Prentice Hall
Series in ArtificialIntelligence. 2003.
4. Bishop, C. M. Neural Networks for Pattern Recognition . Oxford University Press.
1995.
5. Hastie, T., Tibshirani, R. and Friedman, J. The Elements of Statistical Learning .
Springer. 2001.
6. Koller, D. and Friedman, N. Probabilistic Graphical Models . MIT Press. 2009.
Web Resources:
Weblinks: -
MOOCs: -
Pedagogy:
Mini Project development, Problem solving approach, Participative learning, discussions, algorithm,
Program writing, experiential learning through practical problem-solving, assignment, PowerPoint
presentation
Assessment Scheme:
Syllabus:
CourseOutcomes:
1. Select real-world applications that needs machine learning based solutions.
2. Implement and apply machine learning algorithms.
3. Select appropriate algorithms for solving a particular group of real-world problems.
4. Recognize the characteristics of machine learning techniques that are useful to solve
real-world problems.
CourseContents
T. Hastie, R. Tibshirani and J. Friedman, “Elements of Statistical Learning”,
1.
2.
Springer, 2009.
E. Alpaydin, “Machine Learning”, MIT Press, 2010.
3.
K. Murphy, “Machine Learning: A Probabilistic Perspective”, MIT Press, 2012.
4.
C. Bishop, “Pattern Recognition and Machine Learning, Springer”, 2006.
5.
Shai Shalev-Shwartz, Shai Ben-David, “Understanding Machine Learning:From Theory to
6.
Algorithms”, Cambridge University Press, 2014.
7. John Mueller and Luca Massaron, “Machine Learning For Dummies“, John Wiley &
Sons, 2016.
Pedagogy:
AssessmentScheme:
Syllabus:
Naive Bayes:
Venn diagrams, Naive Bayes algorithm, application and
2 problems, Naive Bayes learning, Bayesian inference, Retail 12 - -
basket analysis; Concept of boosting and bagging
Association Rules
4 Association rule mining, K-nearest neighbours algorithm. 09 - -
Course Objectives:
To describe Data Science Life cycle.
To describe Data Visualization
Course Outcomes:
Students will be able to understand Data Science Life cycle & Data Visualization
Course Contents:
1. What is Data Science?
What does Data Science involve?
Era of Data Science
Business Intelligence vs Data Science
Life cycle of Data Science including Extract Transform and Load
Data Preprocessing
Data Imputation
Data Cleaning
Data Transformation
Data Visualization
Data Analysis
Data Engineering - Big Data
Tools of Data Science
3. Visualization of Data
Introduction to Visualization.
Human Perception and Information Processing
Data types
Graphical perception (the ability of viewers to interpret visual
(graphical) encodings of information and thereby decode information in graphs
Color for information display
Color management systems
Picture visualization and fruition
Data Transformation into sources of knowledge through visual representation.
Requirements and heuristics for high-quality visualizations.
Charts and standard views: relevance and appropriateness.
Advanced and innovative tools for data visualization and advanced quantitative analysis.
The evaluation of the quality of visualizations and infographics.
Learning Resources:
Reference Books:
1. Foundations of Data Science By Avrim Blum, John Hopcroft, and Ravindran Kannan
Pedagogy:
Participative learning, discussions, algorithm, experiential learning through practical problem
solving, assignment, PowerPoint presentation
Assessment Scheme:
Syllabus :
Module Workload in Hrs
Contents
No. Theory Lab Assess
What is Data Science?
What does Data Science involve?
Era of Data Science
1 Business Intelligence vs Data Science 12 - -
Life cycle of Data Science
Tools of Data Science
Course Outcomes:
1. Select real-world applications that needs machine learning based solutions.
2. Implement and apply machine learning algorithms.
3. Select appropriate algorithms for solving a particular group of real-world problems.
4. Recognize the characteristics of machine learning techniques that are useful to solve
real-world problems.
Course Contents
Introduction to learning
What is Supervised, Unsupervised and Reinforcement Learning? visualization of algebraic
concepts
Linear Regression
What is Regression? What is simple one variable regression line and coefficients of the line? What
are assumptions of linear regression? What is Gradient descent algorithm, cost function to find
'beta' values and concept
Gradient Descent
How to represent matrix of problem? How to use Gradient descent for multiple features and
scaling techniques in gradient descent? What are types of feature scaling, finding coefficients
analytically?
Learning Resources:
Assessment Scheme:
Course Objectives:
Data Cleaning
Data Transformation
Data Visualization
Data Analysis
Data Engineering - Big Data
Tableau Desktop
Getting Started
Connecting to Data
Visual Analytics
Dashboards and Stories
Mapping
Calculations
Why is Tableau Doing That?
How To cleanse & represent
Learning Resources:
Reference Books:
Foundations of Data Science By Avrim Blum, John Hopcroft, and Ravindran Kannan
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation.
Assessment Scheme:
Class Continuous Assessment (CCA) 50
Syllabus:
Basic of Tableau :
i. Tableau interface:
Menus and Toolbar
Data Pane
Analytics Pane
Sheet Tabs
Shelves and Cards
Marks Card
Legends
4 - -
Layout for Dashboards & Stories
Distributing and Publishing
ii. Distributing & publishing:
Way to share
Exploring images and PDFs
Workbook file types
Opening workbook files
Sharing securely
Connecting with Data:
Getting Started with Data
Managing Metadata
Managing Extracts
Saving and Publishing Data Sources
Data Prep with Text and Excel Files
5 - -
Join Types with Union
Cross-database Joins
Data Blending
Additional Data Blending Topics
Connecting to Cubes
Connecting to PDFs
Visual Analytics:
Interactive Filters
Where Tableau Filters
Additional Filtering Topics
Parameters
Formatting
The Formatting Pane
Basic Tooltips
Viz in Tooltip
Trend Lines
Reference Lines
Forecasting
Clustering
Analysis with Cubes and MDX
Dashboards and Stories:
Getting Started with Dashboards and Stories
Building a Dashboard
Dashboard Objects
7 Dashboard Formatting - -
Device Designer
Dashboard Interactivity Using Actions
Story Points
Mapping:
Getting Started with Mapping
Maps in Tableau
Editing Unrecognized Locations
Spatial Files
8 Expanding Tableau's Mapping Capabilities - -
Custom Geocoding
Polygon Maps
Mapbox Integration
WMS: Web Mapping Services
Background Images
Calculations:
Calculation Syntax
Introduction to LOD Expressions
Modifying Table Calculations
9 Aggregate Calculations - -
Logic Calculations
String Calculations
Number Calculations
Type Calculations
Conceptual Topics with LOD Expressions
Aggregation and Replication with LOD Expressions
Nested LOD Expressions
How to Integrate R and Tableau
Using R within Tableau
Date Calculations
Getting Started with Calculations
Intro to Table Calculations
Why is Tableau Doing That?
Understanding Pill Types
Measure Names and Measure Values
10 Aggregation, Granularity, and Ratio Calculations - -
When to Blend and When to Join
Fixing "Incorrect" Sorts
Filtering for Top Across Panes
How To
Finding the Second Purchase Date with LOD
Expressions
Using a Parameter to Change Fields
Cleaning Data by Bulk Re-aliasing
11 Bollinger Bands - -
Bump Charts
Control Charts
Funnel Charts
Pareto Charts
Waterfall Charts
Course Outcomes:
Course Contents:
M2M to IoT
Introduction of M2M to IoT
Learning Resources:
Reference Books:
1. Jan Holler, Vlasios Tsiatsis, Catherine Mulligan, Stefan Avesand, Stamatis
Karnouskos, David Boyle, “From Machine-to-Machine to the Internet of Things:
Introduction to a New Age of Intelligence”, 1st Edition, Academic Press, 2014.Data
Warehousing in the Real World, Anahory, Murray, Pearson Education
2. Vijay Madisetti and Arshdeep Bahga, “Internet of Things (A Hands-on-Approach)”,
1st Edition, VPT, 2014.
3. Francis daCosta, “Rethinking the Internet of Things: A Scalable Approach to
Connecting Everything”, 1st Edition, Apress Publications, 2013
Supplementary Reading:
1. Collaborative Internet of Things (C-IoT): For Future Smart Connected Life and
Business
2. By Fawzi Behmann, Kwok Wu
Weblinks:
www.tutorialspoint.com
Pedagogy:
Participative learning, discussions, Problem Solving, experiential learning through practical
problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
Course Objectives:
1. To learn the fundamental concepts of Digital Image Processing.
2. To study basic image processing operations.
3. To understand image analysis algorithms.
4. To expose students to current applications in the field of digital image processing.
Course Outcomes:
1. Understand image formation and the role human visual system plays in perception of gray
and color image data.
2. Get broad exposure to and understanding of various applications of image processing in
industry, medicine, and defense.
3. Learn the signal processing algorithms and techniques in image enhancement and image
restoration.
4. Acquire an appreciation for the image processing issues and techniques and be able to apply
these techniques to real world problems.
5. Be able to conduct independent study and analysis of image processing problems and
techniques
Course Contents
Introduction
What is Image Processing?, The origins of Image Processing, Examples of Fields that use Image
Processing, Gamma-Ray Imaging, X-Ray Imaging, Imaging in the Ultraviolet Band, Imaging in
the Visible and Infrared Bands, Imaging in the Microwave Band, Imaging in the Radio Band,
Fundamental steps in Digital Image Processing, Components of an Image Processing System
Image Segmentation
Fundamentals, Point, Line, and Edge Detection, Background, Detection of Isolated Points, Line
Detection
Edge Models, Basic Edge Detection, Edge Linking and Boundary Detection, Thresholding,
Foundation, Basic Global Thresholding, Optimum Global Thresholding Using Otsu's Method.
Learning Resources:
Reference Books
B1: Cay’s Horstmann and Gary Cornell Core Java Volume -1 and Volume 2.
B2: Herbert Schildt (TMH) The complete reference JAVA-2 Fifth Edition.
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation.
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
ML basics
Algorithms, Naïve Bayes, Bayesian Statistics, HMM, CRF
Word Forms
POS tagging and Chunking: Morphology fundamentals; Morphological Diversity of Indian
Languages; Morphology Paradigms; Finite State Machine Based Morphology; Automatic
Morphology Learning; Shallow Parsing; Named Entities; Maximum Entropy Models; Random
Fields, POS tagging techniques, Chunking techniques:CRF.
Structures
Theories of Parsing, Parsing Algorithms; Robust and Scalable Parsing on Noisy Text as in Web
documents; dependency parsing; Hybrid of Rule Based and Probabilistic Parsing: MST, MALT
parser; Scope Ambiguity and Attachment Ambiguity resolution.
Meaning
Lexical Knowledge Networks, Wordnet Theory; Indian Language Wordnets and Multilingual
Dictionaries; Semantic Roles; Word Sense Disambiguation; WSD and Multilinguality; Metaphors;
Co-references.
Reference Books:
1. Allen, James, “Natural Language Understanding”, Second Edition, Benjamin/Cumming, 1995.
2. Charniack, Eugene, “Statistical Language Learning”, MIT Press, 1993.
3. Jurafsky, Dan and Martin, James, “Speech and Language Processing”,Second Edition, Prentice
Hall, 2008.
4. Manning, Christopher and Heinrich, Schutze, “Foundations of StatisticalNatural Language
Processing”, MIT Press, 1999.
5. AksharBharti, VineetChaitanya, Rajeev Sangal,”Natural Language Processing: An Paninian
perspective”
Web Resources:
Weblinks: -
MOOCs:-
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Syllabus:
Course Objectives:
Organizations worldwide are waking up to the opportunity of this revolutionary medium to fulfill
various business objectives ranging from Sales,
Marketing, CRM, Product Development and Research. This has created an ever increasing demand
of skilled Web Analytics professionals.The objective is to fulfill this demand.
Course Outcomes:
After taking this course, you will be able to: - Utilize various Application Programming
Interface (API) services to collect data from different social media sources such as YouTube,
Twitter, and Flickr. - Process the collected data - primarily structured - using methods involving
correlation, regression, and classification to derive insights about the sources and people who
generated that data. - Analyze unstructured data - primarily textual comments - for sentiments
expressed in them. - Use different tools for collecting, analyzing, and exploring social media data
for research and development purposes.
Course Contents:
Google Analytics
Getting Started With Google Analytics
How Google Analytics works?
Accounts, profiles, and users navigating
Google Analytics
Content performance analysis
Pages and Landing Pages
Event Tracking and Ad Sense Site Search
Learning Resources:
Reference Books:
Written by none other than Avinash Kaushik, Digital Marketing Evangelist for Google, Co-
Founder and Chief Education Officer for Market Motive, and author of two bestselling
books: Web Analytics 2.0, Web Analytics: An Hour A Day tops the chart when it comes to
best Web Analytics Books.
Supplementary Reading:
Weblinks:
Pedagogy:
Participative learning, discussions, Problem Solving, experiential learning through practical
problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Class Continuous Assessment (CCA): 50 Marks
What’s analysis?
Is analysis worth the effort?
• Small businesses
1 • Medium and large scale businesses 5 - -
Analysis vs intuition
What is web analytics?
Getting started with Google Analytics
• How Google Analytics works
• Accounts, profiles, and users
Google Analytics
Visitor analysis
Unique visitors
4 Geographic and language information 6 - 1
Technical reports
Benchmarking
Radian6
6 Sentiment analysis 7 - 1
Workflow management
Text analytics
Articulate the main concepts, key technologies, strengths, and limitations of cloud
computing and the possible applications for state-of-the-art cloud computing
Identify the architecture and infrastructure of cloud computing, including SaaS, PaaS, IaaS,
public cloud, private cloud, hybrid cloud, etc.
Explain the core issues of cloud computing such as security, privacy, and interoperability.
Choose the appropriate technologies, algorithms, and approaches for the related issues.
Identify problems, and explain, analyze, and evaluate various cloud computing solutions.
Provide the appropriate cloud computing solutions and recommendations according to the
applications used.
Attempt to generate new ideas and innovations in cloud computing.
Collaboratively research and write a research paper, and present the research online.
Course Contents:
INTRODUCTION
Introduction of Cloud
CLOUD SERVICES
Types of Cloud services
Service providers- Google, Amazon, Microsoft Azure, IBM, Sales force
Dr.Sudhir Gavhane
Dean LASC
Email Communication over the Cloud - CRM Management - Project Management-Event
Management - Task Management – Calendar - Schedules - Word Processing – Presentation
Spreadsheet - Databases – Desktop - Social Networks and Groupware
VIRTUALIZATION FOR CLOUD
Need for Virtualization – Pros and cons of Virtualization – Types of Virtualization –System
Vm, Process VM, Virtual Machine monitor – Virtual machine properties - Interpretation and
Binary translation, HLL VM - Hypervisors – Xen, KVM, VMWare, Virtual Box, Hyper-V.
Pedagogy:
Participative learning, discussions, algorithm, Flowchart & Program writing, experiential learning
through practical problem solving, assignment, PowerPoint presentation.
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
Assignments Test Presentations Case study Attendance Oral Any other
10 10 10 10 10 - -
Course Objectives:
Organizations worldwide are waking up to the opportunity of this revolutionary medium to fulfill
various business objectives ranging from Sales,
Marketing, CRM, Product Development and Research. This has created an ever increasing demand
of skilled Web Analytics professionals.The objective is to fulfill this demand.
Course Outcomes:
After taking this course, you will be able to: - Utilize various Application Programming
Interface (API) services to collect data from different social media sources such as YouTube,
Twitter, and Flickr. - Process the collected data - primarily structured - using methods involving
correlation, regression, and classification to derive insights about the sources and people who
generated that data. - Analyze unstructured data - primarily textual comments - for sentiments
expressed in them. - Use different tools for collecting, analyzing, and exploring social media data
for research and development purposes.
Course Contents:
Google Analytics
Getting Started With Google Analytics
How Google Analytics works?
Accounts, profiles, and users navigating
Google Analytics
Learning Resources:
Reference Books:
Written by none other than Avinash Kaushik, Digital Marketing Evangelist for Google, Co-
Founder and Chief Education Officer for Market Motive, and author of two bestselling books: Web
Analytics 2.0, Web Analytics: An Hour A Day tops the chart when it comes to best Web Analytics
Books
Supplementary Reading:
Weblinks:
Pedagogy:
Participative learning, discussions, Problem Solving, experiential learning through practical
problem solving, assignment, PowerPoint presentation
Assessment Scheme:
Class Continuous Assessment (CCA): 50 Marks
Syllabus:
What’s analysis?
Is analysis worth the effort?
• Small businesses
1 • Medium and large scale businesses 5 - -
Analysis vs intuition
What is web analytics?
Getting started with Google Analytics
• How Google Analytics works
• Accounts, profiles, and users
Google Analytics
Content performance analysis
Unique visitors
4 Geographic and language information 6 - 1
Technical reports
Benchmarking
Radian6
6 Sentiment analysis 7 - 1
Workflow management
Text analytics
Course Objectives:
1. This course will focus on developing marketing strategies and resource allocation
decisions driven by quantitative analysis.
2. This course covers basic concepts in marketing process Measuring Brand Assets
3. This course includes Customer Lifetime Value, Regression Analysis, and Spreadsheet
with Formulas.
Course Outcomes:
4. Students will know what are the basic marketing strategies
5. Students learn about the core concepts and tools in marketing
6. Students know about measure brand value, calculate brand value
7. Students understand the marketing models.
Course Contents
Regression Basics
What is Regression Analysis? How to interpret Regression Outputs? What is Multivariable
Regressions, Omitted Variable Bias? How to use Price Elasticity to Evaluate Marketing? What is
Log-Log Models and Marketing Mix Models?
Learning Resources:
1. Marketing Analytics A Practitioner's Guide to Marketing Analytics and Research Methods
By (author): Ashok Charan (NUS, Singapore)
2. Managing Customer Value One Stage at a Time By (author): Dilip Soman (University of
Toronto, Canada), Sara N-Marandi (University of Toronto, Canada)
3. Worldwide Casebook in Marketing Management By (author): Luiz Moutinho (Dublin City
University, Ireland)
4. Data-Driven Marketing: The 15 Metrics Everyone in Marketing Should Know Hardcover –
February 8, 2010 by Mark Jeffery (Author)
5. Lean Analytics: Use Data to Build a Better Startup Faster (Lean Series) Hardcover – March
21, 2013 by Alistair Croll (Author), Benjamin Yoskovitz (Author)
6. Digital Marketing Analytics: Making Sense of Consumer Data in a Digital World (Que
Biz-Tech) Paperback – April 25, 2013 by Chuck Hemann (Author), Ken Burbary (Author)
Pedagogy:
Assessment Scheme:
COURSE STRUCTURE
Course Objectives:
1. To introduce use of data analytics techniques in HR
Course Outcomes:
1. Students will able to use data analytics technique in HR.
Course Contents:
HR Analytics in perspective
Introduction to role of data analytics in HR
An analytics method
Describes challenges in HR and solution to it using data analytics
Toolkits
Introduction to various toolkits required for HRA.
Data challenges
Introduction to statistical methods for processing of data.
Predictive analytics
Introduction to use of predictive analysis for HR data .
Learning Resources:
Reference Books:
1. The New HR Analytics: Predicting the Economic Value of Your Company's Human
By Jac FITZ-ENZ
2. Predictive HR Analytics: Mastering the HR Metric By Dr Martin R. Edwards,
Kirsten Edwards
3. Predictive Analytics for Human Resources By Jac Fitz-enz, John Mattox, II
Supplementary Reading:
Weblinks:
1. MOOCs:
Assessment Scheme:
Class Continuous Assessment (CCA) 50 Marks
Assignments Test Presentations Case study MCQ Oral Attendance
20 10 10 - - 10
Term End Examination : 50 Marks
`
Syllabus:
Predictive analytics
When to use predictive analysis
8 Importance of innovation 8
What is “the organization as a system”?
Organization design
`