0% found this document useful (0 votes)
28 views20 pages

LTI1

Uploaded by

BALAKUMAR C
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
28 views20 pages

LTI1

Uploaded by

BALAKUMAR C
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 20

MASTER OF COMPUTER APPLICATIONS 2024-2025

SEMESTER I
24CAP101B PYTHON FOR DATA SCIENCE 5H-3C
Instruction Hours / Week: L:5 T:0 P:0 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:

 Probability and Statistics, Programming Skills, Python Libraries

COURSE OBJECTIVES (CO):


 To understand the basics of Python syntax and semantics.
 To create interactive visualizations using libraries like Plotly.
 To implement machine learning algorithms such as linear regression, decision trees, and
clustering.
COURSE OUTCOMES (COs):
At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Understand the concepts of the various programming Understanding
constructs of Python programming
CO2 Make use of object oriented concepts to solve real world Remembering
problems
CO3 Analyze the basics of python and standard modules used Analyzing
for data science with hands-on.
CO4 Understand the data structures and visualization used for Understanding
data science with hands-on.
CO5 Evaluate the machine learning libraries used for data Evaluating
science with hands-on.

UNIT I PYTHON - DATA STRUCTURES, OOPS & MODULES 12 HOURS


Data structures: Dictionaries - Maps - Hash Tables - Array Data Structures - Records - Structs - Data
Transfer Objects - Sets and Multisets-Stacks (LIFOs) - Queues (FIFOs) ; Python : Python installation -
Python OOPs - Polymorphism in OOPs programming - Python String Concatenation - Print Exception in
Python - Python Libraries - Python Pandas - Python Matplotlib - Python Seaborn - Python SciPy - Chatbot
in Python - Machine Learning using Python - Exploratory Data Analysis in Python - Open CV Python -
Tkinter - Pythons Turtle Module - PyGame in Python - Pytorch - Scrapy - Web Scraping - Django -
Python Programs - Types of Data structure in Python - Built in data structures - User defined data
structures; Object Oriented Concepts and Design : APIs and Data Collection - Simple API - REST APIs &
HTTP Requests - Web scraping - HTML for Web Scraping - file formats
UNIT II PYTHON – NUMPY, PANDAS & DS LIBRARIES 12 HOURS
Installation and setup : Anaconda Distribution - Anaconda Navigator to create a New Environment -
Startup and Shutdown Process - Intro to the Jupyter Lab Interface - Code Cell - execution; Python : Basic
datatypes - Operators - variables - Built in Functions - Custom Functions - String Methods - Lists - Index
Positions and Slicing - Navigating Libraries using Jupyter Lab; Series : Create series object from a list and
dictionary - The head and Tail methods - Passing Series to Python Built-In Functions – Methods for Data
sorting ; Dataframe : Methods and Attributes between Series and DataFrames - Fill in Missing Values -
Filtering data and methods in Dataframe - Data Extraction in dataframes -Working with Text Data -
Merging Dataframes; Data Mining - Data Processing and Modelling - Data Visualization
UNIT III VISUALIZATION 12 HOURS
Introduction to Matplotlib - Matplotlib Basics - Matplotlib - Understanding the Figure Object - Matplotlib
- Implementing Figures and Axes - Matplotlib - Figure Parameters - Matplotlib Styling - Legends -
Matplotlib Styling - Colors and Styles - Advanced Matplotlib Commands - Introduction to Seaborn -
Scatterplots with Seaborn - Distribution Plots - Part One - Understanding Plot Types - Distribution Plots -
Part Two - Coding with Seaborn - Categorical Plots - Statistics within Categories - Understanding Plot
Types - Categorical Plots - Statistics within Categories - Coding with Seaborn - Categorical Plots -
Distributions within Categories - Understanding Plot Types - Categorical Plots - Distributions within
Categories - Coding with Seaborn - Seaborn - Comparison Plots - Understanding the Plot Types - Seaborn
- Comparison Plots - Coding with Seaborn - Seaborn Grid Plots - Seaborn - Matrix Plots.
UNIT IV REGRESSION AND CLASSIFICATION 12 HOURS
Introduction to Linear Regression : Cost Functions - Gradient Descent - Python coding Simple - Overview
of Scikit-Learn and Python - Residual Plots - Model Deployment and Coefficient Interpretation -
Polynomial Regression - Theory and Motivation - Creating Polynomial Features - Training and Evaluation
- Bias Variance Trade-Off - Polynomial Regression - Choosing Degree of Polynomial - Model
Deployment - Feature Scaling; Introduction to Cross Validation : Regularization Data Setup - Ridge
Regression Theory - Lasso Regression - Background and Implementation - Elastic Net - Feature
Engineering and Data Preparation; Dealing with Outliers - Dealing with Missing Data - Evaluation of
Missing Data - Filling or Dropping data based on Rows - Fixing data based on Columns - Dealing with
Categorical Data - Encoding Options - Cross Validation - Test - Validation - Train Split - cross_val_score
- cross validate - Grid Search; Linear Regression Project: The Logistic Function - Logistic Regression -
Theory and Intuition; Linear to Logistic: Logistic Regression - Theory and Intuition - Linear to Logistic
Math; Logistic Regression: Theory and Intuition Logistic Regression Model Training - Classification
Metrics - Confusion Matrix and Accuracy - Classification Metrics - Precison, Recall, F1-Score - ROC
Curves - Logistic Regression with Scikit-Learn - Performance Evaluation - Multi-Class Classification with
Logistic Regression - Data and EDA – Model.
UNIT V UNSUPERVISED AND ADVANCED MACHINE LEARNING 12 HOURS
Introduction to KNN Section: KNN Classification, KNN Coding with Python - Choosing K, KNN
Classification Project Exercise; Introduction & history of Support Vector Machines- Hyperplanes and
Margins, Kernel Intuition, Kernel Trick and Mathematics; SVM with Scikit-Learn and Python –
Classification, Regression Tasks; Introduction to Tree Based Methods- Decision Tree, Understanding Gini
Impurity; Constructing Decision Trees with Gini Impurity, Coding Decision Trees; Introduction to
Random Forests-Key Hyperparameters, Number of Estimators and Features in Subsets, Bootstrapping and
Out-of-Bag Error; Coding Classification with Random Forest Classifier, Coding Regression with Random
Forest Regressor, Advanced Models. Introduction to K-Means Clustering Section; K-Means Color
Quantization; K-Means Clustering Exercise Overview, Solution; Introduction to Hierarchical Clustering,
Coding - Data and Visualization, Scikit-Learn; Introduction to Principal Component Analysis(PCA)-
Manual Implementation in Python-SciKit-Learn.
TOTAL: 60 HOURS

TEXT BOOKS:
1 Fuentes, A. (2018). Become a Python Data Analyst. Packet Publishing.
2 Motwani, B. (2020). Data Analytics using Python. Wiley.
3 Damji, J. S. (2020). Learning Spark: Lightning-Fast Data Analytics (2nd ed.). Shroff/O'Reilly.

REFERENCE BOOKS:
1 Barry, P. (2016). Head First Python (2nd ed .). O'Reilly Media.
2 McKinney, W. (2022). Python for Data Analysis: Data Wrangling with pandas, NumPy, and
Jupyter (3rd ed.). O'Reilly Media.
3 Lambert, K. A. (2019). Fundamentals of Python – First Programs (2nd ed.). Cengage Publication.

WEBSITES:
1 http://docs.python.org/3/tutorial/index.html
2 http://interactivepython.org/courselib/static/ pythons
3 http://www.ibiblio.org/g2swap/byteofpython/read/

CO, PO, PSO Mapping

PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - 3 2 - 2 - 1 2 1 - - - - - 1 -
CO2 3 - 3 - 3 2 - - 2 - - - - - - - -
CO3 3 - 3 3 2 2 - - 1 - 1 - - - - - 2
CO4 3 - 3 3 2 2 - - 2 - - - - - - - -
CO5 3 2 3 3 3 2 2 1 - - 1 - - - - - 2
Average 3 2 3 2.8 2.5 2 2 1 1.8 1 1 - - - - 1 2

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP102B APPLIED MACHINE LEARNING 4H-3C
Instruction Hours / Week: L:4 T:0 P:0 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:

 Probability and Statistics, Data Mining Concepts

COURSE OBJECTIVES (CO):


 To introduce students to the concepts and techniques of Machine Learning.
 To be able to formulate machine learning problems corresponding to different applications.
 To apply the algorithms to a real-world problem, optimize the models learned and report on the
expected accuracy that can be achieved by applying the models.

COURSE OUTCOMES (COs):


At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Know about Supervised Learning, Support Vector Understanding
Machines, Unsupervised Learning
CO2 Get the knowledge about Feature Engineering, Statistical Remembering
Data Analysis, Outlier Analysis and Detection
CO3 Learn about ML Model Development, Model Evaluation Evaluating
Techniques, Model Deployment and Inferences, Model
Explainability
CO4 Recognize the importance and value of Operations Applying
Research and mathematical modelling in solving practical
problems in industry
CO5 Define and formulate linear programming problems and Analyzing
appreciate their limitations

UNIT I SUPERVISED LEARNING 10 HOURS


Implement and understand the cost function and gradient descent for multiple linear regression -
Implement and understand methods for improving machine learning models by choosing the learning rate
- plotting the learning curve - performing feature engineering - applying polynomial regression -
Implement and understand the logistic regression model for classification -Learn why logistic regression is
better suited for classification tasks than the linear regression model is - Implement and understand the
cost function and gradient descent for logistic regression - Understand the problem of - overfitting -
improve model performance using regularization - Implement regularization to improve both regression
and classification models
UNIT II ADVANCED LEARNING ALGORITHMS 10 HOURS
Build a neural network for binary classification of handwritten digits using TensorFlow - Gain a deeper
understanding by implementing a neural network in Python from scratch - Optionally learn how neural
network computations are vectorized to use parallel processing for faster training and prediction - Build a
neural network to perform multi-class classification of handwritten digits in TensorFlow -using categorical
cross-entropy loss functions and the SoftMax activation - Learn where to use different activation functions
– ReLu - linear - sigmoid - SoftMax in a neural network - depending on the task you want your model to
perform - Use the advanced Adam optimizer to train your model more efficiently - Discover the value of
separating your data set into training - cross-validation -test sets - Choose from various versions of your
model using a cross-validation dataset -evaluate its ability to generalize to real- world data using a test
dataset - Use learning curves to determine if your model is experiencing high bias or high variance
UNIT III ADVANCED LEARNING ALGORITHMS 10 HOURS
Learn which techniques to apply regularization - adding more data - adding or removing input features to
improve your model’s performance - Learn how the bias-variance trade-off is different in the age of deep
learning - and apply Andrew Ng’s advice for handling bias and variance when training neural networks -
Learn to apply the iterative loop of machine learning development to train - evaluate - tune your model -
Apply data-centric AI to not only tune your model but tune your data using data synthesis or data
augmentation to improve your model’s performance - Build decision trees and tree ensembles - such as
random forest and XGBoost - boosted trees - to make predictions - Learn when to use neural network or
tree ensemble models for your task - as these are the two most commonly used supervised learning models
in practice today.
UNIT IV UNSUPERVISED LEARNING 9 HOURS
Use unsupervised learning techniques for unsupervised learning: including clustering and anomaly
detection - Build recommender systems with a collaborative filtering approach and a content-based deep
learning method - Build a deep reinforcement learning model - Implement K-mean clustering - Implement
anomaly detection - Learn how to choose between supervised learning or anomaly detection to solve
certain tasks.
UNIT V RECOMMENDER SYSTEMS 9 HOURS
Build a recommender system using collaborative filtering - Build a recommender system using a content-
based deep learning method - Build a deep reinforcement learning model (Deep Q Network)." -
Histograms - Box Plots etc - use of frequency distributions – mean comparisons - cross tabulation -
statistical inferences using chi square - t-test and ANOVA - Outlier Analysis and Detection - outlier
analysis - density based and distance based.
TOTAL: 48 HOURS
TEXT BOOKS:
1 Li, H. (2023). Machine Learning Methods. Springer Nature Singapore.
2 Rao, R. N. (2022). Machine Learning in Data Science Using Python. Dreamtech Press.
REFERENCE BOOKS:
1 Alpaydin, E. (2014). Introduction to Machine Learning (3rd ed., Adaptive Computation and
Machine Learning Series). MIT Press.
2 Aggarwal, C. C. (2018). Neural Networks and Deep Learning (1st Kindle ed.).
WEBSITES:
1 https://ai.google/education/
2 https://machinelearningmastery.com/
CO, PO, PSO Mapping

PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 2 1 - - - - 1 - - - - - - - - - 1
CO2 3 2 1 - - - - - - - - - - - - - 1
CO3 2 1 - - - - 1 - - - - - - - - - 1
CO4 2 1 - - - - - - - - - - - - - - 1
CO5 3 2 1 - - - 1 - - - - - - - - - 1
Average 2.4 1.4 1 0 0 0 1 0 0 0 0 0 0 0 0 0 1
1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation
MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP103B DATA ENGINEERING 4H-3C
Instruction Hours / Week: L:4 T:0 P:0 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:

 Database Concepts, Programming Skills, Data Analysis Concepts

COURSE OBJECTIVES (CO):


 To understand the fundamentals of data engineering and its importance in modern data-driven
applications.
 To representation of complex and voluminous data.
 To identify and design the various components of an Information Retrieval system
COURSE OUTCOMES (COS):
At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Identify and explain different data formats and their use cases, Applying
including structured, semi-structured, and unstructured data.
CO2 Describe various data ingestion techniques, such as ETL, and stream Understanding
processing, and their advantages and limitations.
CO3 Perform data profiling and analyze data quality metrics to ensure data Understanding
accuracy, completeness, and consistency.
CO4 Design and implement effective storage and retrieval methods for Evaluating
large-scale data sets, including relational databases, NoSQL
databases, and distributed file systems.
CO5 Apply data engineering principles to real-world scenarios, such as Applying
data warehousing, big data analytics, and machine learning.

UNIT I DATA TYPES & FORMATS 10 HOURS


Introduction to Data Types and Formats - Types of Data - Structured vs. Unstructured Data - Formats of
Data - Semi-Structured Data - Data Type Conversion and Transformation - Data Serialization - Choosing
the Right Data Type and Format - Tools and Technologies for Data Types and Formats.
UNIT II DATA INGESTION TECHNIQUES 10 HOURS
Introduction to Data Ingestion - Streaming Data Ingestion - Batch Data Ingestion - Hybrid Data Ingestion -
Data Ingestion vs. Data Integration - Data Ingestion Challenges - Tools and Solutions for Data Ingestion -
StreamSets DataOps Platform - Benefits of Data Ingestion - Data Ingestion Framework.
UNIT III DATA PROFILING & VISUAL REPRESENTATION VIA VARIOUS 10 HOURS
TOOLS (PANDAS)
Introduction to Data Profiling and Visualization - Exploratory Data Analysis (EDA) with Pandas - Steps
Involved in Exploratory Data Analysis (EDA) Data Analysis (EDA) with Pandas - Market Analysis with
Exploratory Data Analysis (EDA) - Data Analytics and Its Future Scope - Data Analytics with Python -
Top Business Intelligence Tools - Application of Data Analytics - Retrieving and Cleaning Data -
Exploratory Data Analysis and Feature Engineering - Inferential Statistics and Hypothesis Testing -
Descriptive Statistics - Types of Descriptive Statistics - Concepts of Populations, Samples, and Variables -
Statistical Methods for Describing Data Characteristics - Real-World Applications of Descriptive Statistics
using Excel - Types of Missing Data and Handling Techniques.
UNIT IV STORAGE AND RETRIEVAL METHODS 9 HOURS
Introduction to Storage and Retrieval - Types of Data and Storage Methods - Local vs. Distributed Storage
& Retrieval - Hardware Aspects of Storage & Retrieval - Choosing Storage Methods - Data Partitioning
and Sharding - Data Replication and Redundancy - Data Compression and Encoding - Data Archiving and
Retrieval - Backup and Disaster Recovery - Data Lifecycle Management.
UNIT V DATA LINEAGE ANALYSIS 9 HOURS
Introduction to Data Lineage Analysis - Building a Data Flow - ETL (Extract, Transform, Load) Process -
Usage of Data Warehouse - Edge Intelligence in Data Flow - Understanding Data Lineage - How Data
Lineage Works - Benefits of Data Lineage - Data Lineage Tool Features.
TOTAL: 48 HOURS
TEXT BOOKS:
1 Judd, C. M. (2017). Data Analysis: A Model Comparison Approach To Regression, ANOVA, and
Beyond (3rd ed.). Routledge.
2 Bonnefoy, P.-Y., Chaize, E., Mansuy, R., & Tazi, M. (2024). The Definitive Guide to Data
Integration (1st ed.). Packt Publishing.
REFERENCE BOOKS:
1 Baeza-Yates, R., & Ribeiro-Neto, B. (2011). Modern Information Retrieval: The Concepts and
Technology behind Search (2nd ed., ACM Press Books).
2 Reis, J., & Housley, M. (2022). Fundamentals of Data Engineering: Plan and Build Robust Data
Systems. Grayscale Indian Edition.
WEBSITES:
1 https://www.datacamp.com/tutorial/category/data-engineering
2 https://www.codecademy.com/catalog/subject/data-engineering
CO, PO, PSO Mapping

PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - 3 3 2 2 - 1 2 - 2 - - - - 2 -
CO2 3 - 3 2 2 2 - 1 2 1 - - - - - - -
CO3 3 - 3 2 2 2 - 1 2 - - - - 1 - - 3
CO4 3 - 2 2 2 2 - 1 2 - - - - - - - -
CO5 3 - 3 3 2 2 - 1 2 - - - - - - - 3
Average 3 - 2.8 2.4 2 2 - 1 2 1 2 - - 1 - 2 3

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP104B MATHEMATICAL FOUNDATION FOR DATA SCIENCE 4H-4C
Instruction Hours / Week: L:4 T:0 P:0 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:

 Algebra, Probability and Statistics, Programming Skills.

COURSE OBJECTIVES (CO):


 To refresh the statistical knowledge learnt earlier with hands-on practical expertise
 To understand and manipulate data in high-dimensional spaces.
 To model uncertainty, make inferences about populations from samples, and make predictions.
COURSE OUTCOMES (COS):
At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Refresh the mathematics knowledge with respect to Linear Remembering
algebra, Vectors, Projections, Principal Component Analysis and
Generative Models
CO2 Refresh the mathematics knowledge with respect to Matrices, Understanding
Gradient Calculus, Optimization models.
CO3 Refresh the mathematics knowledge with respect to probability, Applying
statistics.
CO4 Find information about the population on the basis of a random Evaluating
sample taken from that population and also to choose an
appropriate test procedure under the test of significance
CO5 Apply mathematical concepts to real-world data science Applying
problems.

UNIT I LINEAR ALGEBRA 10 HOURS


Systems of Linear Equations - Machine learning motivation - A geometric notion of singularity - Singular
vs non-singular matrices - Linear dependence and independence - Matrix row-reduction - Row operations
that preserve singularity - The rank of a matrix - Row echelon form - Reduced row echelon form- LU
decomposition- Solving Systems of Linear Equations - Machine learning motivation - Solving non-
singular systems of linear equations - Solving singular systems of linear equations - Solving systems of
equations with more variables - Gaussian elimination.
UNIT II PROBABILITY & STATISTICS 10 HOURS
Introduction to probability - Concept of probability: repeated random trials - Conditional probability and
independence - Random variables - Cumulative distribution function - Discrete random variables:
Binomial distribution - Probability mass function - Continuous random variables: Uniform distribution -
Continuous random variables: Gaussian distribution -Joint distributions - Marginal and conditional
distributions - Independence - covariance - Multivariate normal distribution - Sampling and point
estimates - Interval estimation -Confidence intervals – Confidence Interval for mean of population -
Biased vs Unbiased estimates-Maximum likelihood estimation - Intuition behind maximum likelihood
estimation - Hypothesis testing - Describing samples: sample proportion and sample mean - Two types of
errors - Test for proportion and means - Two sample inference for difference between groups.
UNIT III BAYESIAN STATISTICS & ITS APPLICATIONS IN VARIOUS 10 HOURS
FIELDS
Bayesian statistics and its applications in various fields - Bayesian Learning: Bayes theorem - maximum
likelihood and least squared error hypotheses – Naïve Bayes classifier- Bayesian belief networks- gradient
ascent training of Bayesian networks- learning the structure of Bayesian networks- the EM algorithm-
mixture of models- Markov models- hidden Markov models - Time series analysis and forecasting
techniques - Basic Properties of time-series data: Distribution and moments- Stationarity- Autocorrelation-
Heteroscedasticity- Normality- Survival Analysis.
UNIT IV NON-PARAMETRIC STATISTICS 9 HOURS
Non-parametric Statistics - Chi square test- Sign test -Wilcoxon signed rank test - Mann Whitney test -
Run test - Kolmogorov Smirnov test - Spearmann and Kendall’s test - Tolerance region.
UNIT V MULTIVARIATE STATISTICAL METHODS FOR ANALYZING 9 HOURS
COMPLEX DATASETS
Multivariate statistical methods for analysing complex datasets - Factor Analysis - Cluster Analysis-
Regression Analysis - Discriminant Analysis.
TOTAL: 48 HOURS
TEXT BOOKS:
1 Phillips, J. M. (2021). Mathematical Foundations for Data Analysis. Springer Series.
2 Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data
Mining, Inference, and Prediction (2nd ed.). Springer.
REFERENCE BOOKS:
1 Thompson, S. K. (2012). Sampling. John Wiley & Sons.
2 Montgomery, D. C. (2008). Introduction to Quality Control (6th ed.). John Wiley & Sons.
WEBSITES:
1 https://ibse.iitm.ac.in/course/math-foundations-of-ds/
2 https://medium.com/illumination/i-found-the-4-mathematical-foundations-that-are-essential-for-
data-science-ebe449aa30ce
CO, PO, PSO Mapping

PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - 2 2 3 2 - 3 3 - 1 - - - - 3 2
CO2 3 - 1 1 3 2 - 2 3 - - - - - - 2 1
CO3 3 - 1 - - - - - - - 2 - - - - - -
CO4 3 - 2 - 3 3 - 1 2 - 1 - - - - - -
CO5 3 - - - - - - - - - 2 - - - - - -
Average 3 - 1.5 1.5 3 2.3 - 2 2.7 - 1.5 - - - - 2.5 1.5

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP105PE2 GENERATIVE ADVERSARIAL NETWORKS 4H-3C
Instruction Hours / Week: L:4 T:0 P:0 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:
 Optimization Techniques, Digital Communication.
COURSE OBJECTIVES (CO):
 To learn the historical development and motivation behind GANs.
 To understand the unique characteristics and applications of various GAN variants.
 To analyze case studies to understand the impact of GANs in different fields.

COURSE OUTCOMES (COS):


At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Understand generative models such as generative adversarial Understanding
networks (GANs) and their advanced techniques.
CO2 Build sophisticated and robust GAN models using PyTorch & Applying
convolutional layers etc.,
CO3 Learn about the advantages and disadvantages of different Understanding
GAN performance measures.
CO4 Explore and examine the applications of GANs Evaluating
CO5 Identify potential areas for future research and innovation in Applying
GANs.

UNIT I BUILD BASIC GENERATIVE ADVERSARIAL NETWORKS (GANs) 9 HOURS


Overview of GenAI - Intro to GANs - Learn about GANs and their applications, understand the intuition
behind the basic components of GANs -build your very own GAN using PyTorch - Deep Convolutional
GAN - Build a more sophisticated GAN using convolutional layers - Learn about useful activation
functions - batch normalization - and transposed convolutions to tune your GAN architecture and apply
them to build an advanced DCGAN specifically for processing images - Wasserstein GANs with
Normalization - Reduce instances of GANs failure due to imbalances between the generator and
discriminator by learning advanced techniques such as WGANs to mitigate unstable training and mode
collapse with a W-Loss and an understanding of Lipschitz Continuity - Conditional and Controllable
GANs - Understand how to effectively control your GAN - modify the features in a generated image - and
build conditional GANs capable of generating examples from determined categories.
UNIT II BUILD BETTER GENERATIVE ADVERSARIAL NETWORKS (GANs) 9 HOURS
GAN Evaluation - Understand the challenges of evaluating GANs - learn about the advantages and
disadvantages of different GAN performance measures - and implement the Fréchet Inception Distance
FID method using embeddings to assess the accuracy of GANs -GAN Disadvantages and Bias - Find out
the disadvantages of GANs when compared to other generative models - discover the pros/cons of these
models
UNIT III: BUILD BETTER GENERATIVE ADVERSARIAL NETWORKS (GANs) 10 HOURS
Plus - learn about the many places where bias in machine learning can come from - why it’s important -
and an approach to identify it in GANs - StyleGAN and Advancements - Understand how StyleGAN
improves upon previous models and implements the components and the techniques associated with
StyleGAN - currently the most state-of-the-art GAN with powerful capabilities.
UNIT IV APPLY GENERATIVE ADVERSARIAL NETWORKS (GANs) 10 HOURS
GANs for Data Augmentation and Privacy Preservation - Explore the applications of GANs and examine
them wrt data augmentation, privacy, and anonymity Improve your downstream AI models with GAN-
generated data - Image-to-Image Translation - Leverage the image-to-image translation framework and
identify extensions – generalizations - applications of this framework to modalities beyond images.
UNIT V APPLY GENERATIVE ADVERSARIAL NETWORKS (GANs) 10 HOURS
Implement Pix2Pix - a paired image-to-image translation GAN - to adapt satellite images to map routes
with advanced U-Net generator -Patch GAN discriminator architectures - Image-to-Image Unpaired
Translation - Compare paired image-to-image translation to unpaired image-to-image translation and
identify how their key difference necessitates different GAN architectures - Implement Cycle GAN- an
unpaired image-to-image translation model, to adapt horses to zebras with two GANs in one.
TOTAL: 48 HOURS
TEXT BOOKS:
1 Langr, J., & Bok, V. (2019). GANs in Action: Deep learning with Generative Adversarial
Networks. Manning.
2 Hany, J. (2019). Hands-On Generative Adversarial Networks with PyTorch 1.x. Packt Publishing.
REFERENCE BOOKS:
1 Kang, S., & Bali, R. (2019). Generative Adversarial Networks: A Practical Guide.
2 Kalin, J. (2021). Generative Adversarial Networks Cookbook: Over 100 Recipes to Build and
Train GANs Using Python, TensorFlow, and Keras.
WEBSITES:
1 https://machinelearningmastery.com/what-are-generative-adversarial-networks-gans/
2 https://www.coursera.org/specializations/generative-adversarial-networks-gans
CO, PO, PSO Mapping
PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - 3 3 1 1 - - 2 - 1 - - - - 3 2
CO2 3 - 3 3 2 2 - 2 2 - - - - - - 2 1
CO3 3 - 3 2 2 2 - 1 3 - - - - - - 3 3
CO4 3 - 3 3 3 3 - 1 2 - - - - - - - -
CO5 3 - 3 3 1 1 - 1 2 - 1 - - - - 3 3
Average 3 - 3 2.8 1.8 1.8 - 1.3 2.2 - 1 - - - - 2.8 2.3

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP106 PROFESSIONAL SOFT SKILLS - I 3H-1C
Instruction Hours / Week: L:3 T:0 P:0 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:
 Not Required

COURSE OBJECTIVES (CO):


 To improve clarity and conciseness in verbal and written communication.
 To enhance ability to adapt to changing circumstances and new challenges.
 To promote a respectful and supportive workplace environment.

COURSE OUTCOMES (COS):


At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Understand and implement positive outlook, interpret the body Understanding
language of team members and stakeholders, better interpersonal
relationships. Develop into self-motivated professionals with
confidence. Practice Responding instead of Reacting.
CO2 Create good Presentation and Present with confidence. Also, Creating
recognize and manage Stress, Prioritize and Plan.
CO3 Listen to understand. To be able to ask good questions. Understanding
CO4 Understand to be a good Team player, Team Dynamics and to Applying
understand the Business Ethics
CO5 Write and speak correctly, forming grammatically correct sentences. Applying

UNIT I POSITIVE ATTITUDE 7 HOURS


Attitude- Campus to Corporate attitude change, Recognizing Negative Attitude, Campus to Corporate
attitude change; Attitude at work- Impact of Negative Attitude in the Workplace, Overcoming Negative
Attitude, positive attitude, thought process, Building self-confidence and Assertiveness; Toxic positivity;
3Es, Motivation-Intrinsic and Extrinsic Motivation, Inspiration vs motivation; Emotional Intelligence-
Intro to EI, Four clusters. Transactional Analysis (TA), SWOT analysis - Professional analysis.
UNIT II: BODY LANGUAGE 7 HOURS
Importance of Body Language, Five Cs of Body Language, Body language in different cultures, Positive
Body Language; Voice Control- Pace. Pause and Pitch; Culture-Inclusivity and Proxemics across Global
Cultures, Understanding POSH; Stress Management-What is Stress, Eustress, Reasons of stress (work/
personal); Stress Management Techniques
UNIT III PRESENTATION SKILLS 7 HOURS
Self-introduction – Exercises, Why Give Presentations; Craft your message-Plan the visuals, Manage the
Response; How to create an effective presentation - Virtual & Physical, Do’s & Don'ts of Presentation
Skills, Objection handling, Stage Fear – Causes and Cure, Practice the Delivery; Time Management-
Common Time & Energy Wasters, Planning & Prioritizing Time Matrix & Analysis

UNIT IV LISTENING & QUESTIONING SKILLS 7 HOURS


Barriers to effective listening - how to overcome them; Exercises - Customer Call Flow – Role-play, Cust
calls amongst the team; How to frame Questions, Different kinds of questions, asking appropriate
questions; Spoken English-Introduction to Parts of Speech and its usage; Subject - Verb Agreement; Basic
conversation skills-sentence construction -SVO
UNIT V TEAMWORK 8 HOURS
Teamwork and Ethics - Definition of TEAM - Team vs Groups. Difference b/w Healthy competition and
cut throat competition, Importance of working in teams, Evolution of a TEAM, Benefits of team work;
Virtual teams- Challenges and ways to overcome it, Diversity and Inclusion in a team; Development of
Teams Stages of team development; Team dynamics-its importance & Interpersonal Skills Development
Ethics- to enable students to identify and deal with ethical problems, develop their moral intuitions, which
are implicit in everyday choices and actions; Conflict Management: Team building Activities-
Predetermined/ Predesigned Indoor/ Outdoor activities to build a team, enhance language and inter
personal skills
TOTAL: 36 HOURS
TEXT BOOKS:
1 Kumar, S., & PushpLata. (2015). Communication Skills (2nd ed.). New Delhi: Oxford University
Press.
2 Murphy, R. (2012). Essential English Grammar: Reference and Practice for South Asian Students
(2nd ed.). Cambridge: Cambridge University Press.
REFERENCE BOOKS:
1 Pye, G. (2011). Vocabulary in Practice, Parts 1 and 2 (1st ed.). Cambridge: Cambridge University
Press.
WEBSITES:
1 https://www.forbes.com/advisor/in/business/soft-skills-examples/
2 https://www.thebalancemoney.com/list-of-soft-skills-2063770
CO, PO, PSO Mapping

PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - 1 - 1 1 - - - 1 2 - - - - 1 -
CO2 3 - 1 - - - - - - - - - 1 - - - 2
CO3 3 - 2 - - - - 2 2 - - - 1 - - - -
CO4 3 - 1 - - 1 - 2 2 - - - - - - - -
CO5 3 - 1 1 1 - - 1 2 - - - 1 - - - -
Average 3 - 1.2 1 1 1 - 1.7 2 1 2 - 1 - - 1 2

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP111B DATA SCIENCE - PRACTICAL 5H-2C
Instruction Hours / Week: L:0 T:0 P:5 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:
 Probability and Statistics, Programming Skills, Python Libraries, Learning Concepts
COURSE OBJECTIVES (CO):
 To learn techniques for handling missing data, outliers, and data imputation.
 To build a portfolio of projects demonstrating your proficiency and innovation in data science.
 To evaluate model performance using appropriate metrics (e.g., accuracy, precision, recall).
COURSE OUTCOMES (COS):
At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Achieve proficiency in cleaning and preprocessing diverse datasets,
Applying
ensuring data integrity and quality.
CO2 Apply feature engineering techniques to extract relevant features and
Applying
improve model performance.
CO3 Generate visualizations and summary statistics that provide
Understanding
meaningful insights into data characteristics.
CO4 Develop and implement machine learning models for predictive tasks
Applying
(e.g., regression, classification).
CO5 Implement advanced machine learning techniques such as ensemble
methods (e.g., random forests, gradient boosting) and deep learning Applying
for complex data problems.

LIST OF PROGRAMS (CASE STUDIES) 60 HOURS


1 Present your view on the different techniques you have employed to do outlier analysis, handling
missing data, feature engineering, feature importance and improving the accuracy of the model
both from a classifier as well as a regressor. Use any sample data and present your POV in a well-
structured presentation.
2 Present your findings on different activation functions you have used and methods to improve the
accuracy of the model using neural networks. You should be able to clearly articulate the
advantage and disadvantage of each activation function. Use any sample data and present your
POV in a well-structured presentation.
3 Present your findings on different techniques of anomaly detection and k means clustering. Use
any sample data and present your POV in a well-structured presentation
4 Present your POV on how to generate synthetic data using GANs. You can assume a sample
dataset from an IOT enabled machine where the failure rates are minimal.
5 Present your POV on Style related GANS. Explore the earliest models to the current models.
Articulate the successive improvements in the models. Also articulate the future of GANs in
generating realistic images.
6 Present your POV on GANs used for Deep Fakes. Articulate how we can identify the Deep Fake
from the original.
TOTAL: 60 HOURS

TEXT BOOKS:
1 Fuentes, A. (2018). Become a Python Data Analyst. Packt Publishing.
2 Motwani, B. (2020). Data Analytics using Python. Wiley.
3 Damji, J. S. (2020). Learning Spark: Lightning-Fast Data Analytics (2nd ed.). Shroff/O'Reilly.

REFERENCE BOOKS:
1 Barry, P. (2016). Head First Python (2nd ed.). O’Reilly Media.
2 McKinney, W. (2022). Python for Data Analysis: Data Wrangling with pandas, NumPy, and
Jupyter (3rd ed.). O’Reilly Media.
3 Lambert, K. A. (2019). Fundamentals of Python – First Programs (2nd ed.). Cengage
Publications.
WEBSITES:
1 http://docs.python.org/3/tutorial/index.html
2 http://interactivepython.org/courselib/static/ pythons
3 http://www.ibiblio.org/g2swap/byteofpython/read/

CO, PO, PSO Mapping

PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - 1 - - 1 - - - 1 - - - - - 3 -
CO2 3 - - - 1 - - - 1 - - - - - - - -
CO3 3 - 2 3 3 2 - 1 3 - - - - - - - 2
CO4 3 - - - 1 - - - 1 - - - 2 2 - - -
CO5 3 - 1 2 - 1 - 1 - - 2 - - 3 - - -
Average 3 - 1.3 2.5 1.7 1.3 - 1 1.7 1 2 - 2 2.5 - 3 2

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
24CAP112B APPLIED MACHINE LEARNING - PRACTICAL 4H-2C
Instruction Hours / Week: L:0 T:0 P:4 Marks: Internal:40 External:60 Total:100
End Semester Exam:3Hours
PREREQUISITE:
 Probability and Statistics, Programming Skills, Python Libraries, Learning Concepts
COURSE OBJECTIVES (CO):
 To deploy a machine learning model into a production environment, ensuring it meets
performance and scalability requirements.
 To implement automated decision-making processes based on machine learning predictions,
reducing reliance on manual interventions.
 To achieve higher accuracy and reliability in predictions compared to baseline or existing
methods, validated through rigorous testing and evaluation.
COURSE OUTCOMES (COS):
At the end of this course, students will be able to

COs Course Outcomes Blooms Level


CO1 Practice translating business requirements into well-defined machine
Applying
learning tasks (e.g., classification, regression, clustering).
CO2 Handle missing data, outliers, and data normalization effectively to
Analyzing
improve model performance.
CO3 Implement feature engineering techniques to create informative
Understanding
features from raw data.
CO4 Evaluate and compare different machine learning algorithms suitable
Evaluating
for the problem at hand.
CO5 Engage in continuous learning through projects, online courses, and
Evaluating
participation in machine learning communities.

LIST OF PROGRAMS 48 HOURS


1 Understanding "Mobile Price" dataset by doing feature analysis. Data is available at:
https://www.kaggle.com/datasets/iabhishekofficial/mobile-price-classification/data
2 Execute data preprocessing step on the above dataset: perform outlier and missing data analysis
towards building a refined dataset
3 Build machine learning model/s to predict the actual price of the new mobile based on other given
features like RAM, Internal Memory etc
4 Calculate the prediction accuracy of the models used in Experiment 3 and do comparative analysis
among them to identify the best technique.
5 Understanding "Second Hand Car Prediction Price" dataset by doing feature analysis. Data is
available at: https://www.kaggle.com/datasets/sujithmandala/second-hand-car-price-prediction
6 Perform data preprocessing step on the above dataset: perform outlier and missing data analysis
towards building a refined dataset.
7 Perform Feature Engineering towards building new feature which is more impactful.
Build machine learning model/s to predict the price of the car based on other given features like
Brand, Model, Year, Fuel Type etc
8 Calculate the prediction accuracy of the models used in Experiment 7 and do comparative analysis
among them to identify the best technique.
9 Plot the features (actual price and predicted price) in scatter plot to understand the variation.
10 Understanding "Marketing Campaign Positive Response Prediction" dataset by analysing all the
features. Data is available at: https://www.kaggle.com/datasets/sujithmandala/marketing-
campaign-positive-response-prediction
11 Perform exploratory data analysis on the above dataset: perform outlier and missing data analysis
towards building a refined dataset. Show the outliers in box plot or through some statistical
technique. Find the numerical and categorial features.
12 Perform Feature Engineering towards building new feature which is more impactful than the
existing ones. Build the correlation matrix and show visually the relationship among various
features.
13 Build machine learning model/s to predict the result of marketing campaign based on other given
features like customer details, gender, annual income etc
14 Calculate the prediction accuracy of the models used in Experiment 13 and do comparative
analysis among them to identify the best technique.
15 Please check whether you find imbalanced classes, overfitting, and data bias in the above two
datasets. Please apply some technique to overcome it.
TOTAL: 48 HOURS
TEXT BOOKS:
1 Li, H. (2023). Machine Learning Methods. Springer Nature Singapore.
2 Rao, R. N. (2022). Machine Learning in Data Science Using Python. Dreamtech Press.
REFERENCE BOOKS:
1 Alpaydin, E. (2014). Introduction to Machine Learning (3rd ed., Adaptive Computation and
Machine Learning Series). MIT Press.
2 Aggarwal, C. C. (2018). Neural Networks and Deep Learning (1st Kindle ed.).
WEBSITES:
1 https://ai.google/education/
2 https://machinelearningmastery.com/

CO, PO, PSO Mapping


PO PO PO PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
CO
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1 2
CO1 3 - - - - - - - 2 2 2 - - - - 3 -
CO2 3 - 3 3 3 2 - 1 3 - - - - - - - 2
CO3 3 - 3 3 3 2 - 1 3 - - - - 2 - - -
CO4 3 - 3 3 3 2 - 1 3 - - - 2 - - - 1
CO5 3 - 2 2 2 1 1 1 2 - - - - - - - -
Average 3 - 2.8 2.8 2.8 1.8 1 1 2.6 2 2 - 2 2 - 3 1.5

1 - Low, 2 - Medium, 3 - High, ‘-' - No Correlation


MASTER OF COMPUTER APPLICATIONS 2024-2025
SEMESTER I
JOURNAL PAPER ANALYSIS & PRESENTATION 2H-0C
Instruction Hours / Week: L:2 T:0 P:0 Marks: Internal:00 External:00 Total:000
End Semester Exam:3Hours

You might also like