Introduction To Machine Learning
Introduction To Machine Learning
1. RATIONALE
Machine learning focuses on the use of data and algorithms to perform learning similar to the
way human learns. To solve recent problems in Computer domain it is important to
understand the need of machine learning and apply machine learning methods in efficient
ways. Every student of Computer Engineering must therefore understand the blue prints of
machine learning approaches and must be able to apply learning methods on available
datasets. This course will help students to build up core competencies in understanding
machine learning approaches and students will be able to design and train machine learning
modes for various use cases.
2. COMPETENCY
Students acquire the ability to assess and analyze outcomes produced by machine learning
algorithms and models, while enhancing their capacity for critical thinking in addressing
practical challenges.
The practical exercises, the underpinning knowledge, and the relevant soft skills associated
with this competency are to be developed in the student to display the following COs:
3 - 4 5 30 70 25 25 150
(*): Out of 30 marks under the theory CA, 10 marks are for assessment of the micro-project to
facilitate integration of COs and the remaining 20 marks is the average of 2 tests to be taken
during the semester for the assessing the attainment of the cognitive domain UOs required for
the attainment of the COs.
Legends: L-Lecture; T – Tutorial/Teacher Guided Theory Practice; P -Practical; C – Credit, CA -
Continuous Assessment; ESE -End Semester Examination.
Approx.
S. Unit
Practical Outcomes (PrOs) Hrs.
No. No.
required
Write a Pandas program to create a line plot of the opening, closing stock
8 II 4
prices of given company between two specific dates.
Write a Pandas program to filter all columns where all entries present,
11 check which rows and columns has a NaN and finally drop rows with any III 4
NaNs from the given dataset.
Total 56
Note
i. More Practical Exercises can be designed and offered by the respective course teacher to
develop the industry relevant skills/outcomes to match the COs. The above table is only a
suggestive list.
ii. The following are some sample ‘Process’ and ‘Product’ related skills (more may be
added/deleted depending on the course) that occur in the above listed Practical Exercises
of this course required which are embedded in the COs and ultimately the competency.
Total 100
S. PrO. No.
Equipment Name with Broad Specifications
No.
The ADOs are best developed through the laboratory/field-based exercises. Moreover, the
level of achievement of the ADOs according to Krathwohl’s ‘Affective Domain Taxonomy’
should gradually increase as planned below:
i. ‘Valuing Level’ in 1st year
ii. ‘Organization Level’ in 2nd year.
iii. ‘Characterization Level’ in 3rd year.
8. UNDERPINNING THEORY
Only the major Underpinning Theory is formulated as higher-level UOs of Revised Bloom’s
taxonomy in order development of the COs and competency is not missed out by the students
and teachers. If required, more such higher-level UOs could be included by the course teacher
to focus on the attainment of COs and competency.
Unit Unit Outcomes (UOs) Topics and Sub-topics
Unit – I 1.1 Describe basic concept of 1.1.1 Overview of Human Learning and
Introduction machine learning and its Machine Learning
to machine applications 1.1.2 Types of Machine Learning
learning Supervised Machine Learning
Unsupervised Machine Learning
Reinforcement Learning.
1.1.3 Applications of Machine Learning
1.1.4 Tools and Technology for
Machine Learning
Unit – II 2.1. Develop Program using 2.1 Numpy
Python Python Libraries Creating Array: array()
libraries Accessing Array: by referring to its
suitable for index number
Machine Stacking & Splitting: stack(),
Learning array_split()
Maths Functions: add(), subtract(),
multiply(), divide(), power(), mod()
Statistics Functions: amin(), amax(),
mean(), median(), std(), var(),
average(), ptp()
2.2 Pandas
Series: Series()
Dataframes: DataFrames()
Read CSV File: read_csv()
Cleaning Empty Cells: dropna()
Cleaning Wrong Data: drop()
Removing Duplicates: duplicated()
Pandas Plotting: plot()
2.3 Matplotlib
Pyplot.plot: plot()
Show: show()
Labels: xlabel(), ylabel()
Grid: grid()
Bars: bar()
Histogram: hist()
Subplot: subplot()
pie chart: pie()
Save the plotted images into pdf:
savefig()
2.4 sklearn
Key concepts and features
Steps to Build a Model in Sklearn:
Loading a Dataset- read_csv(),
train_test_split- train_test_split()
Unit – III 3.1 Describe different types of 3.1.1 Machine Learning activities
Preparing to Machine learning Activities Preparing to Model
Model and 3.2 Explain Data preprocessing Learning: Data Partition-k-fold
Preprocessing cross validation, Model Selection
Performance Evaluation: confusion
matrix
Performance Improvement:
Ensemble
3.2.1 Types of Data
Qualitative/Categorical Data:
Nominal, Ordinal
Quantitative/Numeric Data:
Interval, Ratio
3.2.2 Data quality and remediation
Handling outliers
Handling missing values
3.2.3 Data Pre-Processing
Dimensionality reduction
Feature subset selection: Filter,
Wrapper, Hybrid, Embedded
Unit– IV 4.1 Define Supervised Learning 4.1.1 Introduction of Supervised
Supervised 4.2 List types of Supervised Learning
Machine Learning, Describe K-Nearest Brief explanation of Supervised
Learning Neighbour and Simple linear Machine Learning
Models regression Working of Supervised Machine
4.3 Advantage and disadvantage learning
of supervised machine learning Real world Applications/Examples
of Supervised Machine learning
Steps in Supervised Machine
learning
4.2.1 Types of Supervised Learning
● Classification: Define Classification,
list types of classification, list types
of Machine learning classification
algorithms (list linear models,
nonlinear models), list use cases of
classification algorithms. K-Nearest
Neighbour (K-NN) : Working of K-
NN, Need of KNN algorithm, steps
of working of K-NN, Select value of
K, advantage and disadvantage of
K-NN algorithm
● Regression: Define Regression
analysis, list types of regression
analysis, list real world examples of
regression analysis Linear
regression: List types of linear
regression, mathematical equation
of linear regression, diagram of
Note: The UOs need to be formulated at the ‘Application Level’ and above of Revised Bloom’s
Taxonomy’ to accelerate the attainment of the COs and the competency.
Total 42 23 25 22 70
Legends: R=Remember, U=Understand, A=Apply and above (Revised Bloom’s taxonomy)
Note: This specification table provides general guidelines to assist students for their learning
and to teachers to teach and question paper designers/setters to formulate test
items/questions assess the attainment of the UOs. The actual distribution of marks at different
taxonomy levels (of R, U and A) in the question paper may vary slightly from the above table.
undertaken to build up the skill and confidence in every student to become problem solver so
that s/he contributes to the projects of the industry. In special situations where groups have
to be formed for micro-projects, the number of students in the group should not exceed three.
The micro-project could be industry application based, internet-based, workshop-
based, laboratory-based or field-based. Each micro-project should encompass two or more
COs which are in fact, an integration of PrOs, UOs and ADOs. Each student will have to
maintain a dated work diary consisting of individual contributions in the project work and give
a seminar presentation of it before submission. The total duration of the micro-project should
not be less than 16 (sixteen) student engagement hours during the course. The student ought
to submit a micro-project by the end of the semester to develop the industry-oriented COs.
A suggestive list of micro-projects is given here. This has to match the competency and
the COs. Similar micro-projects could be added by the concerned course teacher:
● Project idea 1: Breast Cancer Prediction: This machine learning project uses a dataset
that can help determine the likelihood that a breast tumor is malignant or benign. You
can build a classification model for this project.
● Project idea 2: Loan Prediction: The idea behind this ML project is to build a model
that will classify how much loan the user can take. It is based on the user’s marital
status, education, number of dependents, and employments. You can build a linear
model for this project.
● Project idea 3: Stock Price Prediction: This machine learning beginner’s project aims
to predict the future price of the stock market based on the previous year’s data.
● Project idea 4: Phishing: Create a Python program that can predict if a URL is
legitimate. Teach a computer to recognize phishing web links.
● Project idea 5: Titanic survived Prediction: This will be a fun project to build as you will
be predicting whether someone would have survived if they were in the titanic ship or
not.
● Project idea 6: BigMart Sales Prediction: BigMart sales dataset consists of 2013 sales
data for 1559 products across 10 different outlets in different cities. The goal of the
BigMart sales prediction ML project is to build a regression model to predict the sales
of each of 1559 products for the following year in each of the 10 different BigMart
outlets.
● Project idea 7: Wine Quality Test: It will use the chemical information of the wine and
based on the machine learning model, it will give you the result of wine quality.
● Project idea 8: Data from leading music service can be taken to build a better music
recommendation system.
2 Machine Learning Using Python Pradhan Manaranjan, Wiley India Pvt. Ltd
U Dinesh Kumar
6 Machine Learning with Python Cookbook_ Chris Albon O’Reilly Media, Inc.
Practical Solutions from Preprocessing to
Deep Learning.
Competency
Students acquire the ability to assess and analyze outcomes produced by machine learning algorithms and models, while
enhancing their capacity for critical thinking in addressing practical challenges.
Course Outcomes
Legend: ‘3’ for high, ‘2’ for medium, ‘1’ for low or ‘-’ for the relevant correlation of each competency, CO, with PO/ PSO
Sr. Contact
Name and Designation Institute Email
No. No.