ML Lab Manual-IT

The document is a Machine Learning Lab Manual for III B.Tech II Semester students at Marri Laxman Reddy Institute of Technology and Management, detailing course objectives, outcomes, and a series of experiments to be conducted using Python. It includes guidelines for lab conduct, institutional vision and mission statements, and a list of experiments covering various machine learning algorithms. The manual is prepared by Mrs. Karimunnisa Shaik, Assistant Professor in the Department of Information Technology.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

MACHINE LEARNING
LAB MANUAL

(Subject Code: 2070518)

B.Tech, III Year, II Semester
Regulation: R22
A.Y: 2024-25

Prepared by:
MRS. KARIMUNNISA SHAIK,
ASSISTANT PROFESSOR

Department of INFORMATION TECHNOLOGY


MARRI LAXMAN REDDY INSTITUTE OF TECHNOLOGY AND MANAGEMENT
Hyderabad-500043
CERTIFICATE

This is to certify that this manual is a bonafide record of practical work in
“MACHINE LEARNING” in the III B.Tech II Sem (IT) program during the academic year
2024-25. This manual is prepared by Mrs. Karimunnisa Shaik, Assistant Professor, IT.
PREFACE

This Lab Manual entitled “Machine Learning Lab” is intended for the use of III B.Tech II Semester
Information Technology students of Marri Laxman Reddy Institute of Technology and Management,
Dundigal, Hyderabad. The main objective of the Machine Learning Lab manual is to introduce the
concepts of machine learning, the design principles of machine learning algorithms, and their
implementation in Python.

By
Karimunnisa Shaik
Department of IT
ACKNOWLEDGEMENT

It was really a good experience working on the “MACHINE LEARNING LAB”. First,
we would like to thank Dr. M. Nagalakshmi, Professor and HOD, Department of Information
Technology, Marri Laxman Reddy Institute of Technology & Management, for her
concern and for giving technical support in preparing the document. We are deeply indebted
and gratefully acknowledge the constant support and valuable patronage of Dr. P. SRIDHAR,
Director, Marri Laxman Reddy Institute of Technology & Management, for giving us this
wonderful opportunity to prepare the “MACHINE LEARNING LAB” laboratory manual.
We express our hearty thanks to Dr. R. Murali Prasad, Principal, Marri Laxman Reddy
Institute of Technology & Management, for timely corrections and scholarly guidance. Last
but not least, we would like to thank the entire IT faculty, who inspired and helped us
to achieve our goal.

By
KARIMUNNISA SHAIK
Assistant Professor
GENERAL INSTRUCTIONS

1. Students are instructed to come to Machine Learning laboratory on time. Late comers are not
entertained in the lab.

2. Students should be punctual to the lab. If not, the conducted experiments will not be repeated.

3. Students are expected to come prepared, having studied at home the experiments that are going to be
performed.

4. Students are instructed to display their identity cards before entering the lab.

5. Students are instructed not to bring mobile phones to the lab.

6. Any damage or loss of system parts such as the keyboard or mouse during the lab session is the
student’s responsibility, and a penalty or fine will be collected from the student.

7. Students should update their records and lab observation books session-wise. Before leaving the lab,
the student should get his/her lab observation book signed by the faculty.

8. Students should submit the lab records by the next lab to the concerned faculty members in the staff
room for their correction and return.

9. Students should not move around the lab during the lab session.

10. If any emergency arises, the student should take permission from the faculty member
concerned in written format.

11. The faculty members may suspend any student from the lab session on disciplinary grounds.

12. Never copy the output from other students. Write down your own outputs.
INSTITUTION VISION AND MISSION

VISION
To be an ideal academic institution by graduating talented engineers who are ethically strong and
competent, with quality research and technologies.

MISSION
To fulfill the promised vision through the following strategic characteristics and aspirations:
 Utilize rigorous educational experiences to produce talented engineers.
 Create an atmosphere that facilitates the success of students.
 Offer programs that integrate global awareness, communication skills and
leadership qualities.
 Build education and research partnerships with institutions and industries to
prepare the students for interdisciplinary research.
DEPARTMENT VISION AND MISSION

VISION
To empower the students to be technologically adept, innovative, self-motivated and responsible
global citizens possessing human values, and to contribute significantly towards high-quality
technical education in an ever-changing world.
MISSION

 To offer high-quality education in the computing fields by providing an environment
where knowledge is gained and applied to participate in research, for both students
and faculty.
 To develop problem-solving skills in the students so they are ready to deal with the
cutting-edge technologies of the industry.
 To make the students and faculty excel in their professional fields by inculcating
communication skills, leadership skills and team-building skills through the organization
of various co-curricular and extra-curricular programs.
 To provide the students with theoretical and applied knowledge, and adopt an
education approach that promotes lifelong learning and ethical growth.
PROGRAMME EDUCATIONAL OBJECTIVES

 Learn and Integrate: Graduates shall apply knowledge to solve computer science
and allied engineering problems with continuous learning.
 Think and Create: Graduates are inculcated with a passion towards higher education
and research with social responsibility.
 Communicate and Organize: Graduates shall pursue careers in industry, empowered
with professional and interpersonal skills.

PROGRAM SPECIFIC OUTCOMES

PSO1: Applications of Computing: Ability to use knowledge in various domains to provide
solutions to new ideas and innovations.
PSO2: Programming Skills: Identify required data structures, design suitable algorithms,
and develop and maintain software for real-world problems.
PROGRAMME OUTCOMES

The Program Outcomes (POs) of the department are defined so that the Graduate
Attributes are included. The Program Outcomes (POs) of the department are as stated below:

a: An ability to apply knowledge of Science, Mathematics, Engineering & Computing
fundamentals for the solutions of complex engineering problems.
b: An ability to identify, formulate, research literature and analyze complex engineering
problems using first principles of mathematics and engineering sciences.
c: An ability to design solutions to a complex process or program to meet desired needs.
d: Ability to use research-based knowledge and research methods, including design of
experiments, to provide valid conclusions.
e: An ability to use appropriate techniques, skills and tools necessary for computing practice.
f: Ability to apply reasoning informed by contextual knowledge to assess social issues,
consequences & responsibilities relevant to professional engineering practice.
g: Ability to understand the impact of engineering solutions in a global, economic,
environmental, and societal context with sustainability.
h: An understanding of professional, ethical and social issues and responsibilities.
i: An ability to function as an individual, and as a member or leader in diverse teams and
in multidisciplinary settings.
j: An ability to communicate effectively on complex engineering activities within the
engineering community.
k: Ability to demonstrate an understanding of the engineering and management principles
as a member and leader in a team.
l: Ability to engage in independent and lifelong learning in the context of technological
change.
MACHINE LEARNING LAB

COURSE STRUCTURE, OBJECTIVES & OUTCOMES

COURSE STRUCTURE:

Laboratory subjects – internal and external evaluation – details of marks: The “MACHINE
LEARNING” lab will have continuous evaluation during the semester for 40 sessional marks
and 60 end semester examination marks. Out of the 40 marks for internal evaluation, day-to-day
work in the laboratory shall be evaluated for 20 marks, and an internal practical examination,
conducted by the laboratory teacher concerned, shall be evaluated for 20 marks. The end semester
examination shall be evaluated for a maximum of 60 marks and shall be conducted with an external
examiner and an internal examiner. The external examiner shall be appointed by the
Principal / Chief Controller of Examinations.

COURSE OBJECTIVES:

 To get an overview of the various Machine Learning techniques and be able to
demonstrate them using Python.

COURSE OUTCOMES:

 Understand the complexity of Machine Learning algorithms and their limitations;
 Understand modern notions in data-analysis-oriented computing;
 Confidently apply common Machine Learning algorithms in practice and
implement their own;
 Conduct experiments in Machine Learning using real-world data.

2070584: MACHINE LEARNING LAB


Course Objectives:
 To get an overview of the various Machine Learning techniques and be able to demonstrate
them using Python.
Course Outcomes:
The students will be able to:
 Understand the complexity of Machine Learning algorithms and their limitations;
 Understand modern notions in data-analysis-oriented computing;
 Confidently apply common Machine Learning algorithms in practice and implement their own;
 Conduct experiments in Machine Learning using real-world data.
List of Experiments
1. The probability that it is Friday and that a student is absent is 3%. Since there are 5 school days in a
week, the probability that it is Friday is 20%. What is the probability that a student is absent given
that today is Friday? Apply Bayes' rule in Python to get the result. (Ans: 15%)
2. Extract the data from a database using Python.
3. Implement the Find-S algorithm using Python.
4. Implement the Candidate-Elimination algorithm using Python.
5. Implement the Decision-Tree learning algorithm using Python.
6. Implement k-nearest neighbors classification using Python.
7. Given the following data, which specify classifications for nine combinations of VAR1 and
VAR2, predict a classification for a case where VAR1=0.906 and VAR2=0.606, using the result of
k-means clustering with 3 means (i.e., 3 centroids).
VAR1 VAR2 CLASS
1.713 1.586 0
0.180 1.786 1
0.353 1.240 1
0.940 1.566 0
1.486 0.759 1
1.266 1.106 0
1.540 0.419 1
0.459 1.799 1
0.773 0.186 1
8. The following training examples map descriptions of individuals onto high, medium and
low creditworthiness.
medium skiing design single twenties no -> highRisk
high golf trading married forties yes -> lowRisk
low speedway transport married thirties yes -> medRisk
medium football banking single thirties yes -> lowRisk
high flying media married fifties yes -> highRisk
low football security single twenties no -> medRisk
medium golf media single thirties yes -> medRisk
medium golf transport married forties yes -> lowRisk
high skiing banking single thirties yes -> highRisk
low golf unemployed married forties yes -> highRisk
Input attributes are (from left to right) income, recreation, job, status, age-group, home-owner. Find the
unconditional probability of `golf' and the conditional probability of `single' given `medRisk' in the
dataset.
9. Implement linear regression using Python.
10. Implement the Naïve Bayes theorem to classify English text.
11. Implement an algorithm to demonstrate the significance of the genetic algorithm.
12. Implement a finite words classification system using the back-propagation algorithm.
TEXT BOOKS:
1. Machine Learning – Tom M. Mitchell, - MGH
REFERENCES:
1. Machine Learning: An Algorithmic Perspective, Stephen Marshland, Taylor & Francis

EXPERIMENT -1

AIM: The probability that it is Friday and that a student is absent is 3%. Since there are 5
school days in a week, the probability that it is Friday is 20%. What is the probability that a
student is absent given that today is Friday? Apply Bayes' rule in Python to get the result.
(Ans: 15%)

ALGORITHM:

Step 1: Read the probability that it is Friday and that a student is absent, P(F ∩ A).
Step 2: Read the probability that it is Friday, P(F).
Step 3: Apply the conditional-probability form of Bayes' rule: P(A | F) = P(F ∩ A) / P(F).
Step 4: Print the result and end.

SOURCE CODE:

PFIA = float(input("Enter probability that it is Friday and that a student is absent = "))
PF = float(input("Enter probability that it is Friday = "))
PABF = PFIA / PF
print("Probability that a student is absent given that today is Friday, using conditional probability =", PABF)

OUTPUT:

Enter probability that it is Friday and that a student is absent = 0.03
Enter probability that it is Friday = 0.2
Probability that a student is absent given that today is Friday, using conditional probability = 0.15
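The same computation can be packaged as a small function; the sketch below uses exact fractions to avoid floating-point rounding (the function name is our own, not part of the manual's listing):

```python
from fractions import Fraction

def p_absent_given_friday(p_friday_and_absent, p_friday):
    """Conditional probability: P(Absent | Friday) = P(Friday and Absent) / P(Friday)."""
    return p_friday_and_absent / p_friday

# P(Friday and Absent) = 3% = 3/100, P(Friday) = 20% = 1/5
result = p_absent_given_friday(Fraction(3, 100), Fraction(1, 5))
print(result)         # 3/20, i.e. 15%
print(float(result))  # 0.15
```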

VIVA QUESTIONS

1. How is machine learning different from general programming?

2. Is Python a compiled language or an interpreted language?

3. What is the difference between a Set and Dictionary?

4. What are *args and **kwargs?

5. Define prior probability and posterior probability.

6. Derive Bayes' theorem.

7. What is Bayes' rule?

8. What is Bayes classifier?

9. What are Bayesian Networks (BN)?

10. Given the following statistics, what is the probability that a woman has cancer if she has a
positive result?

EXPERIMENT – 2

AIM: EXTRACT THE DATA FROM DATABASE USING PYTHON.


EXPLANATION:
1. First, you need to create a table (students) in a MySQL database (SampleDB).
2. Open a command prompt and execute the following command to enter the MySQL prompt:
3. mysql -u root -p
4. Then execute the following commands at the MySQL prompt to create the table in the
database:
5. CREATE DATABASE SampleDB;
6. USE SampleDB;
7. CREATE TABLE students (sid VARCHAR(10), sname VARCHAR(10), age INT);
INSERT INTO students VALUES('s521','John Bob',23);
INSERT INTO students VALUES('s522','John Dilly',22);
INSERT INTO students VALUES('s523','Kenny',23);
INSERT INTO students VALUES('s524','Henry',23);
8. Next, open a command prompt and execute the following command to install the MySQL
connector package, which connects Python to a MySQL database:
pip install mysql-connector-python

SOURCE CODE:
import mysql.connector

myconn = mysql.connector.connect(host="localhost", user="root", passwd="", database="SampleDB")
cur = myconn.cursor()
cur.execute("SELECT * FROM students")
result = cur.fetchall()
print("Student details are:")
for x in result:
    print(x)
myconn.close()
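If a MySQL server is not available, the same extraction flow can be sketched with Python's built-in sqlite3 module; the in-memory database below mirrors the SampleDB example and is an illustrative alternative, not the manual's MySQL listing:

```python
import sqlite3

# Create an in-memory database and a students table mirroring SampleDB
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE students (sid TEXT, sname TEXT, age INTEGER)")
cur.executemany(
    "INSERT INTO students VALUES (?, ?, ?)",
    [("s521", "John Bob", 23), ("s522", "John Dilly", 22),
     ("s523", "Kenny", 23), ("s524", "Henry", 23)],
)
conn.commit()

# Extract the data, exactly as with mysql.connector
cur.execute("SELECT * FROM students")
rows = cur.fetchall()
print("Student details are:")
for row in rows:
    print(row)
conn.close()
```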

OUTPUT:

VIVA QUESTIONS
1. What is MySQL Connector/Python?
2. What are the five major steps for connecting MySQL and Python?
3. How do we create a connection object?
4. How do we create a cursor object?
5. How do we execute SQL query through Python?
6. What is the difference between the fetchall() and fetchone() methods?
7. What is the purpose of rowcount parameter?
8. Which method is used to establish a connection to a database using sqlite3 in Python?
9. Which method is used to execute an SQL query and fetch all the results in Python’s database
interaction?
10. Which method is used to close the cursor in Python’s database interaction?

EXPERIMENT – 3

AIM : IMPLEMENT FIND-S ALGORITHM USING PYTHON.


TRAINING DATABASE

ALGORITHM
1. Initialize h to the most specific hypothesis in H.
2. For each positive training instance x:
   For each attribute constraint a_i in h:
      If the constraint a_i is satisfied by x, then do nothing;
      Else replace a_i in h by the next more general constraint that is satisfied by x.
3. Output hypothesis h.

SOURCE CODE:
import csv

# Initialize an empty list to hold the data
a = []

# Open and read the CSV file
with open('FIND-S.csv', 'r') as csvfile:
    reader = csv.reader(csvfile)
    # Read all rows into the list `a`
    for row in reader:
        a.append(row)

# Print the number of training instances
print("\nThe total number of training instances are:", len(a))

# Get the number of attributes (excluding the target attribute)
num_attribute = len(a[0]) - 1

# Initialize the hypothesis with '0' (the most specific hypothesis)
hypothesis = ['0'] * num_attribute
print("\nThe initial hypothesis is:")
print(hypothesis)

# Process each training instance
for i in range(len(a)):
    # Check if the target attribute is 'yes'
    if a[i][num_attribute] == 'yes':
        # Update the hypothesis based on the current instance
        for j in range(num_attribute):
            if hypothesis[j] == '0' or hypothesis[j] == a[i][j]:
                hypothesis[j] = a[i][j]
            else:
                hypothesis[j] = '?'

    # Print the hypothesis for the current training instance
    print("\nThe hypothesis for the training instance {} is:".format(i + 1))
    print(hypothesis)

# Print the final maximally specific hypothesis
print("\nThe maximally specific hypothesis is:")
print(hypothesis)
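To try the algorithm without a FIND-S.csv file, the same update rule can be run on an inline copy of the classic EnjoySport training data (the data values below are the standard textbook example, assumed here rather than taken from the manual's CSV):

```python
# Inline EnjoySport training data: attribute values + target ('yes'/'no')
data = [
    ['sunny', 'warm', 'normal', 'strong', 'warm', 'same', 'yes'],
    ['sunny', 'warm', 'high', 'strong', 'warm', 'same', 'yes'],
    ['rainy', 'cold', 'high', 'strong', 'warm', 'change', 'no'],
    ['sunny', 'warm', 'high', 'strong', 'cool', 'change', 'yes'],
]

num_attribute = len(data[0]) - 1
hypothesis = ['0'] * num_attribute       # most specific hypothesis

for row in data:
    if row[num_attribute] == 'yes':      # Find-S uses only positive examples
        for j in range(num_attribute):
            if hypothesis[j] in ('0', row[j]):
                hypothesis[j] = row[j]   # keep / adopt the attribute value
            else:
                hypothesis[j] = '?'      # generalize on disagreement

print(hypothesis)  # ['sunny', 'warm', '?', 'strong', '?', '?']
```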

OUTPUT:

VIVA QUESTIONS

1. What is present in the version space of the Find-S algorithm in the beginning?

2. When does the hypothesis change in the Find-S algorithm, while iteration?

3. What is one of the advantages of the Find-S algorithm?

4. How does the hypothesis change gradually?

5. When do we use CSV file?

6. What is the csv.reader() function?

7. What is one of the advantages of the Find-S algorithm?

8. How does the hypothesis change gradually?

9. What is one of the drawbacks of the Find-S algorithm?

10. Does the Find-S algorithm accommodate all the maximally specific hypotheses?



EXPERIMENT: 4

AIM: IMPLEMENT CANDIDATE-ELIMINATION ALGORITHM USING PYTHON.

TRAINING DATABASE

ALGORITHM
1. Initialize S to the first positive training example (the most specific hypothesis) and G to the
most general hypothesis.
2. For each positive example: generalize S minimally so that it covers the example, and remove
from G any hypothesis inconsistent with the example.
3. For each negative example: specialize G minimally so that it excludes the example, and remove
from S any hypothesis that covers the example.
4. Output the final specific (S) and general (G) boundary sets.
SOURCE CODE:

import csv

# Open and read the CSV file
with open("enjoysport.csv") as f:
    csv_file = csv.reader(f)
    data = list(csv_file)

# Print the data from the CSV file
print(data)
print(" ")

# Extract the first positive instance (excluding the last element)
s = data[1][:-1]

# Initialize the general hypothesis as the most general
g = [['?' for i in range(len(s))] for j in range(len(s))]

# Print initial specific and general hypotheses
print("Initial specific hypothesis:", s)
print("Initial general hypothesis:", g)
print(" ")

# Candidate-Elimination algorithm
for i in data:
    if i[-1] == "TRUE":
        # For each positive training record
        for j in range(len(s)):
            if i[j] != s[j]:
                s[j] = '?'
                g[j][j] = '?'
    elif i[-1] == "FALSE":
        # For each negative training record
        for j in range(len(s)):
            if i[j] != s[j]:
                g[j][j] = s[j]
            else:
                g[j][j] = '?'

    # Print the hypothesis after processing each instance
    print("\nSteps of Candidate Elimination Algorithm", data.index(i) + 1)
    print("Specific hypothesis:", s)
    print("General hypothesis:", g)

# Extract the final general hypothesis
gh = []
for i in g:
    for j in i:
        if j != '?':
            gh.append(i)
            break

# Print the final hypotheses
print("\nFinal specific hypothesis:\n", s)
print("\nFinal general hypothesis:\n", gh)
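The elimination steps can be checked on an inline copy of the standard EnjoySport data (the inline values are the textbook example, assumed here rather than read from enjoysport.csv):

```python
# Inline EnjoySport data: attribute values + target ('TRUE'/'FALSE')
data = [
    ["sunny", "warm", "normal", "strong", "warm", "same", "TRUE"],
    ["sunny", "warm", "high", "strong", "warm", "same", "TRUE"],
    ["rainy", "cold", "high", "strong", "warm", "change", "FALSE"],
    ["sunny", "warm", "high", "strong", "cool", "change", "TRUE"],
]

n = len(data[0]) - 1
s = list(data[0][:-1])             # specific boundary, seeded with the first positive example
g = [['?'] * n for _ in range(n)]  # general boundary (one row per attribute)

for row in data:
    if row[-1] == "TRUE":          # positive example: generalize S, prune G
        for j in range(n):
            if row[j] != s[j]:
                s[j] = '?'
                g[j][j] = '?'
    else:                          # negative example: specialize G
        for j in range(n):
            if row[j] != s[j]:
                g[j][j] = s[j]
            else:
                g[j][j] = '?'

gh = [h for h in g if any(v != '?' for v in h)]
print("S =", s)   # ['sunny', 'warm', '?', 'strong', '?', '?']
print("G =", gh)  # [['sunny', '?', '?', '?', '?', '?'], ['?', 'warm', '?', '?', '?', '?']]
```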

OUTPUT:

VIVA QUESTIONS:

1. The algorithm is trying to find a suitable day for swimming. What is the most general hypothesis?

2. Candidate-Elimination algorithm can be described by?

3. How is the version space represented?

4. Is it possible that, in the output, set S contains only φ (the empty set)?

5. Python supports the creation of anonymous functions at runtime, using a construct called?

6. What is the order of precedence in python?

7. Which of the following is true for variable names in Python?

8. Let G be the set of maximally general hypotheses. While iterating through the dataset, when is it
changed for the first time?

9. What are the two main types of functions in Python?



EXPERIMENT: 5

AIM: IMPLEMENT DECISION-TREE LEARNING ALGORITHM USING PYTHON.


ALGORITHM:

Consider a training dataset D (a real-time dataset that predicts cancer: cancer.xlsx).

SOURCE CODE:

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.tree import DecisionTreeClassifier, plot_tree

from sklearn.metrics import accuracy_score, classification_report

from sklearn.preprocessing import LabelEncoder

import numpy as np

import math

import matplotlib.pyplot as plt

file_path = "C:\\ASMA\\cancer.xlsx" # Use double backslashes in the file path

df = pd.read_excel(file_path)

label_encoders = {}

for column in df.columns:

if df[column].dtype == 'object': # Check if column is non-numeric

le = LabelEncoder()

df[column] = le.fit_transform(df[column])

label_encoders[column] = le

X = df.drop('Level', axis=1) # Drop the 'Level' column to use the rest as features

y = df['Level'] # Target variable

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

clf = DecisionTreeClassifier(criterion='entropy', random_state=42)

clf.fit(X_train, y_train)

python

Copy code

y_pred = clf.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)

def calculate_entropy(y):
MACHINE LEARNING LAB

class_counts = np.bincount(y)

probabilities = class_counts /

len(y)

return -np.sum([p * math.log2(p) for p in probabilities if p >

0]) def calculate_information_gain(X_column, y, threshold):

parent_entropy =

calculate_entropy(y) left_split =

y[X_column <= threshold] right_split

= y[X_column > threshold] n = len(y)

n_left, n_right = len(left_split),

len(right_split) if n_left == 0 or n_right == 0:

return 0

weighted_avg_entropy = (n_left / n) * calculate_entropy(left_split) + (n_right / n) *

calculate_entropy(right_split)

return parent_entropy - weighted_avg_entropy

first_feature_index = clf.tree_.feature[0]

first_threshold = clf.tree_.threshold[0]

first_feature_name = X.columns[first_feature_index]

X_column = X_train.iloc[:, first_feature_index]

first_split_ig = calculate_information_gain(X_column, y_train, first_threshold)

first_split_entropy = calculate_entropy(y_train)

print(f"Model Accuracy: {accuracy}")

print(f"Entropy of the first node: {first_split_entropy}")

print(f"Information Gain of the first split on {first_feature_name} at threshold {first_threshold}:

{first_split_ig}")

python

Copy code
MACHINE LEARNING LAB

plt.figure(figsize=(15, 10))
MACHINE LEARNING LAB

plot_tree(clf, filled=True, feature_names=X.columns, class_names=np.unique(y).astype(str),

rounded=True)

plt.title("Decision Tree Visualization")

plt.show()

print("\nClassification Report:\n", classification_report(y_test, y_pred))
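The entropy computed above can be checked by hand: for a perfectly balanced binary split it is exactly 1 bit, and for a pure node it is 0. A stdlib-only version of the same formula (the function name is ours):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy H = -sum(p * log2(p)) over the class distribution."""
    n = len(labels)
    counts = Counter(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

print(entropy([0, 0, 1, 1]))  # 1.0 (balanced -> maximum entropy for 2 classes)
print(entropy([0, 0, 0, 0]))  # -0.0 (pure node)
```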

OUTPUT:

VIVA QUESTIONS:

1. Practical decision tree learning algorithms are based on heuristics?

2. Given the entropy for a split, e_split = 0.39, and the entropy before the split, e_before, what is the
information gain for the split?

3. Information gain and gini index are the same.?

4. What is a decision tree algorithm used for?

5. Which algorithm is commonly used to construct decision trees?

6. Which attribute selection measure is used in the id3 algorithm?

7. What is the goal of a decision tree algorithm during training?

8. Which algorithm can handle missing values in decision trees?

9. What is pruning?

10. What is the formula for the Gini index?



EXPERIMENT – 6

AIM: IMPLEMENT K-NEAREST NEIGHBORS CLASSIFICATION USING PYTHON


Step 1: Load the data.
Step 2: Initialize the value of k.
Step 3: To get the predicted class, iterate from 1 to the total number of training data points:
 Calculate the distance between the test data and each row of training data. Here we use
Euclidean distance as our distance metric, since it is the most popular method. Other metrics
that can be used are Chebyshev, cosine, etc.
 Sort the calculated distances in ascending order based on distance values.
 Get the top k rows from the sorted array.
 Get the labels of the selected k entries.
 If regression, return the mean of the k labels.
 If classification, return the mode of the k labels (the most frequent class).
Step 4: End.

SOURCE CODE

import numpy as np
from sklearn import datasets
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D

# Load the iris dataset
iris = datasets.load_iris()
data = iris.data
labels = iris.target

# Print selected samples
for i in [0, 79, 99, 101]:
    print(f"index: {i:3}, features: {data[i]}, label: {labels[i]}")

# Shuffle the data
np.random.seed(42)
indices = np.random.permutation(len(data))

n_training_samples = 12

# Create training and test sets
learn_data = data[indices[:-n_training_samples]]
learn_labels = labels[indices[:-n_training_samples]]

test_data = data[indices[-n_training_samples:]]
test_labels = labels[indices[-n_training_samples:]]

# Print learn and test sets
print("The first samples of our learn set:")
print(f"{'index':7s}{'data':50s}{'label':3s}")
for i in range(5):
    print(f"{i:4d} {learn_data[i]} {learn_labels[i]:3}")

print("The first samples of our test set:")
print(f"{'index':7s}{'data':50s}{'label':3s}")
for i in range(5):
    print(f"{i:4d} {test_data[i]} {test_labels[i]:3}")

# Visualizing the data of our learn set
colours = ("r", "g", "y")
X = []

for iclass in range(3):
    X.append([[], [], []])
    for i in range(len(learn_data)):
        if learn_labels[i] == iclass:
            X[iclass][0].append(learn_data[i][0])
            X[iclass][1].append(learn_data[i][1])
            X[iclass][2].append(sum(learn_data[i][2:]))

fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')

for iclass in range(3):
    ax.scatter(X[iclass][0], X[iclass][1], X[iclass][2], c=colours[iclass])

plt.show()

# Euclidean distance function
def distance(instance1, instance2):
    """Calculates the Euclidean distance between two instances."""
    return np.linalg.norm(np.subtract(instance1, instance2))

# Function to find the k nearest neighbors
def get_neighbors(training_set, labels, test_instance, k, distance):
    """
    Calculates a list of the k nearest neighbors of 'test_instance'.
    Returns a list of k 3-tuples, each consisting of (instance, dist, label).
    """
    distances = []
    for index in range(len(training_set)):
        dist = distance(test_instance, training_set[index])
        distances.append((training_set[index], dist, labels[index]))
    distances.sort(key=lambda x: x[1])
    neighbors = distances[:k]
    return neighbors

# Testing the neighbors function on the test set
for i in range(5):
    neighbors = get_neighbors(learn_data, learn_labels, test_data[i], 3, distance=distance)
    print("Index: ", i, '\n',
          "Testset Data: ", test_data[i], '\n',
          "Testset Label: ", test_labels[i], '\n',
          "Neighbors: ", neighbors, '\n')
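Note that get_neighbors returns the neighbors but does not vote. A sketch of the missing classification step, using a majority vote over the neighbor labels (the vote helper and the tiny synthetic dataset are ours, used so the snippet runs without the iris split above):

```python
import numpy as np
from collections import Counter

def distance(a, b):
    # Euclidean distance between two instances
    return np.linalg.norm(np.subtract(a, b))

def get_neighbors(training_set, labels, test_instance, k):
    # Sort (dist, label) pairs by distance and keep the k closest
    dists = sorted(
        ((distance(test_instance, training_set[i]), labels[i])
         for i in range(len(training_set))),
        key=lambda t: t[0],
    )
    return dists[:k]

def vote(neighbors):
    """Majority vote over (dist, label) neighbor tuples."""
    return Counter(label for _, label in neighbors).most_common(1)[0][0]

train = [[0.0, 0.0], [0.1, 0.1], [1.0, 1.0], [1.1, 0.9]]
labels = [0, 0, 1, 1]
neighbors = get_neighbors(train, labels, [0.05, 0.0], k=3)
print(vote(neighbors))  # 0 (two of the three nearest neighbors have label 0)
```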

OUTPUT:

VIVA QUESTIONS
1. How does KNN deal with missing data?

2. How does KNN handle imbalanced datasets?

3. What are some common ways to improve KNN performance?

4. How do you implement KNN in Python using sklearn?

5. What are the computational challenges in KNN?

6. What are the pros and cons of KNN?

7. How does distance metric affect KNN?

8. What is the role of the parameter ‘K’ in KNN?

9. How does KNN handle classification and regression tasks?

10. What is KNN?



EXPERIMENT -7

AIM: GIVEN THE FOLLOWING DATA, WHICH SPECIFY CLASSIFICATIONS FOR NINE
COMBINATIONS OF VAR1 AND VAR2, PREDICT A CLASSIFICATION FOR A CASE WHERE
VAR1=0.906 AND VAR2=0.606, USING THE RESULT OF K-MEANS CLUSTERING WITH 3
MEANS (I.E., 3 CENTROIDS).
VAR1 VAR2 CLASS
1.713 1.586 0
0.180 1.786 1
0.353 1.240 1
0.940 1.566 0
1.486 0.759 1
1.266 1.106 0
1.540 0.419 1
0.459 1.799 1
0.773 0.186 1
SOURCE CODE:
import numpy as np
from sklearn.cluster import KMeans

# Data with VAR1, VAR2, and CLASS


data = np.array([
[1.713, 1.586, 0],
[0.180, 1.786, 1],
[0.353, 1.240, 1],
[0.940, 1.566, 0],
[1.486, 0.759, 1],
[1.266, 1.106, 0],
[1.540, 0.419, 1],
[0.459, 1.799, 1],
[0.773, 0.186, 1]
])

# Separate features (VAR1, VAR2) and labels (CLASS)


X = data[:, :2]
y = data[:, 2]

# Define KMeans model with 3 clusters


kmeans = KMeans(n_clusters=3, random_state=42)

# Fit the model to the data


kmeans.fit(X)

# New data point to classify


new_point = np.array([[0.906, 0.606]])

# Predict the cluster for the new data point


predicted_cluster = kmeans.predict(new_point)

print(f"The new data point belongs to cluster: {predicted_cluster[0]}")

# Map cluster to the majority class within each cluster


# For simplicity, we'll assign the most frequent class in each cluster
from collections import Counter

# Get the labels (clusters) for each point


cluster_labels = kmeans.labels_

# Create a mapping from cluster to majority class


cluster_to_class = {}

for cluster in np.unique(cluster_labels):


# Get the indices of points in the current cluster
indices = np.where(cluster_labels == cluster)
# Find the most common class for this cluster
common_class = Counter(y[indices]).most_common(1)[0][0]
cluster_to_class[cluster] = common_class

# Predict the class based on the cluster


predicted_class = cluster_to_class[predicted_cluster[0]]
print(f"The predicted class for VAR1=0.906 and VAR2=0.606 is: {predicted_class}")
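Once k-means has produced centroids, assigning a new point is just a nearest-centroid lookup. A stdlib sketch of that assignment step (the centroid coordinates below are hypothetical placeholders for illustration, not the values KMeans actually computes on this data):

```python
import math

# Hypothetical centroids, keyed by cluster id (illustrative values only)
centroids = {0: (1.5, 1.0), 1: (0.4, 1.6), 2: (0.9, 0.4)}

def nearest_cluster(point, centroids):
    """Return the id of the centroid closest to the point (Euclidean distance)."""
    return min(centroids, key=lambda c: math.dist(point, centroids[c]))

print(nearest_cluster((0.906, 0.606), centroids))  # 2 (closest of the three placeholders)
```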

OUTPUT:

VIVA QUESTIONS

1. List the classification algorithms.

2. Classification is which type of learning?
3. Which of the following statements is false about ensemble learning?
4. Which of the following statements is true about stochastic gradient descent?
5. Does a decision tree use the inductive learning approach?
6. Which of the following statements is not true about boosting?
7. K-nearest neighbors (KNN) is classified as what type of machine learning algorithm?
8. What is machine learning?
9. What is the key benefit of using deep learning for tasks like recognizing images?
10. What is classification?

EXPERIMENT - 8

AIM: THE FOLLOWING TRAINING EXAMPLES MAP DESCRIPTIONS OF INDIVIDUALS
ONTO HIGH, MEDIUM AND LOW CREDITWORTHINESS.
medium skiing design single twenties no -> highRisk
high golf trading married forties yes -> lowRisk
low speedway transport married thirties yes -> medRisk
medium football banking single thirties yes -> lowRisk
high flying media married fifties yes -> highRisk
low football security single twenties no -> medRisk
medium golf media single thirties yes -> medRisk
medium golf transport married forties yes -> lowRisk
high skiing banking single thirties yes -> highRisk
low golf unemployed married forties yes -> highRisk
Input attributes are (from left to right): income, recreation, job, status, age-group, home-owner. Find the
unconditional probability of 'golf' and the conditional probability of 'single' given 'medRisk' in this
dataset.

SOURCE CODE:
import numpy as np

# Define the dataset
data = np.array([
['medium', 'skiing', 'design', 'single', 'twenties', 'no', 'highRisk'],
['high', 'golf', 'trading', 'married', 'forties', 'yes', 'lowRisk'],
['low', 'speedway', 'transport', 'married', 'thirties', 'yes', 'medRisk'],
['medium', 'football', 'banking', 'single', 'thirties', 'yes', 'lowRisk'],
['high', 'flying', 'media', 'married', 'fifties', 'yes', 'highRisk'],
['low', 'football', 'security', 'single', 'twenties', 'no', 'medRisk'],
['medium', 'golf', 'media', 'single', 'thirties', 'yes', 'medRisk'],
['medium', 'golf', 'transport', 'married', 'forties', 'yes', 'lowRisk'],
['high', 'skiing', 'banking', 'single', 'thirties', 'yes', 'highRisk'],
['low', 'golf', 'unemployed', 'married', 'forties', 'yes', 'highRisk']
])

# Extract columns for recreation, status, and label



recreation_column = data[:, 1]
status_column = data[:, 3]
label_column = data[:, 6]

# Calculate the unconditional probability of 'golf'


total_samples = len(data)
golf_count = np.sum(recreation_column == 'golf')
unconditional_prob_golf = golf_count / total_samples

# Calculate the conditional probability of 'single' given 'medRisk'


medrisk_indices = np.where(label_column == 'medRisk')
single_given_medrisk_count = np.sum(status_column[medrisk_indices] == 'single')
medrisk_count = len(medrisk_indices[0])
conditional_prob_single_given_medrisk = single_given_medrisk_count / medrisk_count

# Output the results


print(f"Unconditional probability of 'golf': {unconditional_prob_golf:.2f}")
print(f"Conditional probability of 'single' given 'medRisk': {conditional_prob_single_given_medrisk:.2f}")
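The same counts can be cross-checked with pandas, which makes it easy to extend the query to other attributes (a sketch; the column names are chosen here for illustration, not given in the manual):

```python
import pandas as pd

rows = [
    ("medium", "skiing", "design", "single", "twenties", "no", "highRisk"),
    ("high", "golf", "trading", "married", "forties", "yes", "lowRisk"),
    ("low", "speedway", "transport", "married", "thirties", "yes", "medRisk"),
    ("medium", "football", "banking", "single", "thirties", "yes", "lowRisk"),
    ("high", "flying", "media", "married", "fifties", "yes", "highRisk"),
    ("low", "football", "security", "single", "twenties", "no", "medRisk"),
    ("medium", "golf", "media", "single", "thirties", "yes", "medRisk"),
    ("medium", "golf", "transport", "married", "forties", "yes", "lowRisk"),
    ("high", "skiing", "banking", "single", "thirties", "yes", "highRisk"),
    ("low", "golf", "unemployed", "married", "forties", "yes", "highRisk"),
]
cols = ["income", "recreation", "job", "status", "age_group", "home_owner", "risk"]
df = pd.DataFrame(rows, columns=cols)

# P(golf): fraction of all rows whose recreation is golf
p_golf = (df["recreation"] == "golf").mean()

# P(single | medRisk): fraction of medRisk rows whose status is single
med = df[df["risk"] == "medRisk"]
p_single_given_med = (med["status"] == "single").mean()

print(p_golf, p_single_given_med)  # 0.4 and 2/3
```

Golf appears in 4 of the 10 rows, and 2 of the 3 medRisk rows have status 'single', so the NumPy version above should print the same values.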

OUTPUT:

VIVA QUESTIONS

1. What is Linear Regression?


2. In a simple linear regression, how many independent variables are there?
3. What is the primary goal of linear regression?
4. What is the equation of a simple linear regression line?
5. What is the difference between simple linear regression and multiple linear regression?
6. Which type of Programming does Python support?
7. Who developed Python Programming Language?
8. Is Python code compiled or interpreted?
9. All keywords in Python are in ________?
10. What will be the value of the following Python expression?

EXPERIMENT - 9

AIM: IMPLEMENT LINEAR REGRESSION USING PYTHON.


SOURCE CODE:
import numpy as np
import matplotlib.pyplot as plt

# Sample data
X = np.array([1, 2, 4, 3, 5])
y = np.array([1, 3, 3, 2, 5])

# Mean of X and y
X_mean = np.mean(X)
y_mean = np.mean(y)

# Calculate coefficients (slope m and intercept b) for the line y = mx + b


numerator = np.sum((X - X_mean) * (y - y_mean))
denominator = np.sum((X - X_mean) ** 2)

m = numerator / denominator  # slope
b = y_mean - m * X_mean      # intercept

# Regression line
y_pred = m * X + b

# Print coefficients
print(f"Slope (m): {m}")
print(f"Intercept (b): {b}")

# Plot the data points and the regression line
plt.scatter(X, y, color='blue', label='Data points')
plt.plot(X, y_pred, color='red', label='Regression line')
plt.xlabel('X')
plt.ylabel('y')

plt.legend()
plt.show()
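The closed-form slope and intercept can be cross-checked against scikit-learn's `LinearRegression` on the same five points (a sketch):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([1, 2, 4, 3, 5]).reshape(-1, 1)  # sklearn expects a 2-D feature matrix
y = np.array([1, 3, 3, 2, 5])

model = LinearRegression().fit(X, y)
print(model.coef_[0], model.intercept_)  # should match m and b
```

For these points the least-squares fit is m = 0.8 and b = 0.4, so the printed coefficients should agree with the manual calculation.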
OUTPUT:

VIVA QUESTIONS
1. What Is Regression?

2. What are the different types of Logistic Regression?

3. How do we handle categorical variables in Logistic Regression?

4. What are the assumptions made in Logistic Regression?

5. Can we solve multiclass classification problems using Logistic Regression? If yes, then how?

6. The correlation for the values of two variables moving in the same direction is

7. Who introduced the term ‘regression’?

8. The slope of the regression line of Y on X is also referred to as the:

9. What is the other term used for dependent variables?

10. What is the significance of hypothesis testing?



EXPERIMENT - 10

AIM: IMPLEMENT NAÏVE BAYES THEOREM TO CLASSIFY THE ENGLISH TEXT


SOURCE CODE
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import accuracy_score, classification_report

# Sample text data (text, label)


data = [
    ("I love this sandwich", "positive"),
    ("This is an amazing place", "positive"),
    ("I feel very good about these beers", "positive"),
    ("This is my best work", "positive"),
    ("I do not like this restaurant", "negative"),
    ("I am tired of this stuff", "negative"),
    ("I can't deal with this", "negative"),
    ("He is my sworn enemy", "negative"),
    ("My boss is horrible", "negative")
]

# Separate the text and the labels


texts, labels = zip(*data)

# Convert the text data to a bag-of-words (BOW) model using CountVectorizer


vectorizer = CountVectorizer()
X = vectorizer.fit_transform(texts)

# Split the data into training and testing sets


X_train, X_test, y_train, y_test = train_test_split(X, labels, test_size=0.3, random_state=42)

# Create and train the Naive Bayes model
model = MultinomialNB()

model.fit(X_train, y_train)

# Predict the labels for the test data


y_pred = model.predict(X_test)

# Evaluate the model


print(f"Accuracy: {accuracy_score(y_test, y_pred):.2f}")
print("Classification Report:")
print(classification_report(y_test, y_pred))

# Test on new text data


new_texts = ["I enjoy this new movie", "I hate this weather"]
new_X = vectorizer.transform(new_texts)
predictions = model.predict(new_X)

for text, prediction in zip(new_texts, predictions):


print(f"Text: {text} => Predicted Label: {prediction}")
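The vectorizer and classifier can also be bundled into a single scikit-learn `Pipeline`, so new raw text can be classified without calling the vectorizer separately (a sketch on a smaller hypothetical subset of the data):

```python
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

texts = [
    "I love this sandwich", "This is an amazing place",
    "I do not like this restaurant", "My boss is horrible",
]
labels = ["positive", "positive", "negative", "negative"]

# The pipeline applies CountVectorizer, then MultinomialNB, as one object
clf = make_pipeline(CountVectorizer(), MultinomialNB())
clf.fit(texts, labels)

print(clf.predict(["I love this place"])[0])
```

Keeping vectorization inside the pipeline avoids the common mistake of fitting the vectorizer on test data.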

OUTPUT:

VIVA QUESTIONS
1. Are Naïve Bayes classifier algorithms mainly used in text classification?

2. What is the formula for Bayes’ theorem, where (A & B) and (H & E) are events and P(B), P(H) & P(E) ≠ 0?

3. What are the assumptions of the Naïve Bayesian classifier?

4. Is the assumption of the Naïve Bayes algorithm a limitation to use it?

5. There are two boxes. The first box contains 3 white and 2 red balls, whereas the second contains 5 white and 4 red balls. A ball is drawn at random from one of the two boxes and is found to be white. Find the probability that the ball was drawn from the second box.

6. The main objective of a classification algorithm in supervised learning is to?

7. The term "supervised" in supervised learning refers to:

8. What is the maximum possible length of an identifier?

9. In which year was the Python language developed?

10. What do we use to define a block of code in Python language?



EXPERIMENT – 11

AIM: IMPLEMENT AN ALGORITHM TO DEMONSTRATE THE SIGNIFICANCE OF GENETIC ALGORITHMS

SOURCE CODE:
import random
import numpy as np

# Parameters
population_size = 10
chromosome_length = 6 # Binary representation with 6 bits
max_generations = 50
mutation_rate = 0.01

# Fitness function: maximize f(x) = x^2
def fitness_function(individual):
    # Convert the binary string to its decimal value
    x = int("".join(map(str, individual)), 2)
    return x ** 2

# Create an individual (random binary string)
def create_individual():
    return [random.randint(0, 1) for _ in range(chromosome_length)]

# Create an initial population
def create_population():
    return [create_individual() for _ in range(population_size)]

# Selection function (roulette wheel selection)


def selection(population, fitness_scores):
total_fitness = sum(fitness_scores)
probabilities = [score / total_fitness for score in fitness_scores]
selected_index = np.random.choice(len(population), p=probabilities)
return population[selected_index]

# Crossover (single-point crossover)


def crossover(parent1, parent2):
crossover_point = random.randint(1, chromosome_length - 1)
child1 = parent1[:crossover_point] + parent2[crossover_point:]
child2 = parent2[:crossover_point] + parent1[crossover_point:]
return child1, child2

# Mutation
def mutate(individual):
for i in range(len(individual)):
if random.random() < mutation_rate:
individual[i] = 1 - individual[i] # Flip bit
return individual

# Main Genetic Algorithm loop


def genetic_algorithm():
# Step 1: Initialize population
population = create_population()

for generation in range(max_generations):


# Step 2: Evaluate fitness for the population
fitness_scores = [fitness_function(individual) for individual in population]

# Display best solution in current generation


best_fitness = max(fitness_scores)
best_individual = population[fitness_scores.index(best_fitness)]
print(f"Generation {generation + 1}: Best Fitness = {best_fitness}, Best Individual =
{best_individual}")

# Step 3: Selection, Crossover, and Mutation to create new population


new_population = []
for _ in range(population_size // 2): # Create pairs of offspring
# Select two parents based on fitness
parent1 = selection(population, fitness_scores)
parent2 = selection(population, fitness_scores)

# Apply crossover
child1, child2 = crossover(parent1, parent2)

# Apply mutation
child1 = mutate(child1)
child2 = mutate(child2)

# Add children to the new population


new_population.append(child1)
new_population.append(child2)

# Step 4: Replace the old population with the new population


population = new_population

# Step 5: Return the best solution after all generations


fitness_scores = [fitness_function(individual) for individual in population]
best_fitness = max(fitness_scores)
best_individual = population[fitness_scores.index(best_fitness)]
return best_individual, best_fitness

# Run the genetic algorithm


best_individual, best_fitness = genetic_algorithm()

# Display the final result


x = int("".join(map(str, best_individual)), 2)
print(f"\nBest individual after {max_generations} generations: {best_individual}")
print(f"Best solution x = {x}, Fitness (x^2) = {best_fitness}")
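Because the chromosome is only 6 bits, the search space is x ∈ [0, 63], so the global optimum the GA should approach is known exactly. A brute-force check (a sketch) confirms it:

```python
# Enumerate all 6-bit values and pick the one maximizing f(x) = x^2
best_x = max(range(2 ** 6), key=lambda x: x ** 2)
print(best_x, best_x ** 2)  # 63 3969
```

Comparing the GA's final fitness against 3969 shows how close the evolved population came to the true optimum.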

OUTPUT:

VIVA QUESTIONS

1. What is a Genetic Algorithm?

2. The algorithm operates by iteratively updating a pool of hypotheses, called the

3. When would the genetic algorithm terminate?

4. GA techniques are inspired by biology.

5. is any predicate (or its negation) applied to any set of terms.

6. What are the requirements for the Learn-One-Rule method?

7. Which type of feedback is used by RL?

8. 0*10 represents the set of bit strings that includes exactly

9. Search through the hypothesis space cannot be characterized. Why?

10. What does ILP stand for?



EXPERIMENT -12

AIM: IMPLEMENT THE FINITE WORDS CLASSIFICATION SYSTEM USING THE BACK-PROPAGATION ALGORITHM

SOURCE CODE:
import numpy as np

# Sigmoid activation function
def sigmoid(x):
    return 1 / (1 + np.exp(-x))

# Derivative of the sigmoid (takes the sigmoid output, used for backpropagation)
def sigmoid_derivative(x):
    return x * (1 - x)

# Word dataset with feature vectors and labels (manually created embeddings)
# Features: [length of word, number of vowels, count of the letter 'e']
word_data = np.array([
[5, 2, 1], # apple (fruit)
[6, 3, 1], # orange (fruit)
[3, 1, 0], # dog (animal)
[5, 2, 0], # tiger (animal)
[6, 2, 1], # table (object)
[4, 1, 0], # book (object)
])

# Labels (fruit = [1, 0, 0], animal = [0, 1, 0], object = [0, 0, 1])
labels = np.array([
[1, 0, 0], # fruit
[1, 0, 0], # fruit
[0, 1, 0], # animal
[0, 1, 0], # animal
[0, 0, 1], # object
[0, 0, 1], # object

])

# Initialize parameters
input_size = word_data.shape[1] # Number of input features
hidden_size = 4 # Number of neurons in the hidden layer
output_size = labels.shape[1] # Number of output categories (3 in this case)
learning_rate = 0.5
epochs = 10000 # Number of iterations for training

# Initialize weights and biases


np.random.seed(42)
weights_input_hidden = np.random.uniform(-1, 1, (input_size, hidden_size))
weights_hidden_output = np.random.uniform(-1, 1, (hidden_size, output_size))

bias_hidden = np.random.uniform(-1, 1, (1, hidden_size))


bias_output = np.random.uniform(-1, 1, (1, output_size))

# Training the neural network using backpropagation


for epoch in range(epochs):
# Feedforward phase
hidden_input = np.dot(word_data, weights_input_hidden) + bias_hidden
hidden_output = sigmoid(hidden_input)

final_input = np.dot(hidden_output, weights_hidden_output) + bias_output


final_output = sigmoid(final_input)

# Calculate error
error = labels - final_output

# Backpropagation phase
d_output = error * sigmoid_derivative(final_output)

error_hidden_layer = d_output.dot(weights_hidden_output.T)
d_hidden_layer = error_hidden_layer * sigmoid_derivative(hidden_output)

# Update weights and biases



weights_hidden_output += hidden_output.T.dot(d_output) * learning_rate


bias_output += np.sum(d_output, axis=0, keepdims=True) * learning_rate

weights_input_hidden += word_data.T.dot(d_hidden_layer) * learning_rate


bias_hidden += np.sum(d_hidden_layer, axis=0, keepdims=True) * learning_rate

# Print error at intervals


if epoch % 1000 == 0:
loss = np.mean(np.abs(error))
print(f'Epoch {epoch}, Loss: {loss}')

# Testing the trained neural network


def classify(word_vector):
hidden_layer_activation = np.dot(word_vector, weights_input_hidden) + bias_hidden
hidden_layer_output = sigmoid(hidden_layer_activation)

final_layer_activation = np.dot(hidden_layer_output, weights_hidden_output) + bias_output


final_output = sigmoid(final_layer_activation)

return final_output

# Test on new word data


new_word = np.array([4, 1, 0]) # book-like input
classification = classify(new_word)
print("\nClassification (Output Probabilities):", classification)

# Convert probabilities to class


category = np.argmax(classification)
categories = ["fruit", "animal", "object"]
print(f"Classified as: {categories[category]}")
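A subtle point in the code above is that `sigmoid_derivative` expects the sigmoid *output* (the layer activation), not the raw input. A quick finite-difference check (a sketch) confirms the identity σ'(x) = σ(x)(1 − σ(x)):

```python
import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

x = 0.7
eps = 1e-6

# Numerical derivative via central differences
numeric = (sigmoid(x + eps) - sigmoid(x - eps)) / (2 * eps)

# Analytic derivative: apply sigmoid_derivative to the sigmoid OUTPUT
s = sigmoid(x)
analytic = s * (1 - s)

print(abs(numeric - analytic))  # should be very close to zero
```

This is why the training loop passes `final_output` and `hidden_output` (activations), not the pre-activation sums, into `sigmoid_derivative`.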

OUTPUT:

VIVA QUESTIONS

1. What is backpropagation?

2. How does backpropagation work?

3. What is the difference between a Perceptron and Logistic Regression?

4. Can we have the same bias for all neurons of a hidden layer?

5. What if we do not use any activation function(s) in a neural network?

6. In a neural network, what if all the weights are initialized with the same value?

7. What is the role of weights and bias in a neural network?

8. How can learning process be stopped in backpropagation rule?

9. Does backpropagation learning is based on gradient descent along error surface?

10. What is meant by “generalized” in the statement “backpropagation is a generalized delta rule”?
