Practical File OF Machine Learning

This practical file outlines various machine learning experiments conducted by Shubham Kumar Chaubey as part of a Bachelor of Technology program in Computer Science. It includes experiments on natural language processing, customer segmentation using K-Means clustering, and neural networks, detailing methodologies such as tokenization, stopword removal, and sentiment analysis. The document serves as a comprehensive guide to applying machine learning techniques to real-world data analysis tasks.


PRACTICAL FILE

OF
MACHINE LEARNING

BACHELOR OF TECHNOLOGY IN COMPUTER SCIENCE

SESSION 2022-2026
Department of Computer Science
Global Institute of Technology & Management,
(Gurugram University)
Farrukh Nagar, Haryana, India

SUBMITTED BY
Name: Shubham Kumar Chaubey
Roll No.: 221116
Semester: 6th- B

SUBMITTED TO
Name: Mr. Muzamil Aslam
Designation: Professor

INDEX

S. NO.  EXPERIMENT                                        DATE        SIGNATURE
1.      Automatic Word Analysis in NLP                    11-02-2025
2.      Classification Algorithms and ROC Interpretation  28-02-2025
3.      Customer Segmentation using K-Means Clustering    21-03-2025
4.      Neural Networks: Feedforward, CNN, and RNN        28-03-2025
5.      Feature Selection Techniques                      04-04-2025
6.      Linear and Logistic Regression Implementation     11-04-2025
7.      K-Means Clustering on Iris Dataset                17-04-2025

Experiment 1: Automatic Word Analysis in NLP

import nltk
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
from nltk.stem import PorterStemmer, WordNetLemmatizer
from collections import Counter
from textblob import TextBlob

# Download necessary NLTK resources
nltk.download('punkt')
nltk.download('punkt_tab')  # tokenizer data required by newer NLTK releases
nltk.download('stopwords')
nltk.download('wordnet')

# Sample text for analysis

text = """Machine learning is a branch of artificial
intelligence that enables computers to learn from data.
It is used in various applications, including speech
recognition, recommendation systems, and automation."""

# 1. Tokenization (Splitting text into words)

tokens = word_tokenize(text.lower()) # Convert to lowercase

# 2. Removing stopwords
stop_words = set(stopwords.words("english"))
filtered_words = [word for word in tokens if word.isalnum()
and word not in stop_words]

# 3. Stemming (Reducing words to their root form)

stemmer = PorterStemmer()
stemmed_words = [stemmer.stem(word) for word in
filtered_words]

# 4. Lemmatization (Converting words to their base form)

lemmatizer = WordNetLemmatizer()
lemmatized_words = [lemmatizer.lemmatize(word) for word in
filtered_words]

# 5. Word Frequency Analysis

word_freq = Counter(lemmatized_words)

# 6. Sentiment Analysis
blob = TextBlob(text)
sentiment = blob.sentiment.polarity  # Range from -1 (negative) to 1 (positive)

# Output Results
print("Original Text:", text)
print("\nTokenized Words:", tokens)
print("\nFiltered Words (Without Stopwords):", filtered_words)
print("\nStemmed Words:", stemmed_words)
print("\nLemmatized Words:", lemmatized_words)
print("\nWord Frequency:", word_freq)
print("\nSentiment Analysis Score:", sentiment)
if sentiment > 0:
    print("Overall Sentiment: Positive")
elif sentiment < 0:
    print("Overall Sentiment: Negative")
else:
    print("Overall Sentiment: Neutral")

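Explanation:

• Tokenization: Splits the lowercased input text into individual word and punctuation tokens.
• Stopword Removal: Eliminates common English words (such as 'is', 'a', 'of') that carry little meaning on their own; the word.isalnum() check also drops punctuation tokens.
• Stemming: Reduces words to their stems using the Porter Stemmer; a stem such as 'machin' need not be a dictionary word.
• Lemmatization: Uses the WordNet Lemmatizer to map words to their dictionary base forms.
• Word Frequency Analysis: Counts occurrences of each lemmatized word in the text.
• Sentiment Analysis: Uses TextBlob to compute a polarity score, which the script labels as positive, negative, or neutral.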
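
The first two steps and the frequency count can also be sketched without NLTK, which may help show what the library calls are doing. This is a minimal illustration only: STOP_WORDS is a hypothetical, deliberately tiny stopword list and word_frequencies is a made-up helper name, not part of the experiment's code.

```python
import re
from collections import Counter

# Hypothetical mini stopword list; NLTK's English list is far longer.
STOP_WORDS = {"is", "a", "of", "that", "to", "from", "it", "in", "and", "the"}

def word_frequencies(text: str) -> Counter:
    """Lowercase the text, split it into alphanumeric tokens,
    drop stopwords, and count the remaining words."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    kept = [t for t in tokens if t not in STOP_WORDS]
    return Counter(kept)

freq = word_frequencies(
    "Machine learning is a branch of artificial intelligence. "
    "Learning from data is the core of machine learning."
)
print(freq.most_common(2))
```

Unlike word_tokenize, the regular expression here simply discards punctuation, so no separate isalnum() filter is needed.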

Expected Output:
Original Text: Machine learning is a branch of artificial
intelligence that enables computers to learn from data...

Tokenized Words: ['machine', 'learning', 'is', 'a', 'branch', 'of', ...]

Filtered Words (Without Stopwords): ['machine', 'learning', 'branch', 'artificial', ...]

Stemmed Words: ['machin', 'learn', 'branch', 'artifici', 'intellig', ...]

Lemmatized Words: ['machine', 'learning', 'branch', 'artificial', 'intelligence', ...]

Word Frequency: Counter({'machine': 1, 'learning': 1, 'branch': 1, ...})

Sentiment Analysis Score: 0.3

Overall Sentiment: Positive
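
To make the sentiment step less of a black box, the following is a much-simplified sketch of lexicon-based polarity scoring. It is not TextBlob's implementation: TextBlob's default analyzer is also lexicon-based but considerably richer, and LEXICON, polarity, and label are hypothetical names with made-up scores chosen only for illustration.

```python
import re

# Hypothetical mini-lexicon mapping words to polarity in [-1, 1].
LEXICON = {"good": 0.7, "great": 0.8, "useful": 0.3, "bad": -0.7, "terrible": -1.0}

def polarity(text: str) -> float:
    """Average the polarity of the lexicon words found in the text.
    Returns 0.0 when no lexicon word occurs."""
    words = re.findall(r"[a-z]+", text.lower())
    scores = [LEXICON[w] for w in words if w in LEXICON]
    return sum(scores) / len(scores) if scores else 0.0

def label(score: float) -> str:
    """Apply the same thresholds as the experiment's if/elif/else block."""
    if score > 0:
        return "Positive"
    if score < 0:
        return "Negative"
    return "Neutral"

for sentence in ["A great and useful tool", "Good idea, terrible result", "The weather report"]:
    print(sentence, "->", label(polarity(sentence)))
```

A text with no words in the lexicon scores exactly 0 and is labeled neutral, which is why short factual passages often come out neutral under this kind of scoring.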

Conclusion:
This experiment successfully demonstrates automatic word analysis using
machine learning and NLP techniques, covering:

✅ Tokenization
✅ Stopword Removal
✅ Stemming & Lemmatization
✅ Word Frequency Analysis
✅ Sentiment Analysis

This approach is widely used in text classification, chatbots, and AI-driven
language processing. 🚀

• Before Clustering: The dataset contained raw, ungrouped customer data
without clear segmentation.
• After Clustering: The K-Means algorithm successfully identified distinct customer
segments based on their annual income and spending patterns.
• Customer Segmentation: The segmented customers represent different shopper
profiles, allowing targeted marketing strategies to be devised.

Note: The labels assigned to clusters ("Budget Shoppers," "Impulse Buyers,"
etc.) are interpretations based on the visual representation of the clusters.
Further analysis and domain expertise might be required to refine the labels.

This experiment demonstrates how K-Means clustering can be employed to
understand customer behavior and identify distinct segments within a dataset.
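
As a rough illustration of what K-Means does on income and spending data, here is a minimal pure-Python sketch of the assignment and update steps (Lloyd's algorithm). The customers list is hypothetical toy data, not the experiment's dataset, and the kmeans helper with hand-picked starting centroids is an assumption made to keep the example deterministic; scikit-learn's KMeans chooses its starting points differently.

```python
import math

def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid
    and moving each centroid to the mean of its assigned points."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(iterations):
        # Assignment step: index of the nearest centroid for every point.
        labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Update step: mean of each cluster's members.
        new_centroids = []
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                new_centroids.append(centroids[k])  # leave an empty cluster in place
        if new_centroids == centroids:
            break  # converged: assignments can no longer change
        centroids = new_centroids
    return labels, centroids

# Hypothetical customers: (annual income in thousands, spending score 1-100).
customers = [(15, 80), (18, 75), (20, 85), (70, 20), (75, 15), (80, 25), (72, 90), (78, 85)]
labels, centers = kmeans(customers, centroids=[customers[0], customers[3], customers[6]])
print(labels)
print(centers)
```

The three resulting groups (low income with high spending, high income with low spending, high income with high spending) would still need a human to name them, which is the point of the note above about interpreting cluster labels.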

Experiment 4: Three assignments on designing neural networks for solving learning problems.

# Import necessary libraries

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Dense, Flatten, Conv2D, MaxPooling2D,
                                     Embedding, LSTM)
from tensorflow.keras.datasets import mnist, cifar10, imdb
from tensorflow.keras.preprocessing.sequence import pad_sequences
import numpy as np
import matplotlib.pyplot as plt

# -----------------------------
# Assignment 1: Feedforward NN for MNIST
# -----------------------------
print("Training Feedforward NN on MNIST...")

mean texture 0.323782 1.000000 ... 0.415185

Experiment 7: One Assignment to be done in Grouping

Grouping Data using K-Means Clustering

Objective:

To implement the K-Means Clustering algorithm for unsupervised
learning and to group similar data points based on selected features.

Software/Tools Required:

• Python
• Libraries: pandas, sklearn, matplotlib, seaborn

Dataset:

Use the Iris dataset from sklearn.datasets

Tasks to Perform:

1. Load the Iris dataset.
2. Choose two features for clustering.
3. Apply K-Means clustering to group data into clusters.
4. Visualize the clusters using a scatter plot.
5. Display the cluster centroids.
6. Evaluate the clustering using Silhouette Score.

Sample Python Code:

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.metrics import silhouette_score

# Load dataset
iris = load_iris()
df = pd.DataFrame(iris.data, columns=iris.feature_names)

# Select features for clustering

X = df[['petal length (cm)', 'petal width (cm)']]

# Apply KMeans
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)  # explicit n_init keeps behavior consistent across scikit-learn versions
df['cluster'] = kmeans.fit_predict(X)

# Visualize the clusters

plt.figure(figsize=(8,6))
sns.scatterplot(x='petal length (cm)', y='petal width (cm)',
hue='cluster', data=df, palette='Set2')
centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200,
marker='X', label='Centroids')
plt.title("K-Means Clustering")
plt.legend()
plt.show()

# Evaluate with Silhouette Score

score = silhouette_score(X, df['cluster'])
print("Silhouette Score:", score)

Sample Output:
Silhouette Score: 0.66

A scatter plot will display 3 groups of data points with distinct colors
and black centroids.

Expected Learning Outcomes:

• Understand unsupervised learning and clustering.
• Learn how to group similar data points without labeled data.
• Visualize and evaluate clustering performance using Silhouette Score.
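
For readers who want to see what the Silhouette Score measures, the sketch below computes it by hand. For each point, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to the members of any other cluster, and the point's silhouette is (b - a) / max(a, b); the score is the mean over all points. The function name mean_silhouette and the four toy points are hypothetical, and the code assumes at least two clusters and not all points identical.

```python
import math

def mean_silhouette(points, labels):
    """Mean silhouette coefficient using Euclidean distance."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)

    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # a singleton cluster contributes 0 by convention
        # a: mean distance to the other members of the same cluster
        # (the point's distance to itself is 0, so it only affects the divisor).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: mean distance to the nearest other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        total += (b - a) / max(a, b)
    return total / len(points)

# Hypothetical toy data: two tight pairs far apart.
pts = [(0, 0), (0, 1), (10, 0), (10, 1)]
good = mean_silhouette(pts, [0, 0, 1, 1])  # pairs grouped correctly
bad = mean_silhouette(pts, [0, 1, 0, 1])   # pairs split across clusters
print(round(good, 3), round(bad, 3))
```

Values near 1 indicate compact, well-separated clusters, values near 0 indicate overlapping clusters, and negative values suggest points assigned to the wrong cluster.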

