0% found this document useful (0 votes)
20 views8 pages

Aam Report

The micro project report details the implementation of the K-Nearest Neighbors (KNN) algorithm for clustering data using Python, focusing on its application in various fields such as customer segmentation and anomaly detection. The project outlines the methodology, resources used, and skills developed, including data preprocessing and visualization techniques. It emphasizes the benefits of understanding unsupervised machine learning and provides insights into the practical applications of the project outcomes.

Uploaded by

kolekarsiddhi056
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views8 pages

Aam Report

The micro project report details the implementation of the K-Nearest Neighbors (KNN) algorithm for clustering data using Python, focusing on its application in various fields such as customer segmentation and anomaly detection. The project outlines the methodology, resources used, and skills developed, including data preprocessing and visualization techniques. It emphasizes the benefits of understanding unsupervised machine learning and provides insights into the practical applications of the project outcomes.

Uploaded by

kolekarsiddhi056
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

A

Micro Project Report On

“Implement unsupervised machine learning algorithm (KNN) in python on dataset


to cluster data. (Assume suitable dataset.)”

IN FULFILLMENT OF DIPLOMA IN AIML ENGINEERING OF

MSBTE MUMBAI

Submitted by:

1) MANADE SAKSHI SHASHIKANT (3109)

2) KOLEKAR SIDDHI SANTOSH (3110)

3) BAMANE NEHA RAMESH (3111)

4) BHOSALE ADITEE SANGRAMSINH (3112)

Program name: Artificial Intelligence & Machine Learning (AIML)


Course name : Advanced Algorithms in AIML (22683)

Subject Teacher: MR. A.A. PARITEKAR

Department of AIML Engineering

Shree Swami Vivekanand Shikshan Sanstha’s

Dr. BAPUJI SALUNKHE INSTITUTE OF ENGINEERING AND TECHNOLOGY,


KOLHAPUR
Academic Year 2024-2025
Dr. BAPUJI SALUNKHE INSTITUTE OF ENGINEERING AND TECHNOLOGY’s

Academic Year 2024-2025

Maharashtra State Board of Technical Education, Mumbai

CERTIFICATE
This is to certify that the following students of first year AIML Engineering

1) MANADE SAKSHI SHASHIKANT (3109)

2) KOLEKAR SIDDHI SANTOSH (3110)

3) BAMANE NEHA RAMESH (3111)

4) BHOSALE ADITEE SANGRAMSINH (3112)

has successfully completed Micro-Project of Course Name- Advanced Algorithms in AIML


(22683) for the academic year 2024-25 as prescribed in the curriculum of
MSBTE.

SUBJECT TEACHER HOD PRINCIPAL

MR. A.A PARITEKAR MR. A.A PARITEKAR DR. S.G. SAPATE


INDEX

Sr.No. Contents Page No.

1. Rationale 1

2. Aims/Benefits 1

3. Course Outcomes Achieved 1

4. Literature Review 1

5. Actual Methodology 2
Followed

6. Actual Resources Used 2

7. Implementation / Outputs of 2-4

Micro-Project

8. Skills Developed 4

9. Application of Micro-Project 4-5


“Implement unsupervised machine learning algorithm (KNN) in python on dataset
to cluster data. (Assume suitable dataset.)”

1. Rationale: Machine learning plays a vital role in modern data analysis and pattern recognition. Clustering is an
essential technique used in various fields, including customer segmentation, anomaly detection, and
recommendation systems. This microproject focuses on implementing the K-Nearest Neighbors (KNN) algorithm
for clustering data using Python while understanding its advantages and limitations. The project aims to analyze
real-world datasets, apply KNN clustering, and visualize the results effectively.

2. Aims/Benefits of the Micro-Project


Aim: “To implement the K-Nearest Neighbors (KNN) algorithm for clustering data using Python and evaluate
its effectiveness.”

Benefits:
By completing this project, the following benefits can be expected:

1. Understanding the principles of unsupervised machine learning and clustering.


2. Learning how to apply KNN for clustering in real-world datasets.
3. Enhancing knowledge of Python libraries such as NumPy, pandas, scikit-learn, and Matplotlib.
4. Gaining hands-on experience in data visualization and model evaluation.

3. Course Outcomes Addressed


4. Understanding unsupervised learning algorithms and their applications.

a) Understanding unsupervised learning algorithms and their applications.


b) Implementing the K-Nearest Neighbors (KNN) algorithm for clustering
c) Analyzing clustering performance using evaluation metrics.
d) Applying data preprocessing techniques for better clustering results.

5. Literature Review

Sr.No Title of Book Resources Used Major Concepts

1 Machine Learning with Python Laptop, Internet Supervised & Unsupervised


Learning

2 Hands-On Machine Learning Online Research KNN, Clustering, Data


Preprocessing
6. Actual Methodology Followed:
1. Literature Search & Dataset Selection:
• Researched the KNN algorithm and its application in clustering.
• Chose a suitable dataset (e.g., Iris dataset, Customer Segmentation dataset).
2. Data Preprocessing:
 Loaded the dataset using pandas.
 Standardized the dataset using scikit-learn’s StandardScaler.
 Split the data into training and testing sets.
3. KNN Clustering Implementation:
• Used KNeighborsClassifier from scikit-learn.
• Defined the number of neighbors (K).
• Trained the model and predicted clusters.
4. Cluster Evaluation & Visualization:
 Visualized clusters using Matplotlib & Seaborn.
 Evaluated clustering using the silhouette score.

6. Actual Resources Used

Sr. No Name of Resource used Specification

1. Operating System Windows

2. Software Google Colab

3. Python Libraries NumPy, pandas, scikit-learn,


Matplotlib, Seaborn
7. Implementation / Outputs of the Micro-Project:
Output:-
 Successfully clustered the dataset into meaningful groups.
 Silhouette Score used to measure clustering performance.
 Visualization generated using Matplotlib and Seaborn.

8.0 Skills Developed / Learning Outcome of this Micro-Project:

 Machine Learning Expertise: Implemented unsupervised learning (KNN).


 Python Programming Skills: Worked with scikit-learn, pandas, and Matplotlib.
 Data Preprocessing Knowledge: Learned feature scaling and normalization.
 Cluster Evaluation Techniques: Used the silhouette score to analyze clustering.
8. Application of this Micro-Project:
This project can be applied in various fields, including:
1. Customer Segmentation: Clustering users based on purchasing behavior.
2. Anomaly Detection: Identifying outliers in network security.
3. Medical Data Analysis: Grouping patients based on symptoms.
4. Recommendation Systems: Clustering users for personalized recommendations.

Subject Teacher
MR. A.A PARITEKAR

You might also like