0% found this document useful (0 votes)
6 views4 pages

Capstone Projects

The document outlines various capstone project topics related to machine learning, including Linear Regression, Logistic Regression, KNN, SVM, Decision Tree, K-means, PCA, and CNN. Each topic includes specific tasks such as exploratory data analysis, model building, training, evaluation, and testing with associated datasets. A hint sheet link is also provided for additional guidance.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views4 pages

Capstone Projects

The document outlines various capstone project topics related to machine learning, including Linear Regression, Logistic Regression, KNN, SVM, Decision Tree, K-means, PCA, and CNN. Each topic includes specific tasks such as exploratory data analysis, model building, training, evaluation, and testing with associated datasets. A hint sheet link is also provided for additional guidance.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Capstone Project Topic

Submit by 09/03/2025

Select any one of the following.


Title Task Dataset
1. Linear • Exploratory data analysis for the car_price
Regression given data
dataset
Model • Data Visualization

• Identify Features & labels
• Making a car Price prediction ML
model
• Training the model
• Evaluation of the model: Confusion
matric, Classification Report
• Saving Model
• Plot regression line
• Testing on real-time data

2. Logistic • Exploratory data analysis for the Road


Regression given data
Accident
• Data Visualization
• Identify Features & labels Survival
• Develop a survival status prediction Dataset
model
• Supervised Logistic Regression
model
• Binary Class model (Yes/No)
• Confusion Matrix
• Classification Report
• Evaluation of the model
3. KNN • Exploratory data analysis for the Stroke
given data
Risk
• Identify Features & labels
• Data Visualization Predictio
• Building a KNN model for predicting n Dataset
stroke risk based on the symptoms Based on
• Training the model
Symptom
• Evaluation of the model Confusion
matric, Classification Report s
• Plot the clustering curve
• Saving Model
• Testing on real-time data
• Plot the clustering curve

4. SVM • Exploratory data analysis for the Human


given data
Activity
• Data Visualization
• Identify Features & labels Recogniti
• Building an SVM model for on Using
identifying the human activity Smartpho
• Training the model
nes - UCI
• Evaluation of the model Confusion
matric, Classification Report Machine
• Saving Model Learning
• Testing on real-time data Repositor
y
5. Decision • Exploratory data analysis for the Fraud
given data
Tree Detection
• Data Visualization
Transactio
• Building a decision tree model for ns
predicting the IP address flag Dataset
• Training the model
• Evaluation of the model Confusion
matric, Classification Report
• Plot the tree
• Saving Model
• Testing on real-time data

Online
• Exploratory data analysis for the
Retail -
given data
6. K-means • Data Visualization UCI
• Explain the feature set here Machine
• Prepare K-means clustering Learning
unsupervised model for mall sales
data
Repositor
• Data cleaning and processing y
• Removing the outlier from the
dataset using the concept box plot
• Finding the “K” value for cluster
formation using the elbow method
• Implementing the K-means
• Plot the cluster
• Evaluating model

7. PCA • Exploratory data analysis for the Hierarchi


given data
cal Sales
• Data Visualization
Data -
1. Explain the concept of UCI
features and clustering here. Machine
• Do Principal Component Analysis
(PCA) for mall data to reduce the
Learning
dimension of the dataset. Repositor
• Compute PC1, PC2… y
• Variance ratio has been computed

8. CNN • Building an image multi-class MNIST


classification model for MNIST data
Database
• Data Visualization
• Reading the image data of
• Processing of image data Handwrit
• Preparing dataset with labels
ten Digits
• Label Encoding
• Preparing CNN layers - UCI
• Compiling the CNN Machine
• Training the CNN model Learning
• Analyzing Accuracy & Loss Curve Repositor
• Testing the Model
• Evaluating the model
y
• Saving & Loading the Model

Hint Sheet link


https://colab.research.google.com/drive/1ymwpji6K6st0_fyHYG9cZG77YHtv54V8?usp=sharing

You might also like