0% found this document useful (0 votes)

15 views21 pages

Minor Project

Mini project

Uploaded by

Samiksha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views21 pages

Minor Project

Mini project

Uploaded by

Samiksha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 21

Predicting

Spinal
Abnormaliti
es using ML
Introduction:
Problem Statement
Lower Back Pain is one of the most common
afflictions and our project attempts to find a
solution to reduce it. The reason why this is a
major issue to be tackled:
➔ Occurs for a large variety of reasons in a large
range of age groups.
➔ Ailment is painful and long-lasting.
➔ Difficult to cure quickly and efficiently, if not
impossible.
➔ Preventative measures are superior to
curative measures.
Introduction:
Our Solution
As we have established, a
prevention is better than
cure. Hence we will
perform accurate
mathematical predictions.
We will utilize a variety of Machine
Learning Algorithms in order to predict
abnormal spine problems. This feature
will be made available in an
application.

This way, preventative steps can be

taken by anyone utilizing this
application.
Workflow of
Project
We followed a standard machine learning
methodology that is generally adopted:

1.Data Extraction - Data to be worked with is

retrieved, obtained or collected.

2. Data Cleaning - Corrupt and inaccurate data is

checked for and dealt with.

3. Feature Extraction - The data is altered into more

manageable groups.

4. Classification - Determine the type of ML Model

that is needed.

5. Supervised/Unsupervised ML - Fit the data to the

model.

6. Validation - Check if the obtained data matches

The Data Set
1

For the project we will be

using a data set consisting
of the biometric data of 310
patients: 210 Abnormal
Cases, 100 Normal Cases.
Biometric Data Present:
Pelvic Incidence, Pelvic Tilt,
Lumbar Lordosis Angle,
Pelvic Radius, etc.
Identifying and handling the missing
values
➔ In data preprocessing, it is pivotal to identify
and correctly handle the missing values,
failing to do this, you might draw inaccurate
and faulty conclusions and inferences from
the data.

➔ To make sure that there are no missing

values, we use a Data Frame class, that
contains a function isnull(), which checks for
NaN values and returns a boolean value
True or False. We further use the sum()
method to count the number of NaN, aka, the
null values.
Visualization of Data

● Histogram of each feature is plotted using the given dataset. It shows the range
of the values of the each feature. It is plotted with the help of the hist() function in
matplotlib.pyplot, which falls under the matplotlib library.

● Distribution and box plots are also plotted using the seaborn library and matplotlib
library respectively.

● All the features are plotted to find the bivariate relationships between the
combinations of variables using a scatter plot matrix.

● Finally, a heatmap is plotted to show the correlation among the features of the
dataset.
Data Visualization: Histogram
Data Visualization: Heatmap
Extraction of values and
splitting
We use the iloc[] function, of the Training set denotes the subset
Pandas library, to extract the of a dataset that is used for
selected rows and columns from training the machine learning
the dataset. model.
.iloc[] is primarily integer position A testing set is the subset of the
based(from 0 to length-1 of the dataset that is used for testing
axis), but may also be used with a the machine learning model.
boolean array.
Here, we split the dataset into a
66.67% section for training and
a 33.33% section for test.
Model Selection and
Discussions
Decision Tree Classifier

➔ Decision Trees are algorithms where the data is continuously split according to a
certain parameter.

➔ The tree is explained by two entities called decision nodes and leaves. The leaves
are the decisions or the final outcomes, and the decision nodes are where the data is
split.

➔ It partitions the tree in recursive manner called recursive partitioning. This flowchart-
like structure helps us in decision making.

➔ The time complexity of decision trees is a function of the number of records and
number of attributes in the given data. The decision tree is a distribution-free or non-
parametric method, which does not depend upon probability distribution
assumptions. Decision trees can handle high dimensional data with good accuracy.
Random Forest Classifier

➔ Random forests or random decision forests operate by constructing a multitude of

decision trees at training time and outputting the class that is the mode of the
classes (classification) or mean prediction (regression) of the individual trees.

➔ There are two stages in RF algorithm, one is random forest creation, and the other is
to make a prediction from the random forest classifier created in the first stage.

➔ We take the test features and use the rules of each randomly created decision tree to
predict the outcome and stores the predicted outcome (target)
K - nearest Neighbours (KNN)
Classifier
➔ K nearest neighbors is a simple algorithm that stores all available cases and
classifies new cases based on a similarity measure (e.g., distance functions).

➔ The data is classified by a majority vote of its neighbors, with the case being
assigned to the class most common amongst its K nearest neighbors measured by a
distance function.

➔ If k = 1, then the case is simply assigned to the class of its nearest neighbor.

➔ We also implement the weighted KNN algorithm in order to predict and classify the
data into normal and abnormal spines in this research.

➔ Choosing the number of nearest neighbors, which means determine the value of k
plays a crucial role in determining the efficacy of the model. A high k-value has an
advantage which includes reducing the variance due to the noisy data.
Deep Neural Network Classifier

➔ TensorFlow is a open-source deep learning library with tools for building almost any
type of neural network (NN) architecture. It builds a feedforward multilayer neural
network that is trained with a set of labeled data in order to perform classification on
similar, unlabeled data.

➔ When we create a DNNClassifier, we need to specify the feature columns (input

layer), the architecture of the neural network (hidden layers) and the number of
classes (output layer).

➔ Recall that the DNNClassifier builds a feedforward multilayer neural network, hence
when we call the function, we need to indicate how many hidden layers we want and
how many nodes there should be on each of the layer.
Model Evaluation and Selection
● .After exhaustive training on different classification algorithm We
choose a small DNN for this task.
● We first compared the models based on Accuracy on test set.
And if some models were going to be comparable we were going
to use evaluation based on F1 score and Area Under the Curve.
● But fortunately the small DNN out performed other models by
huge margin.So we directly choose it as the final Model.
Graphical User Interface &
Deployment
How to make the Application

● We made an simple Windows Application by which a person can

use our model get predictions.
● For Making the FrontEnd We used the the python Library called
Tkinter.
● Its a basic Basic GUI Library by which we can create simple and
clean Interface.
● We have also took care of some basic things such as you
can’t submit with all your column being empty.
● The Deep Learning Model was used to do the Job.
GUI
Conclusion and Future Scope
We have acquired an optimized model that gives 96.45% accuracy as well as a simple GUI
that can allow a user to interact with our application.

The following future scope and uses are possible for this project:

➔ Usable by any individuals who have performed preliminary health check-up.

➔ Can be integrated into a application using API, thus increasing platform reach.

➔ Can be utilized directly in medical and health-care industries for classification.

➔ Integrated into a larger scope personalized health-care application.(i.e Mobile App)

References
https://numpy.org/

https://pandas.pydata.org/

https://matplotlib.org/3.1.1/tutorials/introductory/pyplot.html

https://www.tensorflow.org/

https://deepai.org/machine-learning-glossary-and-terms/

Thank You

Machine Learning Guide: Meher Krishna Patel
No ratings yet
Machine Learning Guide: Meher Krishna Patel
121 pages
Course Work AI - Foundation
No ratings yet
Course Work AI - Foundation
12 pages
Ds Notes Mca
No ratings yet
Ds Notes Mca
30 pages
5 Markd
No ratings yet
5 Markd
24 pages
Phase 3 IBM
No ratings yet
Phase 3 IBM
7 pages
Approaching (Almost) Any Machine Learning Problem - Abhishek Thakur - No Free Hunch
No ratings yet
Approaching (Almost) Any Machine Learning Problem - Abhishek Thakur - No Free Hunch
22 pages
Module 5.pptx - 20250608 - 201231 - 0000
No ratings yet
Module 5.pptx - 20250608 - 201231 - 0000
43 pages
Mini Project 2024
No ratings yet
Mini Project 2024
48 pages
Lab 2
No ratings yet
Lab 2
17 pages
Final Research Paper
No ratings yet
Final Research Paper
3 pages
Lec 2
No ratings yet
Lec 2
13 pages
Beginner's Guide To Implementing A Simple Machine Learning Project - DeV Community
No ratings yet
Beginner's Guide To Implementing A Simple Machine Learning Project - DeV Community
9 pages
IEEE Conference Team ATOM
No ratings yet
IEEE Conference Team ATOM
5 pages
Disease Prediction Using Machine Learning
No ratings yet
Disease Prediction Using Machine Learning
4 pages
Jalali@mshdiua - Ac.ir Jalali - Mshdiau.ac - Ir: Data Mining
No ratings yet
Jalali@mshdiua - Ac.ir Jalali - Mshdiau.ac - Ir: Data Mining
50 pages
Lecture 7.2 - DTC Algorithm Implementation
No ratings yet
Lecture 7.2 - DTC Algorithm Implementation
7 pages
Machine Learning Project Checklist
No ratings yet
Machine Learning Project Checklist
30 pages
ML Practical
No ratings yet
ML Practical
61 pages
Module 4 - Supervised Learning - First ML Model
No ratings yet
Module 4 - Supervised Learning - First ML Model
23 pages
Slay The Day
No ratings yet
Slay The Day
21 pages
Draft Xai
No ratings yet
Draft Xai
16 pages
Classification
No ratings yet
Classification
36 pages
Experiment 8
No ratings yet
Experiment 8
4 pages
Introduction To Classification - PPT Slides 1
No ratings yet
Introduction To Classification - PPT Slides 1
62 pages
ML4 - Decision Trees & Random Forest
No ratings yet
ML4 - Decision Trees & Random Forest
44 pages
AI and ML Lab Ex3 To 12
No ratings yet
AI and ML Lab Ex3 To 12
27 pages
ML Practical Updated
No ratings yet
ML Practical Updated
64 pages
Machine Learning With PySpark and MLlib - Solving A Binary Classification Problem - by Susan Li - Towards Data Science
No ratings yet
Machine Learning With PySpark and MLlib - Solving A Binary Classification Problem - by Susan Li - Towards Data Science
10 pages
(REPORT) LAB - 2 - Decision - Tree
No ratings yet
(REPORT) LAB - 2 - Decision - Tree
17 pages
Session 1 Coding - Supervised Learning Recap and Code
No ratings yet
Session 1 Coding - Supervised Learning Recap and Code
25 pages
ML Practical 205160694034
No ratings yet
ML Practical 205160694034
33 pages
Model Learning Steps
No ratings yet
Model Learning Steps
12 pages
ML Report2
No ratings yet
ML Report2
21 pages
Decision Support
No ratings yet
Decision Support
21 pages
Machine Learning Lecture1 - 26-27 Aug
No ratings yet
Machine Learning Lecture1 - 26-27 Aug
30 pages
Machine Learning Practical
No ratings yet
Machine Learning Practical
59 pages
"Classifiers": R & D Project by Under The Guidance of
No ratings yet
"Classifiers": R & D Project by Under The Guidance of
59 pages
MLA Lab 6:-Implementation of Decision Tree
No ratings yet
MLA Lab 6:-Implementation of Decision Tree
16 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Machine Learning 1707965934
No ratings yet
Machine Learning 1707965934
15 pages
Team No-7
No ratings yet
Team No-7
12 pages
20AI16 - ML Record
No ratings yet
20AI16 - ML Record
24 pages
ML Book Notes
No ratings yet
ML Book Notes
9 pages
ML Important
No ratings yet
ML Important
11 pages
ML
No ratings yet
ML
8 pages
Project Occupancy Alfonso Vicente Aragues
No ratings yet
Project Occupancy Alfonso Vicente Aragues
18 pages
Data Algo Metrics
No ratings yet
Data Algo Metrics
5 pages
Module 4 - Classification
No ratings yet
Module 4 - Classification
10 pages
DWDM Unit-3
No ratings yet
DWDM Unit-3
9 pages
ML Notion 1
No ratings yet
ML Notion 1
18 pages
Slides On DataI
No ratings yet
Slides On DataI
33 pages
Week 7 Laboratory Activity
No ratings yet
Week 7 Laboratory Activity
12 pages
Diabetes Prediction
No ratings yet
Diabetes Prediction
15 pages
Divorce Prediction System: Devansh Kapoor 179202050
No ratings yet
Divorce Prediction System: Devansh Kapoor 179202050
12 pages
AI Lab M.Tech
No ratings yet
AI Lab M.Tech
29 pages
Deep Learning
No ratings yet
Deep Learning
25 pages
Advance AI and ML LAB
No ratings yet
Advance AI and ML LAB
16 pages
DWDM Unit 4
No ratings yet
DWDM Unit 4
22 pages
CHAPTER 4 Diabetes
No ratings yet
CHAPTER 4 Diabetes
6 pages
MLDA1
No ratings yet
MLDA1
8 pages
Java
No ratings yet
Java
114 pages
Chapter 2
No ratings yet
Chapter 2
43 pages
Recurrence Relations
No ratings yet
Recurrence Relations
59 pages
DATA STRUCTURE Notes 2
No ratings yet
DATA STRUCTURE Notes 2
16 pages
Edge Enhancement Based Transformer For Medical Image Denoising PDF
No ratings yet
Edge Enhancement Based Transformer For Medical Image Denoising PDF
8 pages
Improving Accuracy in Solving Feldtkeller Equation
No ratings yet
Improving Accuracy in Solving Feldtkeller Equation
4 pages
SVM Data Imputation
No ratings yet
SVM Data Imputation
6 pages
Prims Vs Kruskal
No ratings yet
Prims Vs Kruskal
3 pages
System of Linear Equations Matlab
No ratings yet
System of Linear Equations Matlab
3 pages
HK 10 Maths CH 2 Polynomial
No ratings yet
HK 10 Maths CH 2 Polynomial
3 pages
SYLLABUS
No ratings yet
SYLLABUS
3 pages
Using A Heuristic Approach To Design Personalized Urban Tourism Itineraries With Hotel Selection
No ratings yet
Using A Heuristic Approach To Design Personalized Urban Tourism Itineraries With Hotel Selection
14 pages
Determinnant 3 by 3 Matrix Practice
100% (1)
Determinnant 3 by 3 Matrix Practice
4 pages
Lesson 18
No ratings yet
Lesson 18
32 pages
Topic: Non-Negative Matrix Factorisation: Assignment - 2
No ratings yet
Topic: Non-Negative Matrix Factorisation: Assignment - 2
6 pages
CSE256
No ratings yet
CSE256
2 pages
Generalized Gaussian Quadrature Rules For Systems of Arbitrary Functions
No ratings yet
Generalized Gaussian Quadrature Rules For Systems of Arbitrary Functions
27 pages
Medium Level Array Practice
No ratings yet
Medium Level Array Practice
6 pages
Module 3 - DAA Vtu
No ratings yet
Module 3 - DAA Vtu
80 pages
Data Mining
No ratings yet
Data Mining
18 pages
Coding Theory Binary Linear Codes
100% (1)
Coding Theory Binary Linear Codes
5 pages
Numerical Methods I
No ratings yet
Numerical Methods I
44 pages
Solution CRC
No ratings yet
Solution CRC
3 pages
The Design Revolution of Logarithmic Number System Architecture
No ratings yet
The Design Revolution of Logarithmic Number System Architecture
7 pages
Lung Cancer Detection Using Image Processing Synopsis Report
No ratings yet
Lung Cancer Detection Using Image Processing Synopsis Report
19 pages
DSP Tut1 Questions
No ratings yet
DSP Tut1 Questions
3 pages
Reduction
No ratings yet
Reduction
91 pages
DS CSIT Lecture 06
No ratings yet
DS CSIT Lecture 06
17 pages
Backtracking and Branch and Bound Final
No ratings yet
Backtracking and Branch and Bound Final
63 pages
Chapter-1 2
No ratings yet
Chapter-1 2
79 pages
An Introduction To The Extended Kalman Filter
No ratings yet
An Introduction To The Extended Kalman Filter
4 pages
17-BFS, DFS-05-02-2025
No ratings yet
17-BFS, DFS-05-02-2025
32 pages
Nitte Meenakshi Institute of Technology
No ratings yet
Nitte Meenakshi Institute of Technology
13 pages
CS502 Fundamentals of Algorithms 2013 Final Term Questions Answers Solved With References by Moaaz
100% (1)
CS502 Fundamentals of Algorithms 2013 Final Term Questions Answers Solved With References by Moaaz
19 pages

Minor Project

Uploaded by

Minor Project

Uploaded by

Predicting

This way, preventative steps can be

1.Data Extraction - Data to be worked with is

2. Data Cleaning - Corrupt and inaccurate data is

3. Feature Extraction - The data is altered into more

4. Classification - Determine the type of ML Model

5. Supervised/Unsupervised ML - Fit the data to the

6. Validation - Check if the obtained data matches

For the project we will be

➔ To make sure that there are no missing

➔ Random forests or random decision forests operate by constructing a multitude of

➔ When we create a DNNClassifier, we need to specify the feature columns (input

● We made an simple Windows Application by which a person can

➔ Usable by any individuals who have performed preliminary health check-up.

➔ Can be utilized directly in medical and health-care industries for classification.

➔ Integrated into a larger scope personalized health-care application.(i.e Mobile App)

You might also like