International Journal of Computer Applications Technology and Research

Volume 8–Issue 09, 385-388, 2019, ISSN:-2319–8656

Wine Quality Prediction using Machine Learning Algorithms

Devika Pawar[1], M.Sc. (Big Data Analytics), MIT-WPU, Pune, India
Aakanksha Mahajan[2], M.Sc. (Big Data Analytics), MIT-WPU, Pune, India
Sachin Bhoithe[3], Faculty of Science, MIT-WPU, Pune, India

Abstract: Wine classification is a difficult task, since taste is the least understood of the human senses. A good wine quality prediction can be very useful in the certification phase, because sensory analysis is currently performed by human tasters and is clearly a subjective approach. An automatic predictive system can be integrated into a decision support system, improving the speed and quality of the assessment. Furthermore, a feature selection process can help to analyze the impact of the analytical tests. If several input variables are found to be highly relevant for predicting wine quality, this information can be used to improve the wine, since some of these variables can be controlled in the production process. The classification models used here are 1) Random Forest, 2) Stochastic Gradient Descent, 3) Support Vector Classifier (SVC) and 4) Logistic Regression.

Keywords: Machine Learning, Classification, Random Forest, SVM, Prediction.

I. INTRODUCTION

The aim of this project is to predict the quality of wine on a scale of 0–10 given a set of features as inputs. The dataset used is the Wine Quality Data Set from the UCI Machine Learning Repository. The input variables are fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulphur dioxide, total sulphur dioxide, density, pH, sulphates and alcohol. The output variable is quality (a score between 0 and 10). We are dealing only with red wine. In this dataset, quality takes one of the values [3, 4, 5, 6, 7, 8]; the higher the value, the better the quality. In this project we treat each class of wine separately, and the aim of the classifiers is to find decision boundaries that work well for new, unseen data.

In this paper we explain the steps we followed to build our models for predicting the quality of red wine in a simple, non-technical way. We would follow a similar process for white wine, or we could even mix the two together and include a binary red/white attribute, but our domain knowledge about wines suggests that we shouldn't. Classification is used to classify the wine as good or bad. Because the classes are determined before the data is examined, this is referred to as supervised learning.

II. RELATED WORK

Various researchers and students have published related work in national and international research papers and theses. We reviewed this work to understand its objectives, the types of algorithm used and the techniques applied for pre-processing.

Researchers at the College of Intelligent Science and Engineering, China, wrote a paper on "Evaluation and Analysis Model of Wine Quality Based on Mathematical Model". They used various mathematical tests to predict the quality of wine. The Mann-Whitney U test is used to analyze the wine evaluation results of the two wine tasters, and it is found that the difference between the two is small. The paper then uses the Cronbach Alpha coefficient method to analyze the credibility of the two groups of data.[1]

Paulo Cortez, Juliana Teixeira, António Cerdeira, Fernando Almeida, Telmo Matos and José Reis wrote a paper on wine quality assessment using data mining techniques. In this paper, they proposed a data mining approach to predict wine preferences that is based on easily available analytical tests at the certification step. A large dataset was considered, with white vinho verde samples from the Minho region of Portugal. Wine quality is modeled under a regression approach, which preserves the order of the grades. An accuracy of 95% was obtained using these data mining techniques.[2]

A further study was published in the International Journal of Intelligent Systems and Applications in Engineering on 3rd September 2016. The main objective of this research paper was to predict wine quality based on physicochemical data. In this study, two large separate data sets taken from the UC Irvine Machine Learning Repository were used. The instances were successfully classified as red wine and white wine with an accuracy of 99.5229% by using the Random Forests algorithm.[3]

III. PROPOSED WORK

A. Data Set:

Dataset/Source: Kaggle
https://www.kaggle.com/uciml/red-wine-quality-cortez-et-al-2009

Structured/Unstructured data: Structured data in CSV format.
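As a minimal sketch of how a structured CSV file of this kind could be read, the snippet below parses records into feature dictionaries and integer quality scores using only the Python standard library. The column names are an assumption (they are taken to match the input variables named in this paper), the helper `load_wine_rows` is hypothetical, and the two sample rows are made up for illustration; they are not taken from the dataset.

```python
import csv
import io

# Two made-up rows in the assumed column layout, for illustration only.
SAMPLE_CSV = """fixed acidity,volatile acidity,citric acid,residual sugar,chlorides,free sulfur dioxide,total sulfur dioxide,density,pH,sulphates,alcohol,quality
7.0,0.60,0.10,2.0,0.08,12,40,0.997,3.4,0.6,9.5,5
8.0,0.30,0.45,2.2,0.07,15,35,0.995,3.2,0.8,12.0,7
"""


def load_wine_rows(handle):
    """Read wine records from an open CSV file into (features, quality) pairs."""
    rows = []
    for record in csv.DictReader(handle):
        # The output variable is an integer score; all inputs are numeric.
        quality = int(record.pop("quality"))
        features = {name: float(value) for name, value in record.items()}
        rows.append((features, quality))
    return rows


rows = load_wine_rows(io.StringIO(SAMPLE_CSV))
print(len(rows), "rows,", len(rows[0][0]), "input variables")
```

To read a downloaded file instead of the in-memory sample, the same function can be passed a handle opened with `open(path, newline="")`, where `path` is wherever the CSV was saved.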

Dataset Description: The dataset is related to the red variant of the Portuguese "Vinho Verde" wine. For more details, consult the reference [Cortez et al., 2009]. Due to privacy and logistic issues, only physicochemical (input) and sensory (output) variables are available (e.g. there is no data about grape types, wine brand, wine selling price, etc.).

The dataset can be viewed as a classification or a regression task. The classes are ordered and not balanced (e.g. there are many more normal wines than excellent or poor ones). Outlier detection algorithms could be used to detect the few excellent or poor wines. Also, we are not sure if all input variables are relevant, so it could be interesting to test feature selection methods.

Input variables:
1) fixed acidity
2) volatile acidity
3) citric acid
4) residual sugar
5) chlorides
6) free sulfur dioxide
7) total sulfur dioxide
8) density
9) pH
10) sulphates
11) alcohol
Output variable (based on sensory data):
12) quality (score between 0 and 10)

IV. DATA PROCESSING METHODS

For making automated decisions on model selection we need to quantify the performance of our model and give it a score. For that reason, for the classifiers, we are using the F1 score, which combines two metrics: Precision, which expresses how accurate the model was when predicting a certain class, and Recall, which expresses what share of the instances of that class the model actually found (the inverse of the regret of missing instances that are misclassified). Since we have multiple classes we have multiple F1 scores. We use the unweighted mean of the F1 scores for our final scoring. This is a business decision: we want our models to be optimized to classify instances that belong to the minority classes, such as wine quality of 3 or 8, as well as the qualities that are represented in larger numbers. For the regression task we score based on the coefficient of determination, which measures how well the predictions account for the variation in the actual values. The larger this coefficient the better. For regressors we can also get an F1 score if we first round the predictions.

Splitting for Testing: We keep 20% of our dataset aside to treat as unseen data and to test the performance of our models. We split the dataset such that all wine qualities are represented proportionally in both the training and the testing dataset. Other than that, the selection is done randomly with a uniform distribution.

Various classification and regression algorithms are used to fit the model. The algorithms used in this paper are as follows:

For classification:
Random Forest Decision Trees classifier
Support Vector Machine classifier
Stochastic gradient descent
Logistic Regression classifier

Preprocessing: Label encoding is used to convert the labels into numeric, machine-readable form. It is an important pre-processing step for a structured dataset in supervised learning. We have used label encoding to label the quality of each wine as good or bad, assigning 1 to good and 0 to bad.

Feature Selection:

As we can clearly see, volatile acidity and residual sugar both have little impact on the quality of wine. Hence we can eliminate these features. Although we are selecting these features ourselves, the choice may change according to domain experts.
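The good/bad label encoding and the proportional 20% hold-out described in this section can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the authors' code: the paper does not state which quality scores count as good, so `GOOD_THRESHOLD = 7` is a hypothetical cut-off, the helper names are invented here, and the quality scores are made up.

```python
import random
from collections import defaultdict

GOOD_THRESHOLD = 7  # assumption: the paper does not state its cut-off


def encode_quality(score, threshold=GOOD_THRESHOLD):
    """Label encoding as described in the text: 1 = good, 0 = bad."""
    return 1 if score >= threshold else 0


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) with each class kept in proportion."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    rng = random.Random(seed)
    train_idx, test_idx = [], []
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)  # uniform random selection within the class
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Made-up quality scores, for illustration only (not taken from the dataset).
scores = [5] * 40 + [6] * 40 + [7] * 15 + [8] * 5
labels = [encode_quality(s) for s in scores]
train_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(test_idx))
print(sum(labels[i] for i in test_idx), "good wines in the test set")
```

Splitting class by class is what keeps the minority class present in the test set at the same rate as in the full data; a plain random split could leave it under-represented by chance.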

Result and Discussion:

Exploratory Data Analysis:

• The bar plot below shows the count of wines labelled good or bad. We can see that 80% of the data is classified as good quality wine and 20% as bad quality wine.

• This bar plot shows a positive relation between citric acid and quality. As the quality of wine increases, the amount of citric acid also increases, which shows that citric acid is an important feature on which the quality of wine depends.

• Free sulphur dioxide contributes greatly to the quality of wine; this bar plot gives us a clearer picture.

Algorithms used for classification are:
1) Logistic Regression
2) Stochastic gradient descent
3) Support Vector Classifier
4) Random Forest

• Logistic Regression gave us an accuracy of 86%.

Performance matrix of Logistic Regression:

Class  Precision  Recall  F1-Score  Support
0      0.88       0.98    0.93      273
1      0.71       0.26    0.37      47

• Stochastic gradient descent was able to give an average accuracy of 81%.

Performance matrix of SGD:

Class  Precision  Recall  F1-Score  Support
0      0.89       0.93    0.91      273
1      0.42       0.30    0.35      47

• Support Vector Classifier has given an accuracy of 85%.

Performance matrix of SVC:

Class  Precision  Recall  F1-Score  Support
0      0.89       0.93    0.91      273
1      0.71       0.26    0.37      47

• Random Forest gave us an accuracy of 87.33%.

Class  Precision  Recall  F1-Score  Support
0      0.90       0.97    0.93      273
1      0.68       0.40    0.51      47
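The F1 column in the performance matrices, and the unweighted mean of per-class F1 scores used for model selection, can be recomputed from the reported precision and recall. The sketch below does this in plain Python. The function names are illustrative, the inputs are the rounded figures copied from the tables, and so a recomputed value may differ from a printed one in the last digit.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def macro_f1(per_class):
    """Unweighted mean of the per-class F1 scores."""
    scores = [f1_score(p, r) for p, r in per_class]
    return sum(scores) / len(scores)


# (precision, recall) for class 0 and class 1, copied from the tables above.
reported = {
    "Logistic Regression": [(0.88, 0.98), (0.71, 0.26)],
    "Stochastic gradient descent": [(0.89, 0.93), (0.42, 0.30)],
    "Support Vector Classifier": [(0.89, 0.93), (0.71, 0.26)],
    "Random Forest": [(0.90, 0.97), (0.68, 0.40)],
}

for name, per_class in reported.items():
    print(f"{name}: macro F1 = {macro_f1(per_class):.2f}")
```

On these figures Random Forest also has the highest unweighted mean F1, mainly because its recall on class 1 (0.40) is higher than that of the other three models.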

CONCLUSION
Based on the bar plots, we come to the conclusion that not all input features are essential for predicting quality. For example, from the bar plot of residual sugar against quality we see that residual sugar stays moderate as quality increases and does not change drastically. This feature is therefore less essential than others such as alcohol and citric acid, and we can drop it during feature selection.

For classifying the wine quality, we have implemented multiple algorithms, namely
1) Logistic Regression
2) Stochastic gradient descent
3) Support Vector Classifier
4) Random Forest

We achieved the maximum accuracy, 87.33%, using Random Forest. Stochastic gradient descent gave an accuracy of 81%, SVC 85% and Logistic Regression 86%.

References:
[1] Yunhui Zeng, Yingxia Liu, Lubin Wu, Hanjiang Dong. "Evaluation and Analysis Model of Wine Quality Based on Mathematical Model", ISSN 2330-2038, E-ISSN 2330-2046, Jinan University, Zhuhai, China.

[2] Paulo Cortez, Juliana Teixeira, António Cerdeira. "Using Data Mining for Wine Quality Assessment".

[3] Yesim Er, Ayten Atasoy. "The Classification of White Wine and Red Wine According to Their Physicochemical Qualities", ISSN 2147-6799, 3rd September 2016.
