Predicting The Outcome of English Premier League Matches Using Machine Learning
Muntaqim Ahmed Raju∗, Md. Solaiman Mia†, Md. Abu Sayed‡ and Md. Riaz Uddin§
∗Department of Computer Science and Engineering, Dhaka International University, Bangladesh
†Assistant Professor, Department of Computer Science and Engineering, Green University of Bangladesh, Bangladesh
‡§Lecturer, Department of Computer Science and Engineering, Dhaka International University, Bangladesh
Abstract—English Premier League (EPL) is the world's most popular football league. Since it is such a prominent league, there have been a variety of preceding endeavors, both commercial and scholastic, to predict EPL match results. In this paper, machine learning, a promising tool of the fourth industrial revolution (Industry 4.0), has been used to introduce a model for predicting the outcomes of EPL matches both in multi-class (home, draw, and away) and in binary-class (home, and not-home) with the last five seasons of football matches. We have employed five machine learning algorithms along with different machine learning techniques, ranging from data pre-processing to hyper-parameter optimization, to find the best results. In addition, the comparative results demonstrate that our proposed model gives 70.27% accuracy in multi-class and 77.43% accuracy in binary-class compared to the best known existing models in the literature.

Keywords—Football Prediction, English Premier League, Machine Learning, Data Mining

I. INTRODUCTION

English Premier League (EPL), with a potential television audience of 4.7 billion people, is the highest tier of the English football league system and the world's largest sports community. There is a great deal of madness among people about it, which is why predicting the outcome of EPL matches is such a huge phenomenon among football fans. Hence, a lot of football supporters and experts have been giving predictions in different ways about who is going to win before the match begins. In fact, there is a whole industry around it: there are pre-match and post-match analyses by commentators to anticipate who is going to win, and channels like ESPN are committed to trying to predict who is going to dominate a game. This craze has been going on for as long as the game has existed.

Artificial Intelligence (AI) is the brain behind Industry 4.0. Machine Learning, a subset of AI, has emerged as a promising tool in intelligent predictive applications for smart manufacturing in Industry 4.0. Thus, in this paper, we have used an intelligent machine learning predictive model which attempts to anticipate who is going to win. The predominant objective of this work is to accurately determine the outcome in multi-class and binary-class of EPL matches. Initially, a survey of the last five seasons of the English Premier League has been conducted. Since then, we have explored a wide variety of soccer blogs, pre-match, and post-match analyses to find out the key factors for predicting football match results. We have employed the feature-engineering technique to create the most substantial features. Thereafter, feature scaling is carried out by using min-max normalization to scale all the features. Uni-variate feature selection based on the Chi-Square statistical test is used as a feature selection method for choosing the best viable feature set, primarily based on the scores of their correlation with the outcome variable. Then, to perceive the most promising method of prediction, we have monitored five different machine learning algorithms: Support Vector Machine (SVM), Logistic Regression (LR), Naive Bayes (NB) classifier, Decision Tree Classifier (DTC) and AdaBoost Classifier (ABC). Lastly, the hyper-parameter optimization technique is used to achieve the best possible hyper-parameters of every model.

A variety of experiments to forecast football matches have been carried out in the literature. In the exploration [1], the authors discussed the prediction of football matches using tree-based model algorithms such as C5.0, random forest, and extreme gradient boosting, and the best accuracy, 68.55%, was generated by the random forest algorithm. However, the primary downside of that analysis is its feature collection: the model of [1] cannot be used to determine football matches before the game starts because it uses features such as home team shots, away team shots, home team corners, away team corners, etc., which are not available before the match starts. The accuracy of our study is higher than the study mentioned, and our model is capable of predicting football matches before the game starts. Some studies [2], [3] have been conducted using different algorithms, but the predictive accuracy is low, i.e., only 59% and 58.5%, respectively. In another research [4], the output of football matches has some limitations: the algorithm used is LR, which gives only two results, i.e., home or not-home, while in a football match there are three possible outcomes, home win, away win, or draw. In this paper, we will discuss prior works before analyzing feature selection, discussing the performance of various models, and analyzing our results.

II. LITERATURE REVIEW

Various studies have been conducted to find the criteria for foreseeing the result of football matches more exactly.
The following investigations have been conducted to locate an ideal model for the prediction of football matches.

Alfredo et al. [1] discussed football match prediction using tree-based model algorithms such as C5.0, random forest, and extreme gradient boosting. The backward wrapper method was used as a feature selection methodology to assist in picking the best features to improve the accuracy of the model. This study used 10 seasons of EPL football match history with 15 initial features to predict the match results (home win, away win or draw). The random forest algorithm generated the best accuracy of 68.55%, whereas the C5.0 algorithm had the lowest accuracy of 64.87% and the extreme gradient boosting algorithm provided 67.89% accuracy.

Sathe et al. [2] prepared a dataset to predict the outcome (home win, away win or draw) of EPL matches by web crawling team ratings from sofifa and considering the performance of each team at the home field and away field. Their final dataset consists of FIFA ratings of each team along with their performances over the last 10 seasons. They used three machine learning classification methods, which are Support Vector Machine (SVM), Naive Bayes (NB), and Random Forest (RF). The best accuracy obtained is 59% with the SVM method.

Similarly, Baboota et al. [3] worked on building a generalized predictive model for predicting the results (home win, away win or draw) of the English Premier League. They used data from 2005 to 2016, spanning 11 seasons. They divided their dataset into nine seasons of training data from 2005 to 2014, and kept the remaining two seasons from 2014 to 2016 as test data. Using feature engineering and exploratory data analysis, they created a feature set for determining the most important factors for predicting the result of a football match, and consequently created a highly accurate predictive system using machine learning. Their best model, using gradient boosting, produced an accuracy of 58.5%.

Rana et al. [4] described a Logistic Regression model to predict match outcomes (home, not-home) of the English Premier League. They used SVM, XGBoost and Logistic Regression classifiers for primary classification of the data, and then selected the best algorithm out of these three to predict the appropriate label. The application of these classifiers is done on real team data gathered from football-data.co.uk for the seasons ranging from 2003-04 to 2018-19. The prediction accuracy of the built model is 65.63%.

III. GOAL OF THE STUDY

The main goal of this work is to create the most influential features through feature engineering to accurately determine the outcome in multi-class and binary-class of EPL matches. None of the existing works mentioned in Section II worked for both multi-class and binary-class. Since football is a very adaptive game, we have designed our model in such a way that very recent data has been added to the model. It will be possible to predict every new season with the help of the most influential features.

IV. RESEARCH METHODOLOGY

In this section, we have presented the proposed examinations of this exploration, which employ five different popular machine learning algorithms.

A. Data Collection

The dataset employed in this research originates from DataHub.io, which is a typical dataset to be utilized in football match prediction research. The data used is based on five seasons of EPL matches, from the 2014-2015 season to the 2018-2019 season. The total number of data used for this whole investigation is 1870 historical match records.

B. Data Preprocessing

The dataset used in this research needed to be preprocessed since it was composed of several features of each season. Many of these features, such as match date, referee name, football team name, and bookmaker odds, were practically superfluous. In this process, our essential assignment was to remove the irrelevant attributes or features which had no impact on the model development and keep only the attributes or features we particularly needed. From the retained attributes, feature engineering was done to make the final features that were utilized for model advancement.

1) Feature Engineering: Feature engineering is an important but labor-intensive component of machine learning applications [5]. To use feature engineering, a model's feature vector is expanded by adding new features that are computed based on other properties [6]. The final 23 features have been established as mathematical conversions of our retained attributes. Some of the features are given below:

• Home team goals scored per game at home: It is a function of home team goals scored at home and home team matches played at home. It helps to predict or forecast the number of goals that may be scored by a home team at home.
• Home team goals conceded per game at home: This is based on home team goals conceded at home and home team matches played at home. It allows estimating or determining how many goals a home team would potentially concede at home.
• Home team win percentage: It is a function of the total wins and the total matches played by the home team. It indicates the home team's possibility of winning its future matches.
• Away team win percentage: This is a measure of the total wins and total matches played by the away team. It provides the potential of an away team to win future matches.

2) Feature Scaling: Feature scaling is the technique of standardizing individual features over a defined range [7]. For the scaling intent of this study, we have exercised min-max normalization. It is a technique that scales an element or observation into the range of 0 and 1 [8]. The mathematical equation for min-max normalization is:

x_new = (x_i - min(x)) / (max(x) - min(x))    (1)
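To make the preprocessing and feature-engineering steps concrete, the following is a minimal sketch of how features of this kind could be derived from a raw match table with pandas. The column names (HomeTeam, AwayTeam, FTHG, FTAG, FTR) and the file name are assumptions in the spirit of public EPL result files; the paper does not publish its code, so this is an illustration rather than the authors' exact implementation.

```python
import pandas as pd

# Hypothetical raw columns: HomeTeam, AwayTeam, FTHG (full-time home goals),
# FTAG (full-time away goals), FTR (full-time result: 'H', 'D', 'A').
matches = pd.read_csv("epl_2014_2019.csv")

# Drop attributes the paper treats as superfluous (date, referee, odds, etc.).
matches = matches[["HomeTeam", "AwayTeam", "FTHG", "FTAG", "FTR"]]

def home_team_stats(df: pd.DataFrame) -> pd.DataFrame:
    """Aggregate per-team statistics from the home side's perspective."""
    grouped = df.groupby("HomeTeam")
    return pd.DataFrame({
        # Home team goals scored per game at home
        "home_goals_scored_pg": grouped["FTHG"].sum() / grouped.size(),
        # Home team goals conceded per game at home
        "home_goals_conceded_pg": grouped["FTAG"].sum() / grouped.size(),
        # Simplified win percentage: home wins only (the paper's version
        # also counts away wins).
        "home_win_pct": grouped["FTR"].apply(lambda r: (r == "H").mean()) * 100,
    })

print(home_team_stats(matches).head())
```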
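Equation (1) maps directly onto scikit-learn's MinMaxScaler. The sketch below assumes the engineered features are held in a small pandas DataFrame (toy values only) and rescales every column into [0, 1], matching the min-max normalization described above.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Toy feature matrix standing in for the paper's 23 engineered features.
X = pd.DataFrame({
    "home_goals_scored_pg": [1.2, 2.5, 0.8, 1.9],
    "home_win_pct": [40.0, 65.0, 25.0, 55.0],
})

# MinMaxScaler implements x_new = (x - min(x)) / (max(x) - min(x)), i.e. Eq. (1).
scaler = MinMaxScaler(feature_range=(0, 1))
X_scaled = pd.DataFrame(scaler.fit_transform(X), columns=X.columns)
print(X_scaled)

# Equivalent manual computation for a single column.
col = X["home_win_pct"].to_numpy()
manual = (col - col.min()) / (col.max() - col.min())
assert np.allclose(manual, X_scaled["home_win_pct"])
```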
3) Feature Selection: Feature selection is the process of selecting a subset of relevant features which contribute most to the prediction variable or output [9]. In this study, we have used uni-variate feature selection [10] based on the Chi-Square statistical test, which picks up the intrinsic properties of the features. The features with the highest Chi-Square statistical test scores are illustrated in Fig. 1.

D. Models

For the intent of this analysis, we have primarily employed five mainstream supervised machine learning algorithms [12] (SVM, LR, NB classifier, DTC, ABC) to address our classification problem.

1) Support Vector Machine (SVM): The SVM is a kernel-based learning algorithm that addresses classification and regression problems. It produces ideal separating boundaries between data sets by resolving a quadratic optimization problem. The algorithm characterizes the best hyper-plane, which divides the points with a maximum margin associated with the different class labels [13]. SVM is a predictive data classification algorithm, so we have examined it to take care of our classification issues as well. For building a model with SVM, we have taken advantage of the 23 features that we have already developed through feature engineering. Thereafter, we have tuned the hyper-parameters using grid search with k-fold cross-validation (we used a k-value of 10), where the best hyper-parameters were C = 1, gamma = 0.1, kernel = 'sigmoid', as illustrated in Fig. 2.
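The uni-variate Chi-Square selection described in Section IV-B3 corresponds to scikit-learn's SelectKBest with the chi2 score function. The sketch below is an assumed illustration on stand-in data; the number of retained features (k = 10 here) is a placeholder, not a value reported in the paper. Chi-Square requires non-negative inputs, which the min-max scaled features satisfy.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, chi2

# Stand-ins: min-max normalized feature matrix (1870 matches x 23 features)
# and outcome labels (0 = away win, 1 = draw, 2 = home win).
rng = np.random.default_rng(0)
X_scaled = rng.random((1870, 23))
y = rng.integers(0, 3, size=1870)

# Score every feature with the Chi-Square statistic and keep the top k.
selector = SelectKBest(score_func=chi2, k=10)   # k is an assumed value
X_selected = selector.fit_transform(X_scaled, y)

# Per-feature Chi-Square scores (the quantity visualized in Fig. 1).
for idx in np.argsort(selector.scores_)[::-1][:10]:
    print(f"feature {idx}: chi2 = {selector.scores_[idx]:.3f}")
```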
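A minimal sketch of the grid search with 10-fold cross-validation used to tune the SVM follows. The parameter grid is an assumption (the paper only reports the optimum it found), and the data are stand-ins; the comments note the best combination reported in the paper (C = 1, gamma = 0.1, kernel = 'sigmoid').

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# Stand-ins for the selected features and multi-class labels.
rng = np.random.default_rng(0)
X_selected = rng.random((1870, 10))
y = rng.integers(0, 3, size=1870)

# Assumed search space; it contains the paper's reported optimum.
param_grid = {
    "C": [0.1, 1, 10],
    "gamma": [1, 0.1, 0.01],
    "kernel": ["rbf", "sigmoid", "poly"],
}

# Exhaustive grid search scored by accuracy over 10 folds (k = 10).
grid = GridSearchCV(SVC(), param_grid, cv=10, scoring="accuracy", n_jobs=-1)
grid.fit(X_selected, y)

print("Best hyper-parameters:", grid.best_params_)
print("Best cross-validated accuracy:", grid.best_score_)
```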
Fig. 3. Grid Search for Logistic Regression model's hyper-parameters.

Using the features chosen through the feature selection process, the NB model achieved 67.71% accuracy in multi-class and 74.92% accuracy in binary-class, respectively.

4) Decision Tree Classifier (DTC): The DTC algorithm represents a function that takes as input a vector of attribute values and returns a single decision output value. A decision tree reaches its decision by performing a sequence of tests [16]. It can be used to solve both regression and classification problems. Since our problem is also a classification problem, we have built a predictive model utilizing DTC. We have tuned its hyper-parameters to control the learning process using grid search, where the best hyper-parameters were min_samples_split = 200, criterion = 'gini', min_samples_leaf = 1, as illustrated in Fig. 5.
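As with the SVM, the decision tree hyper-parameters can be tuned with a grid search over 10 folds; the sketch below assumes a small illustrative grid around the optimum the paper reports (min_samples_split = 200, criterion = 'gini', min_samples_leaf = 1) and uses the same stand-in data as before.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

# Stand-ins for the selected features and multi-class labels.
rng = np.random.default_rng(0)
X_selected = rng.random((1870, 10))
y = rng.integers(0, 3, size=1870)

# Assumed search space containing the reported best combination.
param_grid = {
    "criterion": ["gini", "entropy"],
    "min_samples_split": [2, 50, 100, 200],
    "min_samples_leaf": [1, 5, 10],
}

grid = GridSearchCV(DecisionTreeClassifier(random_state=0), param_grid,
                    cv=10, scoring="accuracy", n_jobs=-1)
grid.fit(X_selected, y)
print("Best hyper-parameters:", grid.best_params_)
```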
According to the accuracy rates of each model, the feature selection process has slightly increased the prediction performance.

V. EXPERIMENTAL RESULTS AND ANALYSIS

This section exhibits our research findings as well as a comparative analysis with the existing models. Since the features chosen through the feature selection process slightly increased the performance, we have employed those selected features for the evaluation of each model.

A. Evaluation of Each Model for Multi-Class

The total number of matches was 1870, which consisted of 861 home team wins, 565 away team wins, and 444 draws. The 10-fold cross-validation method with the confusion matrix was executed to measure the efficiency of each classification model. The performance of each model for multi-class is shown from Table I to Table V.

TABLE I
PERFORMANCE OF THE SVM MODEL

Class      Precision   Recall    F1-Score
Away       70%         68%       69%
Draw       43%         68%       52%
Home       82%         70%       76%
Average    65%         68.66%    65.66%

TABLE II
PERFORMANCE OF THE LR MODEL

Class      Precision   Recall    F1-Score
Away       69%         71%       70%
Draw       42%         71%       53%
Home       85%         70%       77%
Average    65.33%      70.66%    66.66%

TABLE III
PERFORMANCE OF THE NB MODEL

Class      Precision   Recall    F1-Score
Away       72%         65%       69%
Draw       46%         59%       52%
Home       76%         73%       74%
Average    64.66%      65.66%    65%

TABLE IV
PERFORMANCE OF THE DTC MODEL

TABLE V
PERFORMANCE OF THE ABC MODEL

TABLE VI
THE RESULTS OF THE PREDICTION PROCESS IN MULTI-CLASS

Model   Accuracy   Precision   Recall    F1-Score
SVM     69.15%     65%         68.66%    65.66%
LR      70.27%     65.33%      70.66%    66.66%
NB      67.71%     64.66%      65.66%    65%
DTC     67.76%     64.66%      66.33%    65.33%
ABC     69.15%     65%         68.66%    65.66%

According to Table VI, the performance values of the Logistic Regression model are a little higher than those of the rest of the models. Therefore, we considered the LR model as the proposed model of this literature for multi-class classification.

B. Evaluation of Each Model for Binary-Class

The total number of matches was 1870, which consisted of 861 home team wins and 1009 wins for not-home. To evaluate the efficiency of each classification model, the 10-fold cross-validation method was used with the confusion matrix. The performance of each model for binary-class is displayed from Table VII to Table XI.

TABLE VII
PERFORMANCE OF THE SVM MODEL

Class      Precision   Recall    F1-Score
Not-Home   85%         75%       80%
Home       67%         79%       73%
Average    76%         77%       76.50%

TABLE VIII
PERFORMANCE OF THE LR MODEL

Class      Precision   Recall    F1-Score
Not-Home   77%         81%       79%
Home       78%         74%       76%
Average    77.50%      77.50%    77.50%

TABLE IX
PERFORMANCE OF THE NB MODEL

Class      Precision   Recall    F1-Score
Not-Home   74%         78%       76%
Home       76%         71%       74%
Average    75%         74.50%    75%
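The per-class precision, recall and F1-scores reported in the tables of Sections V-A and V-B can be produced from 10-fold cross-validated predictions. The following minimal sketch, placed here before the remaining binary-class tables, uses the LR model and stand-in data as an example; it is an illustration of the evaluation protocol, not the paper's exact code.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import cross_val_predict

# Stand-ins for the selected features and multi-class labels
# (0 = away win, 1 = draw, 2 = home win).
rng = np.random.default_rng(0)
X = rng.random((1870, 10))
y = rng.integers(0, 3, size=1870)

# Out-of-fold predictions from 10-fold cross-validation (k = 10, as in the paper).
model = LogisticRegression(max_iter=1000)
y_pred = cross_val_predict(model, X, y, cv=10)

print("Accuracy:", accuracy_score(y, y_pred))
print(confusion_matrix(y, y_pred))
# Per-class precision, recall and F1, as tabulated for each model.
print(classification_report(y, y_pred, target_names=["Away", "Draw", "Home"]))
```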
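For the binary-class task, the three-way outcome is collapsed into home versus not-home (away wins and draws together). A small sketch of that mapping and its evaluation follows, again with assumed variable names and stand-in data.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import cross_val_predict

# Multi-class labels: 0 = away win, 1 = draw, 2 = home win (stand-in data).
rng = np.random.default_rng(0)
X = rng.random((1870, 10))
y_multi = rng.integers(0, 3, size=1870)

# Collapse draws and away wins into a single "not-home" class.
y_binary = np.where(y_multi == 2, 1, 0)   # 1 = home win, 0 = not-home

y_pred = cross_val_predict(LogisticRegression(max_iter=1000), X, y_binary, cv=10)
print(classification_report(y_binary, y_pred, target_names=["Not-Home", "Home"]))
```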
TABLE X
PERFORMANCE OF THE DTC MODEL

Class      Precision   Recall    F1-Score
Not-Home   77%         76%       76%
Home       71%         73%       72%
Average    74%         74.50%    74%

TABLE XI
PERFORMANCE OF THE ABC MODEL

Class      Precision   Recall    F1-Score
Not-Home   77%         77%       77%
Home       72%         73%       73%
Average    74.50%      75%       75%

TABLE XII
THE RESULTS OF THE PREDICTION PROCESS IN BINARY-CLASS

Model   Accuracy   Precision   Recall    F1-Score
SVM     76.85%     76%         77%       76.5%
LR      77.43%     77.50%      77.50%    77.50%
NB      74.92%     75%         74.50%    75%
DTC     75.93%     74%         74.50%    74%
ABC     76.15%     74.50%      75%       75%

According to Table XII, the performance values of the Logistic Regression model are a little higher than those of the rest of the models. Therefore, we considered the LR model as the proposed model of this literature for binary-class classification.

C. Comparative Results

In this sub-section, a comparative analysis is presented to prove the superiority of the proposed model of EPL match prediction over existing models.

TABLE XIII
COMPARISON OF THE PROPOSED MODEL WITH THE EXISTING MODELS IN MULTI-CLASS

Parameters                           Accuracy
Proposed Model in Multi-Class        70.27%
Existing Model [1] in Multi-Class    68.55%
Existing Model [2] in Multi-Class    59%
Existing Model [3] in Multi-Class    58.5%

Table XIII shows the comparison between the proposed model and the existing models [1], [2] and [3] in multi-class, where the proposed model accuracy is 70.27% and the existing models [1], [2] and [3] have 68.55%, 59% and 58.5% accuracy, respectively.

TABLE XIV
COMPARISON OF THE PROPOSED MODEL WITH THE EXISTING MODEL IN BINARY-CLASS

Parameters                           Accuracy
Proposed Model in Binary-Class       77.43%
Existing Model [4] in Binary-Class   65.63%

Table XIV shows the comparison between the proposed model and the existing model [4] in binary-class, where the proposed model accuracy is 77.43% and the existing model [4] accuracy is 65.63%.

VI. CONCLUSION

The model we devised is based on statistical analysis of past football games. With it, we are able to make fairly accurate predictions. Although the accuracy of this model is pretty good, it is not guaranteed to always be right, and there is a lot of scope for future work in this regard. We could bring in sentiment analysis, features such as individual player and team performance metrics, studying the trending hashtags on Twitter on match day, the posts from fans on social media, etc., to further enhance the accuracy of the model.

ACKNOWLEDGEMENT

This work was partially supported by the "Research Fund" of Green University of Bangladesh.

REFERENCES

[1] Y. F. Alfredo and S. M. Isa, "Football Match Prediction with Tree Based Model Classification", I. J. Intelligent Systems and Applications, vol. 11, no. 7, pp. 20-28, 2019.
[2] S. Sathe, D. Kasat, N. Kulkarni and R. Satao, "Predictive Analysis of Premier League Using Machine Learning", I. J. Innovative Research in Computer and Communication Engineering, vol. 5, no. 3, pp. 4121-4124, 2017.
[3] R. Baboota and H. Kaur, "Predictive analysis and modelling football results using machine learning approach for English Premier League", I. J. Forecasting, vol. 35, no. 2, pp. 741-755, 2019.
[4] D. Rana and A. Vasudeva, "Premier League Match Result Prediction using Machine Learning", Jaypee University of Information Technology, 2019.
[5] Y. Bengio, A. Courville and P. Vincent, "Representation learning: A review and new perspectives", IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 35, no. 8, pp. 1798-1828, 2013.
[6] A. Coates, A. Y. Ng and H. Lee, "An analysis of single-layer networks in unsupervised feature learning", I. Conf. Artificial Intelligence and Statistics, pp. 215-223, 2011.
[7] X. Wan, "Influence of feature scaling on convergence of gradient iterative algorithm", J. Physics: Conf. Series, vol. 1213, no. 3, pp. 1-5, 2019.
[8] S. G. K. Patro and K. K. Sahu, "Normalization: A Preprocessing Stage", IARJSET, vol. 2, no. 3, pp. 20-22, 2015.
[9] J. Tang, S. Alelyani and H. Liu, "Feature Selection for Classification: A Review", Data Classification: Algorithms and Applications, CRC Press, pp. 37-64, 2014.
[10] R. H. Subho, M. R. Chowdhury, D. Chaki and S. Islam, "A Univariate Feature Selection Approach for Finding Key Factors of Restaurant Business", IEEE Region 10 Symposium (TENSYMP), pp. 605-610, 2019.
[11] E. Eryarsoy and D. Delen, "Predicting the Outcome of a Football Game: A Comparative Analysis of Single and Ensemble Analytics Methods", HICSS, pp. 1107-1115, Hawaii, 2019.
[12] S. Chakravarty, H. Demirhan and F. Baser, "Fuzzy regression functions with a noise cluster and the impact of outliers on mainstream machine learning methods in the regression setting", Applied Soft Computing, vol. 96, pp. 1-17, 2020.
[13] T. Cheng, D. Cui, Z. Fan, J. Zhou and S. Lu, "A new model to forecast the results of matches based on hybrid neural networks in the soccer rating system", Proc. Fifth Int. Conf. Computational Intelligence and Multimedia Applications (ICCIMA), IEEE, 2003.
[14] S. Dreiseitl and L. Ohno-Machado, "Logistic regression and artificial neural network classification models: a methodology review", J. Biomedical Informatics, vol. 35, no. 5-6, pp. 352-359, 2002.
[15] D. J. Hand and K. Yu, "Idiot's Bayes: not so stupid after all?", Int. Statistical Review, vol. 69, no. 3, pp. 385-398, 2001.
[16] L. Breiman, J. H. Friedman, R. A. Olshen and C. J. Stone, "Classification and Regression Trees", Biometrics, vol. 40, no. 3, pp. 874, 1984.
[17] C. Ying, M. Qi-Guang, L. Jia-Chen and G. Lin, "Advance and prospects of AdaBoost algorithm", Acta Automatica Sinica, vol. 39, no. 6, pp. 745-758, 2013.