Scouting Players With FIFA19: Data Driven Approach To Scouting

This document outlines a project to use data mining techniques on FIFA 19 player data to help soccer clubs scout for players. The goals are to identify undervalued players, analyze current rosters for over/underperformers, build a similarity database for player comparisons, and develop predictive models for future player potential/value. The team will cluster, analyze, and build models on the FIFA 19 dataset containing attributes for over 18,000 real players to achieve these objectives.

Uploaded by

Mauricio Peñaloza

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

204 views3 pages

Scouting Players With FIFA19: Data Driven Approach To Scouting

Uploaded by

Mauricio Peñaloza

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Scouting Players with FIFA19

Applying Data Mining to Scouting

Data Driven Approach to Scouting

In the era of eight-figure salaries and nine figure signing fees, player recruitment is a
high-stakes game. In the past, soccer scouts have relied on rudimentary data and intuition to
evaluate the performance and value of soccer players. With the recent rise in data analytics
that can capture many aspects of a player’s performance, statistics and data science are
beginning to play a more prominent role in identifying rising stars and overvalued /
undervalued players.

For this project, we are positioning ourselves as a scouting agency that uses analytics to,
among other things, enhance the discovery of talents and help soccer clubs better understand
the dynamics (features) that come into play when determining the value, overall and future
potential of a player. Our agency will be focusing on solving these fundamental scouting
problems:

1. Finding undervalued players for a given club to acquire,

2. Analyzing a team’s current roster for over-payed and/or underperforming
players that could be traded or sold,
3. Developing a database of similar players for clubs looking for a specific player
type,
4. Build a predictive model to evaluate the future potential of young players.

We will be utilizing the FIFA 19 Player dataset available on Kaggle and apply various Data
Mining techniques to achieve our objectives.

Project Objectives
• Cluster players based various features to identify different player types for our similarity
database.
• Identify under-valued and over-valued players based on ability measures relative to
their value, salary, and/or release clause.
• Building predictive models for future value and potential of players.

Dataset
• Source: Kaggle
• Description: Detailed attributes for every player registered in the latest edition of FIFA
2019 database.
• Size: 9.1MB (18.2k observations x 89 features)
• Features:

1
• ID • Value • Joined
• Name • Wage • Loaned From
• Age • Special • Contract Valid Until
• Photo • Preferred Foot • Height
• Nationality • International Reputation • Weight
• Overall • Weak Foot • Ability by positions (26 features)
• Potential • Skill Moves • Ability by skills (34 features)
• Club • Work Rate • Release Clause
• Position • Jersey Number

Team & Roles

• Markus Wehr: Finding undervalued players.
• Nazih Kalo: Analyzing current roster of players.
• Stephen Stark: Developing similarity database.
• Tam Nguyen: Predictive model for future potential/value.
• Woo Jong Choi: Predictive model for future potential/value.

Data Mining Steps:

• Missing value, data type
Data pre-
• Features distribution
processing
• Feature engineering
1. Pre-processing and EDA
2. Clustering
Analysis
3. Build predictive models
Stages
4. Analyze performance & make final predictions
5. Visualize Output
• PCA
• t-SNE
• K-means
• DBSCAN
• SVD
• Regression: linear/ logit
Potential
• Hierarchical Clustering
Methods
• Latent Class Clustering
• Discriminant Analysis
• Regression Trees
• Random forest
• Decision trees
• Association rules
1. Microsoft Teams
Tools 2. Python
− Jupyter Notebook, Google Collab

2
− Pandas, Numpy, Matplotlib, Seaborn, Scikit-learn, Scipy
3. Tableau

VanDongen MA BMS
No ratings yet
VanDongen MA BMS
96 pages
Fantasy Sports Prediction Clustering Analysis
No ratings yet
Fantasy Sports Prediction Clustering Analysis
21 pages
Kibana 8.x – A Quick Start Guide to Data Analysis: Learn about data exploration, visualization, and dashboard building with Kibana
From Everand
Kibana 8.x – A Quick Start Guide to Data Analysis: Learn about data exploration, visualization, and dashboard building with Kibana
Krishna Shah
No ratings yet
What Is Process? Discuss The Process Framework Activities
No ratings yet
What Is Process? Discuss The Process Framework Activities
6 pages
P.P.U.D. Practical 3: 1. To Study DDL-create and DML-insert Commands
No ratings yet
P.P.U.D. Practical 3: 1. To Study DDL-create and DML-insert Commands
8 pages
Chapter 2 - Mini Test - Attempt Review
No ratings yet
Chapter 2 - Mini Test - Attempt Review
16 pages
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
Soccerment TheClusteringProject ENG 20220615 PDF
100% (1)
Soccerment TheClusteringProject ENG 20220615 PDF
158 pages
Data Mining and Machine Learning in High-Performance Sport
No ratings yet
Data Mining and Machine Learning in High-Performance Sport
63 pages
OpenText TeamSite LiveSite OpenDeploy 21.4.2 Release Notes
No ratings yet
OpenText TeamSite LiveSite OpenDeploy 21.4.2 Release Notes
24 pages
Business Analytics in Sport Talent Acquisition Met
No ratings yet
Business Analytics in Sport Talent Acquisition Met
20 pages
Sequnital Tasks Vs Parallel Tasks
No ratings yet
Sequnital Tasks Vs Parallel Tasks
8 pages
Football Analytics
No ratings yet
Football Analytics
27 pages
SPORTS Final
No ratings yet
SPORTS Final
91 pages
Super Bowl
No ratings yet
Super Bowl
10 pages
Football Players Market Value Prediction
No ratings yet
Football Players Market Value Prediction
19 pages
SEPM Unit-4
No ratings yet
SEPM Unit-4
38 pages
Project - Management - PPT Final
No ratings yet
Project - Management - PPT Final
18 pages
Usage of Analytics in The World of Sports
No ratings yet
Usage of Analytics in The World of Sports
7 pages
May Jun 2024
No ratings yet
May Jun 2024
2 pages
Predictthe Valueof Football Players Using FIFAvideogamedataand Machine Learning Techniques
No ratings yet
Predictthe Valueof Football Players Using FIFAvideogamedataand Machine Learning Techniques
16 pages
Analyzing Football Player Performance With Python An EDA Approach
No ratings yet
Analyzing Football Player Performance With Python An EDA Approach
43 pages
Fbana PDF
No ratings yet
Fbana PDF
17 pages
Cap484 Final Project
No ratings yet
Cap484 Final Project
8 pages
Predict The Value of Football Players Using FIFA Video Game Data and Machine Learning Techniques
No ratings yet
Predict The Value of Football Players Using FIFA Video Game Data and Machine Learning Techniques
15 pages
Comprehensive Analysis of Football Player Market V
No ratings yet
Comprehensive Analysis of Football Player Market V
7 pages
57 - Step PPT 2 Cpr3 Final
No ratings yet
57 - Step PPT 2 Cpr3 Final
32 pages
GnuCOBOL C Interaction
No ratings yet
GnuCOBOL C Interaction
29 pages
Playerank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
No ratings yet
Playerank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
27 pages
Driblab How Do We Work FRMF
No ratings yet
Driblab How Do We Work FRMF
27 pages
Valuing Passes in Football Using Ball Event Data
No ratings yet
Valuing Passes in Football Using Ball Event Data
73 pages
Money Ball
No ratings yet
Money Ball
8 pages
Football Player Transfer Value Prediction Using Advanced Statistics and FIFA 22 Data
No ratings yet
Football Player Transfer Value Prediction Using Advanced Statistics and FIFA 22 Data
6 pages
2018 - BARRON - Artificial Neural Networks and Player Recruitment in Professional Soccer
No ratings yet
2018 - BARRON - Artificial Neural Networks and Player Recruitment in Professional Soccer
11 pages
Whitepaper The Soccer Analytics Revolution 1
No ratings yet
Whitepaper The Soccer Analytics Revolution 1
10 pages
Data Driven Football Scouting Assistance With Simulated Player Performance Extrapolation
No ratings yet
Data Driven Football Scouting Assistance With Simulated Player Performance Extrapolation
8 pages
Player Ank
No ratings yet
Player Ank
18 pages
FIFA Report
No ratings yet
FIFA Report
10 pages
A Survey On Football Player Performance and Value Estimation Using Machine Learning Techniques (#1215552) - 2816789
No ratings yet
A Survey On Football Player Performance and Value Estimation Using Machine Learning Techniques (#1215552) - 2816789
6 pages
What Is MIS? Characteristics, Objectives, Role, Component
No ratings yet
What Is MIS? Characteristics, Objectives, Role, Component
18 pages
Data Science Methodology in Football Players Recruitment
No ratings yet
Data Science Methodology in Football Players Recruitment
2 pages
RL - Exp-4 Updated
No ratings yet
RL - Exp-4 Updated
2 pages
Fridman LexPhD
No ratings yet
Fridman LexPhD
67 pages
It Glossary English
No ratings yet
It Glossary English
9 pages
Nutanix Files User Guide
No ratings yet
Nutanix Files User Guide
70 pages
Problem Statement - PBI
No ratings yet
Problem Statement - PBI
1 page
Project Traceability Matrix
No ratings yet
Project Traceability Matrix
10 pages
Ict450 SQL Exercise Question
No ratings yet
Ict450 SQL Exercise Question
12 pages
Mini Project Analysis On Messi
No ratings yet
Mini Project Analysis On Messi
10 pages
HPE Reference Configuration For Veeam Backup & Replication Version 12 With HPE StoreOnce
No ratings yet
HPE Reference Configuration For Veeam Backup & Replication Version 12 With HPE StoreOnce
71 pages
DEM Project Report
No ratings yet
DEM Project Report
7 pages
2212.11041-What Should Clubs Monitor To Predict Future Value of Footbal Players
No ratings yet
2212.11041-What Should Clubs Monitor To Predict Future Value of Footbal Players
22 pages
Transfer Portal Accurately Forecasting The Impact of A 2201.11533
No ratings yet
Transfer Portal Accurately Forecasting The Impact of A 2201.11533
25 pages
SA Unit 3
No ratings yet
SA Unit 3
14 pages
SQL PDF Raviraj
No ratings yet
SQL PDF Raviraj
249 pages
RDBMS Assignment1 - Oct 2024
No ratings yet
RDBMS Assignment1 - Oct 2024
5 pages
CapstoneSynopsis A
No ratings yet
CapstoneSynopsis A
6 pages
Scouting in Soccer With Applied Machine Learning
No ratings yet
Scouting in Soccer With Applied Machine Learning
19 pages
Entropy 23 00090 v3
No ratings yet
Entropy 23 00090 v3
12 pages
OS Module2 PDF
No ratings yet
OS Module2 PDF
22 pages
Additional Project Problem Statement - FIFA Data Analysis
No ratings yet
Additional Project Problem Statement - FIFA Data Analysis
2 pages
Player Performance in Football
No ratings yet
Player Performance in Football
6 pages
Artificial Neural Networks and Player Recruitment in Professional Soccer
No ratings yet
Artificial Neural Networks and Player Recruitment in Professional Soccer
8 pages
AF302 Exam
No ratings yet
AF302 Exam
14 pages
2020-21 Fall 41553 Bernardo-Pinto
No ratings yet
2020-21 Fall 41553 Bernardo-Pinto
49 pages
Problem Statement - FIFA
No ratings yet
Problem Statement - FIFA
1 page
Text Mining Tools On The Internet
No ratings yet
Text Mining Tools On The Internet
75 pages
Applying Data Mining To Scouting: Markus Wehr Nazih Kalo Stephen Stark Tam Nguyen Woojong Choi
No ratings yet
Applying Data Mining To Scouting: Markus Wehr Nazih Kalo Stephen Stark Tam Nguyen Woojong Choi
38 pages
EBK TMS Toolkit Technology Stack GTreasury
No ratings yet
EBK TMS Toolkit Technology Stack GTreasury
18 pages
FIFA Video Game - Players Classification
No ratings yet
FIFA Video Game - Players Classification
26 pages
Information Assurance - Defined and Explained
No ratings yet
Information Assurance - Defined and Explained
3 pages
INFO Assignment 1
No ratings yet
INFO Assignment 1
6 pages
ML in Soccer Analytics Gunjan Kumar
No ratings yet
ML in Soccer Analytics Gunjan Kumar
99 pages
Ekefre Non Confidential
No ratings yet
Ekefre Non Confidential
59 pages
Software Defined Networking (SDN) Overview: Includes Material From Scott Shenker and Nick Mckeown
No ratings yet
Software Defined Networking (SDN) Overview: Includes Material From Scott Shenker and Nick Mckeown
28 pages
Yuan He PDF
No ratings yet
Yuan He PDF
15 pages
AWS+Tagging Naming+Conventions
No ratings yet
AWS+Tagging Naming+Conventions
4 pages
6 powerBI Project PDF
100% (3)
6 powerBI Project PDF
16 pages
CheatSheet FortiOS 6.2
No ratings yet
CheatSheet FortiOS 6.2
3 pages
PlayeRank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
No ratings yet
PlayeRank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
18 pages
E. B. Magalona National High School
No ratings yet
E. B. Magalona National High School
2 pages
Beating The Odds: Learning To Bet On Soccer Matches Using Historical Data
No ratings yet
Beating The Odds: Learning To Bet On Soccer Matches Using Historical Data
7 pages
Rajesh 2020
No ratings yet
Rajesh 2020
9 pages
Handbook Fa
100% (1)
Handbook Fa
27 pages
FIFA 18 - Data Analysis: - Harsh Takrani - Pranay Lulla
No ratings yet
FIFA 18 - Data Analysis: - Harsh Takrani - Pranay Lulla
16 pages
Hbase PDF
No ratings yet
Hbase PDF
8 pages
Problem Statement - PBI - Docx-1
No ratings yet
Problem Statement - PBI - Docx-1
1 page
Player Stats Analysis Using Machine Learning
No ratings yet
Player Stats Analysis Using Machine Learning
4 pages
HCM Extract DBI List REL11 Updated
No ratings yet
HCM Extract DBI List REL11 Updated
5 pages
OnPrem SAP S4HANA Activate End To End Steps 1588902192 PDF
No ratings yet
OnPrem SAP S4HANA Activate End To End Steps 1588902192 PDF
17 pages

Scouting Players With FIFA19: Data Driven Approach To Scouting

Uploaded by

Scouting Players With FIFA19: Data Driven Approach To Scouting

Uploaded by

Scouting Players with FIFA19

Applying Data Mining to Scouting

Data Driven Approach to Scouting

1. Finding undervalued players for a given club to acquire,

Team & Roles

Data Mining Steps:

You might also like