Scouting Players With FIFA19: Data Driven Approach To Scouting
Scouting Players With FIFA19: Data Driven Approach To Scouting
For this project, we are positioning ourselves as a scouting agency that uses analytics to,
among other things, enhance the discovery of talents and help soccer clubs better understand
the dynamics (features) that come into play when determining the value, overall and future
potential of a player. Our agency will be focusing on solving these fundamental scouting
problems:
We will be utilizing the FIFA 19 Player dataset available on Kaggle and apply various Data
Mining techniques to achieve our objectives.
Project Objectives
• Cluster players based various features to identify different player types for our similarity
database.
• Identify under-valued and over-valued players based on ability measures relative to
their value, salary, and/or release clause.
• Building predictive models for future value and potential of players.
Dataset
• Source: Kaggle
• Description: Detailed attributes for every player registered in the latest edition of FIFA
2019 database.
• Size: 9.1MB (18.2k observations x 89 features)
• Features:
1
• ID • Value • Joined
• Name • Wage • Loaned From
• Age • Special • Contract Valid Until
• Photo • Preferred Foot • Height
• Nationality • International Reputation • Weight
• Overall • Weak Foot • Ability by positions (26 features)
• Potential • Skill Moves • Ability by skills (34 features)
• Club • Work Rate • Release Clause
• Position • Jersey Number
2
− Pandas, Numpy, Matplotlib, Seaborn, Scikit-learn, Scipy
3. Tableau