Predicting Car MPG Using Decision Tree and Random Forest Algorithm Main
Predicting Car MPG Using Decision Tree and Random Forest Algorithm Main
Professional Training - 1
Predicting Car Mpg using Decision Tree and
Random Forest Algorithm
• In this project, we aim to develop predictive models for estimating the MPG of cars
using machine learning techniques, specifically Decision Trees and Random Forest
algorithms. The goal is to create accurate, interpretable models that can predict a
car's fuel efficiency based on various features such as engine size, weight,
horsepower, and other relevant characteristics.
PROBLEM STATEMENT
• Data Collection and Preprocessing: Gather a comprehensive dataset of cars that includes
features impacting fuel efficiency. Clean and preprocess the data to handle missing values,
categorical variables, and outliers.
• Feature Selection: Identify the most significant features that influence MPG using techniques
such as correlation analysis and feature importance scores.
• Model Development: Implement Decision Tree and Random Forest algorithms to create
predictive models for MPG.
• Model Evaluation: Assess the performance of the models using appropriate metrics such as
Mean Absolute Error (MAE), Mean Squared Error (MSE), and R-squared values. Use cross-
validation to ensure robustness.
• Interpretability: Provide insights into the model's decision-making process, particularly for the
Decision Tree, to understand how different features contribute to MPG predictions.
• Comparison of Models: Compare the performance of the Decision Tree and Random Forest
models to determine which algorithm provides better accuracy and reliability in predicting MPG
LITERATURE SURVEY
1. A. Aliyu and S. Adeshina, "Classifying auto-MPG data set using neural network," 2014 11th
International Conference on Electronics, Computer and Computation (ICECCO), Abuja, Nigeria,
2014, pp. 1-4, doi: 10.1109/ICECCO.2014.6997582.
2. M. J. Cohen and R. P. Wolfson, "The electric passenger car and its competition in the mid-
1980s," 31st IEEE Vehicular Technology Conference, Washington, DC, USA, 1981, pp. 31-37, doi:
10.1109/VTC.1981.1622909.
3. M. S. Rusiman, E. Nasibov and R. Adnan, "The optimal fuzzy c-regression models (OFCRM) in
miles per gallon of cars prediction," 2011 IEEE Student Conference on Research and
Development, Cyberjaya, Malaysia, 2011, pp. 333-338, doi: 10.1109/SCOReD.2011.6148760