Data Science Challenge - 1
Data Science Challenge - 1
Problem Statement
In this challenge, you are tasked with developing a predictive model for traffic volume
forecasting using a real-world dataset. Your focus will be on data preprocessing, feature
engineering, and model training to achieve the best possible performance on the test
dataset.
Like other standard machine learning competitions, you need to submit predictions on a test
dataset, also your goal is to demonstrate strong data preparation skills and achieve a low
RMSE on the test data while following best practices in model development.
Dataset: Download the dataset using the below link:
Dataset Description
Your Task
3. Feature Engineering
• Generate new features that could improve model performance.
• Your submission will be evaluated using Root Mean Squared Error (RMSE) based on
actual test labels.
7. Documentation
• Clearly explain your approach, preprocessing steps, and feature engineering choices
in a Jupyter Notebook.
Challenge:
Submit your predicted CSV file with a single column named 'Traffic_Vol' to "
https://challenge.astrikos.xyz:3443/ " to see your ranking on the leaderboard. You have a total of 10
submission attempts, so focus on fine-tuning your existing model to improve the RMSE.
Submission Guidelines
Fill the below form for submission: (you can only fill the form once)
Evaluation Criteria
Criteria Weightage