Lab Project
Lab Project
Lab Project
Submitted to:
Mr. Amir Sohel
(Senior Lecturer, Computer Science and Engineering)
Submitted by:
Name: Pritom Das Shanto Name: Sazid Ahmed Tonmoy
ID: 201-15-13748 ID: 201-15-13774
Section: 55-G
Methodology: The graphic prediction methodology employed in this
project involves a step-by-step process to anticipate and visualize
trends, patterns, or outcomes within a given dataset. The methodology
is designed to provide insights into the data through the creation of
predictive graphics. By using this data and the power of computer
modeling, we aim to help the government make smarter decisions
about how to manage traffic. Whether it's improving roads or planning
for the future, our goal is to use data to make our cities work better for
everyone. Throughout this report, we'll take you on a journey—from
understanding the data we have to building computer models that can
predict future traffic.
Model Evaluation:
Predicted vs. Actual Plots:
The journey culminates in visualizing the performance of our predictive
models. Predicted vs. actual plots offer a tangible view of the accuracy
of our predictions, guiding us towards a deeper understanding of model
behavior.
Residual Plots: Residual plots provide a lens into the discrepancies
between predicted and actual values, offering insights into systematic
errors and areas for model refinement.
Dataset Splitting :
Features have been fitted to the train set after the completion of the
splitting. Because my dataset was not fully balanced, I have validated
my model with several cross-validation techniques. Using the feature
space sentiment, the train dataset size is 80% and the test dataset size is
20%
Linear regression is a statistical method used to model the relationship
between a dependent variable (also called the target or response
variable) and one or more independent variables (predictors or
features). The basic idea is to find the best-fitting linear relationship
that explains the variation in the dependent variable based on the
independent variables. Here's a breakdown of the key components of a
linear regression model performance is evaluated using various metrics,
including Mean Squared Error (MSE) = 0.00018, R-Squared = 0.9930.
These metrics help assess how well the model predicts the dependent
variable.
Predictions: - Once trained, the linear regression model can be used to
make predictions on new or unseen data by plugging in values for the
independent variables. Linear regression is a foundational and
interpretable model commonly used for predicting numerical
outcomes. However, it assumes a linear relationship, and its
performance may be affected if this assumption is not met.
Recursive Feature Elimination (RFE) is a feature selection technique
commonly used in machine learning to improve model performance
and reduce overfitting. The main idea behind RFE is to recursively
remove the least important features from the model until the optimal
subset of features is identified.
The criteria for selecting the optimal subset of features can vary. It
could be based on a certain number of features, or it could involve
monitoring the model's performance (e.g., cross-validated accuracy) as
features are removed.
The low MSE indicates that the model's predictions are close to the
actual values, while the high R-squared value suggests that a significant
proportion of the variance in the target variable is captured by the model.
This fine-tuned Decision Tree model showcases the effectiveness of
hyperparameter optimization in enhancing predictive accuracy,
providing a robust foundation for traffic prediction in our project.
Result :
Test Test Train Train
Model Accurac Accurac
Mean Square R Square y y
Name
Error Error Mean Square
Error R Square
Error