Delay Prediction

The document discusses the pervasive issue of flight delays in the aviation industry, highlighting their impact on travelers, airlines, and airport operations. It emphasizes the need for an intelligent delay prediction system utilizing machine learning and deep learning techniques to enhance operational efficiency and passenger satisfaction. The proposed system integrates various models and a comprehensive dataset to accurately forecast delays, enabling proactive decision-making for stakeholders.

Uploaded by

karthikeyanlavanya13

ABSTRACT

Flight delays are a widespread and ongoing issue in the aviation industry, influencing not
only the experience of millions of air travelers but also causing notable operational
inefficiencies and financial burdens for airlines and airport authorities. These delays lead to
significant disruptions in airline schedules, increased fuel consumption, resource
mismanagement, and considerable passenger dissatisfaction. They also complicate the
coordination of airport infrastructure, baggage handling, and air traffic control operations.
Given these challenges, there is a pressing need for a robust and intelligent system that can
accurately forecast potential delays before they occur, enabling stakeholders to take proactive
measures. A reliable delay prediction system can contribute substantially to improving
operational efficiency, minimizing disruptions, lowering operating costs, and enhancing the
overall passenger experience. Machine learning and deep learning techniques offer
valuable tools for analyzing large volumes of historical flight and environmental data to
uncover patterns associated with delays. The predictive system integrates a wide range of
traditional machine learning models such as Logistic Regression, Random Forest, Support
Vector Machine (SVM), and Gradient Boosting, specifically the XGBoost algorithm. These
models are well-established in classification and regression tasks due to their robustness,
interpretability, and relatively fast computation. In addition, modern deep learning methods,
particularly Long Short-Term Memory (LSTM) networks, are employed to capture temporal
dependencies and sequential patterns in time-series data. LSTM models are particularly
suitable for delay prediction due to their ability to retain long-term memory and handle
time-dependent variables more effectively than standard feedforward networks. The dataset used
for training these models is comprehensive, containing numerous attributes relevant to flight
operations and weather conditions. Key features include scheduled and actual departure and
arrival times, total flight duration, airline identifiers, aircraft codes, origin and destination
airports, and route-specific information. Moreover, the dataset integrates meteorological
parameters such as temperature, wind speed, visibility, precipitation, and atmospheric
pressure, all of which have known influences on flight schedules. Historical delay data is also
included to help the models learn from past trends and seasonality, improving their predictive
power.

KEYWORDS:

Flight, Delay, Air Traffic, Weather, Airport, Congestion, Flight Data, Classification, Regression

TABLE OF CONTENTS

CHAPTER NO.    TITLE
ABSTRACT
LIST OF ABBREVIATIONS
LIST OF FIGURES
1. INTRODUCTION

1.1 Overview
1.2 Purpose
1.3 Problem with Existing System
1.4 Objective
1.5 Overall Description
2. LITERATURE SURVEY
2.1 Predicting Flight Delays Using NN
2.2 Multi-task Local Global Graph Network for FDP
2.3 Modeling Delay Propagation Effects Using Bayesian
Network
2.4 Flight Delay Classification Prediction Based on Stacking
Algorithm
2.5 Deep Learning Approach for FDP Through Time
Graphs
2.6 A Data Mining Approach to Flight Arrival Delay
Prediction for American Airlines
2.7 Limitations

3. SYSTEM MODELING
3.1 Data Flow Diagram
3.1.1 Data Flow Diagram level 0
3.1.2 Data Flow Diagram level 1
3.2 System Architecture
3.3 Process Description
3.3.1 Data Preprocessing
3.3.2 Data Splitting
3.3.3 Model Training
3.3.4 Model Evaluation
3.4 System Requirements
3.4.1 Software Requirements
3.4.2 Hardware Requirements
3.5 Contribution of Individual Participants

CHAPTER 1
INTRODUCTION
Flight delay prediction represents a practical and impactful use of machine learning in the
transportation and aviation industries. Its primary goal is to estimate whether a planned flight
will experience a delay and, if so, determine the likely extent of the delay. This task is driven
by the growing need to mitigate the adverse effects that delays have on airlines, airports, and
travelers. Unforeseen disruptions in flight schedules can result in financial losses for carriers,
operational congestion at terminals, and dissatisfaction among passengers. Consequently, the
ability to anticipate delays in advance is of significant importance for ensuring efficient air
travel operations.

Machine learning models provide a robust solution to this issue, as they are
capable of analyzing large volumes of historical and real-time data to discover intricate
relationships and patterns that contribute to delays. These models can surpass traditional
statistical methods by offering improved prediction accuracy through advanced learning
capabilities. Unlike conventional approaches, which may rely on limited parameters and
assumptions, machine learning algorithms can handle complex, multidimensional data
environments with greater flexibility.

The process of developing a predictive model for flight
delays starts with the acquisition and preparation of relevant data. This typically involves
assembling a comprehensive dataset containing a wide array of features. Historical flight data
forms the core of such datasets, encompassing scheduled and actual departure and arrival
times, flight identifiers, airline codes, origin and destination airports, aircraft models, and
additional operational information. These attributes help the model understand past flight
performance and operational behavior under different conditions.

In addition to flight-specific
records, incorporating supplementary data such as weather conditions, seasonal trends,
airport traffic congestion, and airspace limitations further enhances the model's ability to
make accurate forecasts. This enriched dataset enables the system to account for various
factors that influence delays and improves the model's overall generalization capability.

The choice of machine learning algorithms depends on the nature of the prediction task. When the
objective is to determine whether a flight will be delayed, which is typically framed as a
classification problem, supervised learning models are employed.

1.1 OVERVIEW

Flight delay prediction is an application of machine learning aimed at forecasting whether a
scheduled flight will be delayed and by how much, based on historical and real-time data.
Airlines, airports, and passengers all suffer due to unpredictable delays, making this a critical
problem in the aviation industry. Machine learning (ML) models can analyse large volumes
of past flight data and identify complex patterns, enabling them to predict potential delays
more accurately than traditional statistical methods. The process of flight delay prediction
typically begins with the collection of comprehensive datasets. These datasets include
historical flight records, which consist of scheduled and actual departure and arrival times,
airline codes, flight numbers, aircraft types, origin and destination airports, and other
operational details. Different machine learning models are applied depending on the nature of
the problem. If the goal is to classify whether a flight will be delayed or not, classification
algorithms like Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting
methods such as XGBoost are commonly used.
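
As a minimal sketch of how a classification target could be derived from the scheduled and
actual times mentioned above: the 15-minute cut-off, the timestamp format, and the function
names are illustrative assumptions, not details taken from this project.

```python
from datetime import datetime

# Assumption: timestamps look like "2024-01-01 10:00"; the report does not specify a format.
TIME_FORMAT = "%Y-%m-%d %H:%M"
# Assumption: a flight counts as delayed at 15 or more minutes late (illustrative cut-off).
DELAY_THRESHOLD_MIN = 15


def delay_minutes(scheduled: str, actual: str) -> float:
    """Return how many minutes after the scheduled time the event occurred (negative if early)."""
    sched = datetime.strptime(scheduled, TIME_FORMAT)
    act = datetime.strptime(actual, TIME_FORMAT)
    return (act - sched).total_seconds() / 60.0


def is_delayed(scheduled: str, actual: str, threshold: float = DELAY_THRESHOLD_MIN) -> int:
    """Binary classification label: 1 if late by at least `threshold` minutes, else 0."""
    return int(delay_minutes(scheduled, actual) >= threshold)
```

The value returned by `delay_minutes` could also serve directly as the target when the task is
framed as regression (predicting by how much a flight is delayed).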

1.2 PURPOSE

The primary purpose of using machine learning in flight delay prediction is to improve the
accuracy and reliability of delay forecasts. This helps airlines and airport authorities make
better operational decisions, reduce passenger inconvenience, and minimize financial losses.
Passengers can also benefit from timely notifications and rescheduling options, leading to an
overall enhanced travel experience.

• Minimizing Passenger Inconvenience
• Enhancing Airline Operational Efficiency
• Reducing Economic Losses
• Improving Air Traffic Management
• Gaining Data-Driven Insights

Minimizing Passenger Inconvenience:

Flight delays can lead to missed connections, rescheduling hassles, and extended waiting
times for passengers. Accurate delay predictions help airlines and passengers plan better by
providing real-time updates and alternative travel arrangements.

Enhancing Airline Operational Efficiency:

Airlines can optimize their schedules, crew assignments, and aircraft rotations by anticipating
delays. This leads to better resource management, reducing idle times and operational costs.

Improving Air Traffic Management:

Predicting delays helps air traffic controllers manage congestion at airports and in airspace
more efficiently. This ensures smoother operations, reduces bottlenecks, and enhances overall
flight safety.

Reducing Economic Losses:

Airlines, airports, and passengers collectively suffer significant financial losses due to flight
delays. Predicting delays allows airlines to take preventive measures, reducing compensation
claims, refund costs, and additional expenses.

1.3 Problem with Existing System

Existing systems for predicting flight delays using machine learning face several significant
challenges that limit their effectiveness. One major issue is the quality and completeness of
available data. Flight delays are influenced by a wide range of factors including weather, air
traffic, maintenance issues, and airport congestion, but not all of this data is consistently
available or accurately recorded. Many models also struggle with the dynamic and complex
nature of the aviation environment, where delays can result from a chain reaction of events.
Furthermore, some systems rely on simplistic models that fail to capture non-linear
relationships between features, leading to low prediction accuracy. Lack of standardization
in data sources, data quality issues, and insufficient integration with airline operations also
contribute to the inefficiency of current delay prediction models. As a result, there is a
pressing need for more robust and intelligent systems that can provide accurate and
actionable delay predictions.

1.4 Objectives

• The objective of this project is to develop an accurate machine learning model that can
predict potential delays before a flight's departure.
• By analysing historical flight data, weather conditions, airline information, and other
relevant factors, the project aims to enhance operational efficiency, improve
passenger satisfaction, and support proactive decision-making for airlines and airport
authorities.
• Flight delay prediction models aim to enhance safety, efficiency, and customer
satisfaction across the aviation ecosystem while supporting smarter, data-informed
decision-making.

1.5 Overall Description

Flight delay prediction using machine learning is an advanced approach to forecasting flight
delays by analysing historical and real-time data. Flight delays are a major challenge in the
aviation industry, causing inconvenience to passengers, financial losses for airlines, and
disruptions in airport operations. This project utilizes machine learning techniques to identify
patterns and relationships between various factors influencing delays, such as weather
conditions, air traffic congestion, airline schedules, departure and arrival times, and
operational inefficiencies. By implementing models like Random Forest, Logistic Regression,
and deep learning techniques such as Long Short-Term Memory (LSTM), the project aims to
determine the most effective predictive approach. The process involves data collection,
preprocessing, feature selection, model training, evaluation, and visualization to gain
meaningful insights.

By providing accurate forecasts, this system helps airlines optimize
scheduling, minimize disruptions, and enhance passenger experience, ultimately improving
efficiency in the aviation industry. The primary goal is to provide timely and reliable
predictions that support better decision-making for airlines, airport authorities, and
passengers. By doing so, these models help improve operational efficiency, reduce costs,
enhance the passenger experience, and contribute to more sustainable and intelligent air
travel systems. As the aviation industry becomes more data-driven, machine learning-based
delay prediction stands out as a crucial tool for addressing one of the most persistent
challenges in air transportation.

Flight delay prediction focuses on estimating whether a flight
will be delayed and by how much, based on various influencing factors. These factors may
include weather conditions, airport traffic, airline operations, and scheduled flight times.
Accurately predicting delays helps improve scheduling, reduce passenger inconvenience, and
optimize airline and airport operations.

To achieve this, historical flight data and external
variables (like weather and airport conditions) are collected and processed. Machine learning
models such as decision trees, support vector machines, or deep learning networks are then
trained to recognize patterns and make delay predictions for future flights. This approach
supports proactive decision-making, allowing airlines and airports to respond to potential
issues before they escalate, ultimately enhancing operational efficiency and customer
satisfaction.
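
Sequence models such as LSTM are usually fed fixed-length windows of past observations. The
sketch below shows one possible way to turn a delay time series into such windows; the window
length, the sample values, and the function name are illustrative assumptions rather than the
project's actual preprocessing.

```python
from typing import List, Tuple


def make_windows(series: List[float], window: int) -> Tuple[List[List[float]], List[float]]:
    """Split a time series into (input window, next value) pairs.

    Each input holds `window` consecutive past values; the target is the
    value that immediately follows them.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    inputs, targets = [], []
    for start in range(len(series) - window):
        inputs.append(series[start:start + window])
        targets.append(series[start + window])
    return inputs, targets


# Hypothetical hourly average departure delays (minutes) at one airport.
hourly_delay = [5.0, 7.0, 12.0, 30.0, 25.0, 10.0]
X, y = make_windows(hourly_delay, window=3)
```

Each row of `X` would form one input sequence, and the matching entry of `y` the value the
network learns to predict.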

CHAPTER 2

LITERATURE SURVEY

2.1 Etani N. Development of a predictive model for on-time arrival flight of
airliner by discovering correlation between flight and weather data. J Big
Data.

Recent research has shown that integrating machine learning with aviation data significantly
enhances flight delay prediction accuracy. Etani (2019) developed a predictive model that
leverages both flight operation data and meteorological information to estimate on-time
arrivals. The study revealed strong correlations between weather conditions such as wind,
visibility, and precipitation and delay occurrences. By incorporating these features, the model
achieved better performance compared to traditional methods. This work highlights the
importance of combining multiple data sources to improve predictive reliability. Machine
learning models, including Random Forests, Support Vector Machines (SVM), and XGBoost,
have become widely adopted due to their ability to model non-linear relationships in large
datasets. Etani’s findings influenced many subsequent studies to adopt hybrid data
approaches, integrating weather and operational data. Preprocessing techniques like data
cleaning, encoding, and normalization have been recognized as essential for improving model
performance.
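
The encoding and normalization steps mentioned above can be sketched as follows. This is a
minimal, library-free illustration; the column names and sample values are hypothetical and
not taken from any of the cited datasets.

```python
from typing import Dict, List


def one_hot(values: List[str]) -> List[Dict[str, int]]:
    """Encode a categorical column as one indicator column per distinct category."""
    categories = sorted(set(values))
    return [{cat: int(v == cat) for cat in categories} for v in values]


def min_max_normalize(values: List[float]) -> List[float]:
    """Rescale numeric values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column: avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical sample columns.
airlines = ["AA", "DL", "AA"]
wind_speed_kmh = [10.0, 30.0, 20.0]

encoded = one_hot(airlines)
scaled = min_max_normalize(wind_speed_kmh)
```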

2.2 R R. Khan, S. Akbar and T. A. Zahed, "Flight delay prediction based
on gradient boosting ensemble techniques," in 16th Int. Conf. on Open
Source Systems and Technologies (ICOSST), December 14-15,
2022, Lahore, Pakistan, 2022

Khan et al. proposed a machine learning-based approach using Gradient Boosting methods to
predict flight delays. The study was presented at the 16th International Conference on Open
Source Systems and Technologies (ICOSST) held in Lahore, Pakistan, in December 2022.
They utilized real-world flight datasets and emphasized the importance of preprocessing steps
such as feature selection and data balancing. The authors compared different ensemble
techniques, including AdaBoost, Random Forest, and Gradient Boosting Machines (GBM).
Among them, GBM yielded the best predictive performance in terms of accuracy and
robustness. The models were evaluated using performance metrics like accuracy, precision,
recall, and F1-score. The study demonstrated that ensemble methods significantly enhance
prediction capabilities over traditional approaches. The research also highlighted the
importance of integrating external factors such as weather and traffic data. This paper mainly
focuses on data-driven machine learning techniques without relying on simulation.
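
For reference, the four evaluation metrics named above can be computed from predicted and true
binary labels as in the following sketch. The label convention (1 = delayed) and the sample
labels are assumptions made for illustration.

```python
from typing import Dict, List


def classification_metrics(y_true: List[int], y_pred: List[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall and F1 for binary labels (1 = delayed)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # delayed, predicted delayed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # on time, predicted on time
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # on time, predicted delayed
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # delayed, predicted on time

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical labels for eight flights.
metrics = classification_metrics([1, 1, 1, 0, 0, 0, 0, 0], [1, 1, 0, 1, 0, 0, 0, 0])
```

Because delayed flights are usually the minority class, precision, recall, and F1 tend to be
more informative than accuracy alone.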

2.3 "Li Q, Jing. Generation and prediction of flight delays in air transport.
IET Intell Transp Syst. 2021"

The paper primarily analyzed how flight delays are generated and their temporal
characteristics. The author used historical air transport data to identify delay patterns and
predict future delays. Unlike purely ML-based studies, this research incorporated simulation
techniques to understand the system-level behavior of delay propagation. Seasonal variations,
time-of-day patterns, and weather conditions were key variables considered in the model. The
study also compared machine learning models with traditional statistical baselines, showing
that ML models offered better predictive performance. Li emphasized the importance of
integrating both macro-level (system-wide) and micro-level (individual flight) data for
accurate predictions. The research offers a more holistic approach by combining data science
and domain-specific knowledge. It serves as a valuable contribution to transport system
planning and operational management.

2.4 "Bisandu DB, Moulitsas I, Filippone S. Social ski driver conditional
autoregressive-based deep learning classifier for flight delay prediction.
Neural Comput Appl. 2022"

In their 2022 study, Bisandu et al. proposed a deep learning model called the Social Ski
Driver Conditional Autoregressive (CAR) classifier for flight delay prediction. The model
incorporates spatiotemporal correlations among flights to enhance predictive accuracy. It
outperformed conventional machine learning models like random forests and SVMs on
benchmark datasets. Their approach captures the influence of surrounding flight behaviors,
offering a more dynamic prediction system. The study demonstrated strong generalization
in complex real-world scenarios. Evaluation metrics showed significant improvements in
accuracy and reliability. This work advances deep learning applications in transportation
analytics.

2.5 "W. Shao, A. Prabowo, S. Zhao, S. Tan, P. Koniusz, J. Chan, X. Hei,
B. Feest and F. D. Salim, 'Flight delay prediction using airport
situational awareness map,' in Proc. of 27th ACM SIGSPATIAL Int.
Conf. on Advances in Geographic Information Systems, November 5-8,
2019"

W. Shao et al. (2019) proposed a novel approach for flight delay prediction using an
Airport Situational Awareness Map. This method integrates spatiotemporal data, such as
runway occupancy, taxiway congestion, and gate availability, to enhance prediction
accuracy. The study was presented at the 27th ACM SIGSPATIAL conference,
highlighting the use of geographic information systems (GIS) in aviation analytics. Their
model outperformed traditional delay prediction methods by incorporating real-time airport
conditions. It emphasized the importance of spatial context in operational forecasting. The
approach also supports proactive decision-making for air traffic controllers. This work
bridges GIS and AI for smarter air transport systems.

2.6 "Wu, Y., Yang, H., Lin, Y., & Liu, H. (2022). Spatiotemporal Propagation
Learning for Network-Wide Flight Delay Prediction"
Wu et al. (2022) proposed the SpatioTemporal Propagation Network (STPN) to model
delay propagation across airport networks. By employing a space-time separable graph
convolutional network, they captured complex spatiotemporal dependencies. The model
considered both geographic proximity and airline schedules in its spatial analysis.
Temporally, a multi-head self-attention mechanism was used to reason about dependencies
across the delay time series. The integration of a squeeze-and-excitation module further
enhanced feature learning. Evaluated on U.S. and China flight delay datasets, STPN
outperformed existing methods. The study provided interpretable insights into delay
propagation patterns. (arXiv)

2.7 "Wang, L., Tien, A., & Chou, J. (2021). Multi-Airport Delay Prediction
with Transformers"
Wang et al. (2021) introduced a Temporal Fusion Transformer (TFT) model for predicting
delays across multiple airports. The approach captured complex temporal dynamics of
inputs like traffic, demand, and weather. A self-supervised learning model encoded
high-dimensional weather data into lower-dimensional representations. This facilitated efficient
training of the TFT model. The model achieved satisfactory performance with smaller
prediction errors. Interpretability analysis identified key input factors influencing delays.
The study aimed to assist air traffic managers in proactive decision-making.

2.8 "Sahfienya, H., & Regan, A. C. (2021). 4D flight trajectory prediction
using a hybrid Deep Learning prediction method based on ADS-B
technology:"
Sahfienya and Regan (2021) developed a hybrid deep learning model for 4D flight
trajectory prediction at ATL airport. Utilizing ADS-B data, the model combined CNN and
GRU architectures to extract spatial and temporal features. Monte Carlo dropout was
incorporated to address prediction uncertainties. The model demonstrated superior
performance over traditional methods, reducing prediction errors by an average of 21%.
The study emphasized the importance of considering uncertainty in trajectory predictions. It
provided a robust approach for optimizing airport infrastructure usage. The methodology
can be extended to other high-traffic airports for improved scheduling.

2.9 "Chakrabarty, A., & Balaji, S. (2021). Flight Delay Prediction Using
Machine Learning Techniques"
The authors investigated flight delay prediction using Random Forest, Decision Tree, and
Gradient Boosting models. They used historical U.S. domestic flight data enriched with
weather and airport parameters. The study emphasizes data preprocessing techniques,
including label encoding and feature selection. Gradient Boosting delivered the highest
performance. Their approach supports real-time operational improvements. They also
highlighted the potential of ML in air traffic decision-making. The study forms a baseline
for comparative modeling.

2.10 "Patgiri, D., et al. (2020). Airline Delay Prediction: A Machine
Learning Approach"
This paper explores several classification algorithms like Logistic Regression, Naive Bayes,
and Random Forest to predict airline delays. It uses a large dataset covering over 20 years
of flight history. Random Forest achieved superior accuracy due to its robustness in
handling nonlinear features. Feature importance indicated weather and airline schedules as
dominant factors. The study included balancing techniques to address class imbalance. It
emphasized timely predictions to reduce cascading delays. Their results support airport-
level strategic planning.

2.11 "Sternberg, H., et al. (2017). A Review of Flight Delay Prediction"

This survey categorizes existing delay prediction approaches into simulation, statistical, and
machine learning techniques. The authors identify data availability and real-time accuracy
as critical challenges. They also address the complexity of feature integration from multi-
source datasets. The review highlights the transition from regression models to ML and
deep learning. Key gaps include integration of dynamic weather and airport traffic. The
study provides a comprehensive research framework. It sets the stage for hybrid AI
modeling in aviation.

2.12 "Yu, X., et al. (2024). Flight Departure Delay Prediction Based on
Spatio-Temporal Graph Neural Networks"
Yu et al. developed a deep learning model combining GCN, 3D-CNN, and LSTM to
capture temporal and spatial dynamics of flight data. The model integrated real-time
weather, airport layout, and air traffic. Results showed significant improvement over
traditional ML models. Their approach enhanced delay prediction accuracy in dynamic
scenarios. Attention mechanisms and graph structures offered deep contextual learning. The
study confirms deep learning’s value in aviation analytics. It also opens avenues for real-
time deployment.

2.13 "Sharma, R., et al. (2023). Flight Delay Prediction Using Machine
Learning. In Advances in Data and Information Sciences"
This book chapter outlines the use of Decision Trees, Random Forest, and SVM for delay
prediction. The authors discuss common challenges such as class imbalance and missing
data. Key features analyzed include airport traffic, aircraft type, and weather factors. Data
normalization and encoding improved model performance. The study emphasizes the
importance of AI in airport logistics. Comparative analysis shows Random Forest as most
reliable. The chapter concludes with future AI integration recommendations.

2.14 "Shao, Y., et al. (2019). Departure Delay Prediction for Flights Using
Airport Situational Awareness Maps and Machine Learning"
The study used Airport Situational Awareness Maps (ASAMs) to improve departure delay
predictions. Gradient Boosting was employed with engineered features from weather,
ground operations, and ASAMs. The system significantly improved delay detection
accuracy. It supports proactive decision-making by airlines. Real-time ASAM data
provided high-resolution contextual inputs. The work demonstrates the value of localized
airport data in ML pipelines. Their framework is scalable across airports of varying sizes.

2.15 "Jha, R., et al. (2024). Flight Delay Prediction Using Deep Learning:
A Hybrid Approach"
Jha and colleagues developed a hybrid architecture combining XGBoost and LSTM for
flight delay classification. Tabular data were processed through XGBoost while sequential
patterns were learned via LSTM. The hybrid model improved precision, especially for long-
haul delays. Feature engineering included time, weather, and operational data. The authors
tackled noise and imbalance using SMOTE. Comparative metrics proved the hybrid model's
superiority. Their design promotes both performance and interpretability.
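
SMOTE addresses class imbalance by synthesizing new minority-class samples through
interpolation between a minority sample and one of its nearest minority neighbours. The
following is a simplified sketch of that idea, not the reference implementation and not the
cited authors' code; the value of `k`, the seed, and the sample features are assumptions.

```python
import math
import random
from typing import List

Point = List[float]


def smote_like(minority: List[Point], n_new: int, k: int = 2, seed: int = 0) -> List[Point]:
    """Create `n_new` synthetic minority samples by neighbour interpolation."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # The k nearest minority neighbours of `base` (Euclidean distance), excluding itself.
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from base to neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical minority-class ("delayed") samples: [wind speed, visibility].
delayed = [[30.0, 2.0], [35.0, 1.5], [28.0, 3.0]]
new_samples = smote_like(delayed, n_new=4)
```

Synthetic samples should be generated from the training split only, so that the evaluation data
contains real flights exclusively.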

2.16 "Wang, M., et al. (2023). Att-Conv-LSTM-Based Flight Delay
Prediction Model"
Wang et al. proposed an attention-based convolutional LSTM model to capture spatio-
temporal dependencies in flight data. The model utilizes convolution for spatial features
and LSTM for temporal dynamics. Attention layers enhanced learning of critical delay
causes. Key inputs included weather patterns and airport load. Their method outperformed
traditional LSTM and GRU models. Results confirmed the model's accuracy and
robustness. It is suitable for real-time delay forecasting systems.

2.17 "Li, Y., et al. (2021). A Two-Stage Approach to Predict Flight Delays
Using Deep Learning. IET Intelligent Transport Systems"
Li et al. created a two-stage prediction framework where RNNs model high-level delay
trends, and deep neural networks refine final predictions. This hierarchical structure
improves scalability. The dataset includes diverse airports and weather conditions. Stage
one filters out irrelevant inputs, reducing computational load. Stage two enhances accuracy
with granular inputs. Their method achieved over 90% accuracy for selected routes. The
architecture adapts well to changing aviation environments.

2.18 "Pophale, A., et al. (2022). Airline Delay Analysis Using Machine
Learning Algorithms. Mathematical Statistician and Engineering
Applications"
This paper applies Linear and Polynomial Regression models to forecast departure delays.
The dataset, sourced from Kaggle, includes flight origin, airline, and weather data.
Polynomial Regression outperformed Linear Regression in capturing non-linear delay
trends. Data visualization was used to understand delay patterns. The study achieved
moderate accuracy (~72%) but provided insight into model tuning. It's especially valuable
for educational and baseline modeling. It lays the groundwork for integrating more complex
methods.
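
To illustrate why a polynomial fit can capture non-linear delay trends that a straight line
misses, the sketch below fits both by ordinary least squares using the normal equations. The
data are synthetic and the helper names are hypothetical; this is not the cited paper's code or
dataset.

```python
from typing import List


def solve(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def fit_polynomial(xs: List[float], ys: List[float], degree: int) -> List[float]:
    """Least-squares fit; returns coefficients c[0] + c[1]*x + ... + c[degree]*x**degree."""
    size = degree + 1
    # Normal equations (V^T V) c = V^T y, where V[i][j] = xs[i] ** j.
    vtv = [[sum(x ** (i + j) for x in xs) for j in range(size)] for i in range(size)]
    vty = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(size)]
    return solve(vtv, vty)


def mse(xs: List[float], ys: List[float], coeffs: List[float]) -> float:
    """Mean squared error of the fitted polynomial on the given points."""
    preds = [sum(c * x ** i for i, c in enumerate(coeffs)) for x in xs]
    return sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(ys)


# Synthetic example: delay (minutes) growing quadratically with an hour index.
hours = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
delays = [2.0, 3.0, 6.0, 11.0, 18.0, 27.0]  # generated as hour**2 + 2

linear = fit_polynomial(hours, delays, degree=1)
quadratic = fit_polynomial(hours, delays, degree=2)
```

On this synthetic series the quadratic fit reproduces the points almost exactly, while the
linear fit leaves a visible residual. On real data, a higher degree also raises the risk of
overfitting.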

LIMITATIONS

1. Data Quality and Availability

Machine learning models require high-quality, complete datasets. In flight delay
prediction, missing values, outdated records, or limited access to real-time information (like
weather or air traffic updates) can significantly reduce model accuracy and reliability.

2. Unpredictable External Factors

Delays are often caused by sudden and unpredictable events such as adverse weather,
technical malfunctions, or emergency landings. These factors are difficult to capture or
predict accurately through historical data alone, limiting the model’s effectiveness.

3. Feature Engineering Complexity

Flight delay prediction involves complex features like time-based patterns, airport
traffic, and flight routes. Constructing meaningful input features from raw data requires
domain expertise and careful preprocessing to avoid overfitting or underperformance.

4. Poor Model Generalization

A model trained on data from specific regions, airlines, or time periods may not
generalize well to others. Variations in geography, infrastructure, and airline operations
demand frequent model retraining and validation across different datasets.

5. Real-Time Data Integration Issues

Real-time prediction systems must process live feeds (e.g., weather, air traffic). This
requires robust data pipelines and low latency processing, which can be technically
challenging and prone to failure, affecting timely and accurate predictions.

6. Limited Interpretability of Models

Advanced models like LSTM and Transformers often lack transparency. Their complex
internal workings make it hard to explain predictions to stakeholders, reducing trust and
making it difficult to use them in operational decision-making contexts.

7. Ethical and Legal Challenges

Using flight and passenger data must comply with data protection laws like GDPR.
Additionally, models can unintentionally learn biases from historical data, leading to unfair or
skewed predictions if not properly monitored and corrected.

CHAPTER 3
SYSTEM MODELING
3.1 Data Flow Diagram

The Data Flow Diagram (DFD) for a flight delay prediction system outlines how data
is collected, processed, and used to generate predictions. The system receives inputs from
multiple external sources, such as airline databases and weather services. These inputs
include historical flight data, airport information, aircraft details, and weather conditions.

3.1.1 Data Flow Diagram Level 0

The Level 0 Data Flow Diagram provides a high-level view of a machine learning-based
flight delay prediction system. The system receives flight data (including date, time, airport,
and weather) as input. This data is processed by the ML prediction system, which analyzes
the input using a trained machine learning model. The system then outputs the delay status,
indicating whether a flight is likely to be on time or delayed. This diagram captures the core
function of the system without detailing the internal processes.
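
The input/output contract described for Level 0 can be sketched as below. The field names, the
0.5 probability threshold, and the placeholder rule are hypothetical stand-ins; the trained
model itself is outside the scope of this sketch.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class FlightInput:
    """Input record for the Level 0 view; field names are illustrative assumptions."""
    date: str             # e.g. "2024-01-01"
    time: str             # scheduled departure, e.g. "10:00"
    airport: str          # origin airport code
    visibility_km: float  # a single stand-in for the weather input


def delay_status(flight: FlightInput, model: Callable[[FlightInput], float],
                 threshold: float = 0.5) -> str:
    """Map a model's delay probability to the status label emitted by the system."""
    return "Delayed" if model(flight) >= threshold else "On Time"


def placeholder_model(flight: FlightInput) -> float:
    """Stand-in for a trained model (hypothetical rule, not the project's model)."""
    return 0.9 if flight.visibility_km < 1.0 else 0.1
```

Passing the model in as a callable keeps the interface independent of whichever trained model
is eventually plugged in.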

Figure 3.1 Data Flow Diagram Level 0

3.1.2 Data Flow Diagram Level 1

The Level 1 Data Flow Diagram (DFD) for the flight delay prediction system provides a
detailed view of how the system operates and interacts with external data sources and users.
The system receives input from two main external entities: Airline and Weather Services.
Airline data includes flight schedules, historical delays, aircraft details, and operational
metrics, while weather services supply real-time and forecasted weather conditions that can
affect flight performance. These data sets are processed by the Flight Delay Prediction
System, which uses machine learning algorithms to analyze patterns and generate
predictive insights. In return, users receive delay predictions that
help them make informed decisions.

Figure 3.2 Data Flow Diagram Level 1

3.1.3 SYSTEM ARCHITECTURE

The figure below illustrates a complete machine learning workflow designed to predict flight
delays. The process is divided into three major stages: Data Pipeline, Model
Training, and Deployment & Prediction.

Process Description:
The figure represents a comprehensive end-to-end machine learning pipeline
designed for predicting flight delays. It is structured into three main stages: Data Pipeline,
Model Training, and Deployment & Prediction. Each stage contains multiple steps that
transform raw data into a final prediction output. Within Model Training, the optimized model
is evaluated for accuracy and registered in a model registry for deployment.

1. Data Pipeline

The process begins with the data pipeline, where the system gathers and processes raw flight
data.

Flight Data Sources: The system collects flight-related data from various sources such as
airline databases, airport systems, weather reports, and air traffic logs. This data is essential
for understanding the different factors that can affect flight delays.

Ingestion: Once the data is collected, it is ingested into the system. This step involves
loading data into a centralized location where it can be processed and analyzed.

Data Lake / Database: The ingested data is stored either in a data lake (a repository that can
store structured, semi-structured, or unstructured data) or in a traditional database. This
storage layer serves as the foundation for subsequent data processing.

ETL & Data Cleaning: In this critical step, ETL (Extract, Transform, Load) processes are
applied to clean and transform the data. This includes removing duplicates, handling missing
values, standardizing formats, and ensuring data consistency.

Feature Engineering: After cleaning the data, the system creates new features that can
improve model performance. For example, it might generate features like "time of day," "day
of the week," "weather conditions," or "historical delay patterns" that are useful for predicting
future delays. Once the data is fully prepared, it is split into training and testing datasets for
model development.
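The cleaning and feature engineering steps above can be sketched as follows. This is a minimal illustration, not the project's actual pipeline: the column names (flight_id, scheduled_departure, wind_speed), the sample rows, and the choice of median imputation are all assumptions made for the example.

```python
import pandas as pd

# Hypothetical raw records; column names and values are illustrative assumptions.
raw = pd.DataFrame({
    "flight_id": ["AI101", "AI101", "6E202", "SG303"],
    "scheduled_departure": ["2023-01-02 06:30", "2023-01-02 06:30",
                            "2023-01-07 18:45", None],
    "wind_speed": [12.0, 12.0, None, 20.0],
})

def clean_and_engineer(df: pd.DataFrame) -> pd.DataFrame:
    """Remove duplicates, handle missing values, and derive time features."""
    out = df.drop_duplicates().copy()
    # Rows without a scheduled departure cannot be used, so drop them.
    out = out.dropna(subset=["scheduled_departure"])
    # Fill missing wind speed with the median of the observed values.
    out["wind_speed"] = out["wind_speed"].fillna(out["wind_speed"].median())
    # Standardize the timestamp format, then derive calendar features.
    ts = pd.to_datetime(out["scheduled_departure"])
    out["hour_of_day"] = ts.dt.hour
    out["day_of_week"] = ts.dt.dayofweek  # Monday = 0
    return out.reset_index(drop=True)

features = clean_and_engineer(raw)
print(features)
```

Whether to drop or impute a missing value depends on the field; here the timestamp is treated as essential and the weather reading as recoverable.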

2. Model Training

This stage involves building, tuning, and validating a machine learning model.

Train/Test Split: The cleaned and engineered dataset is split into training and testing sets.
The training data is used to build the model, while the testing data is used to evaluate its
performance.

Machine Learning Model: A machine learning algorithm is selected (such as Decision
Trees, Random Forest, Gradient Boosting, or Neural Networks) and trained on the data.

Hyperparameter Tuning: Once the initial model is trained, its hyperparameters are tuned
using techniques such as Grid Search or Random Search. Tuning helps optimize the model’s
performance.

Optimized Model: The best-performing version of the model, after tuning, is selected as the
optimized model.

Model Evaluation: The optimized model is then evaluated on the test dataset using various
performance metrics like accuracy, precision, recall, F1-score, or RMSE (Root Mean Squared
Error), depending on the nature of the problem.

Model Registry: Once the model passes evaluation standards, it is stored in a model registry.
This registry acts as a version-controlled system where models can be tracked, compared, and
retrieved for deployment.
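The training stage described above (split, train, tune, evaluate) can be sketched with scikit-learn. The data here is synthetic and stands in for the engineered flight features; the grid values and the F1 scoring choice are assumptions for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in for the engineered flight features (assumption).
X, y = make_classification(n_samples=300, n_features=6, random_state=42)

# Train/Test Split: hold out 20% of the rows for the final evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y)

# Hyperparameter Tuning: a deliberately small, illustrative grid.
param_grid = {"n_estimators": [25, 50], "max_depth": [3, 6]}
search = GridSearchCV(RandomForestClassifier(random_state=42),
                      param_grid, cv=3, scoring="f1")
search.fit(X_train, y_train)

# Optimized Model: the best configuration, refit on the full training set.
best_model = search.best_estimator_

# Model Evaluation: score the optimized model on the untouched test set.
y_pred = best_model.predict(X_test)
print("best params:", search.best_params_)
print("accuracy:", accuracy_score(y_test, y_pred))
print("F1:", f1_score(y_test, y_pred))
```

Tuning uses cross-validation on the training set only, so the test set remains an unbiased estimate of performance on unseen flights.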

3. Deployment & Prediction

In the final stage, the trained model is deployed and used to make predictions on new data.

New Flight Data: Fresh flight data is collected in real-time or batch mode for which
predictions are needed.

Preprocess: This new data undergoes the same preprocessing and cleaning steps used during
training to ensure consistency.

Feature Extraction: Features are extracted from the new data using the same methods
applied earlier, ensuring the input format matches what the model expects.

Model API / Web Service: The trained and registered model is deployed as an API
(Application Programming Interface) or a web service. This allows external systems to send
flight data to the model and receive delay predictions in return.

Predict Delay: The deployed model processes the new flight data and returns predictions—
typically indicating whether a flight will be delayed and possibly by how much time.

Predicted Flight Delay: The final prediction is output, which can be shown to users,
integrated into decision-making systems, or used to notify stakeholders.
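One way to guarantee that new data passes through the same preprocessing as the training data is to bundle the preprocessing and the model into a single object. The sketch below assumes a scikit-learn Pipeline serialized with pickle; the toy features, the predict_delay helper, and the in-memory "registry" are hypothetical and only illustrate the idea.

```python
import pickle

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Toy training data: [scheduled departure hour, wind speed]; 1 = delayed.
X_train = np.array([[6, 5], [9, 8], [18, 30], [21, 35], [7, 6], [19, 28]],
                   dtype=float)
y_train = np.array([0, 0, 1, 1, 0, 1])

# The scaler is fitted once and stored with the model, so prediction-time
# inputs are scaled exactly as the training inputs were.
pipeline = make_pipeline(StandardScaler(), LogisticRegression())
pipeline.fit(X_train, y_train)

# Serialized bytes stand in for an entry in a model registry.
blob = pickle.dumps(pipeline)

def predict_delay(model_bytes: bytes, flight: list) -> dict:
    """Hypothetical service handler: load the model and score one flight."""
    model = pickle.loads(model_bytes)
    proba = float(model.predict_proba([flight])[0, 1])
    return {"delayed": proba > 0.5, "probability": round(proba, 3)}

print(predict_delay(blob, [20, 32]))
```

In a real deployment the handler would sit behind a web framework and load the model once at startup rather than on every request.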

This pipeline illustrates a complete machine learning workflow for a real-world problem:
predicting flight delays. It starts from raw data ingestion and ends with a live prediction
system, demonstrating how machine learning models are developed, evaluated, and used in
production environments. Each stage is crucial and interconnected, ensuring the final model
is both accurate and reliable.

Fig 3.3 System Architecture

3.4 SYSTEM REQUIREMENTS

Hardware Requirements

Processor: Intel i5 or AMD equivalent (minimum); Intel i7/i9 or AMD Ryzen (recommended)

RAM: 8 GB (minimum); 16–32 GB (recommended)

Storage: 100 GB free disk space (minimum); SSD with 250+ GB free (recommended)

GPU: Not necessary for traditional ML, but useful for deep learning models such as LSTM

Software Requirements

Operating System: Windows 10/11.

Programming Language: Python

Python Libraries

Data Handling: pandas, numpy, pipeline.

Visualization: matplotlib, seaborn.

Machine Learning: scikit-learn (SVM, Random Forest, Logistic Regression), xgboost

Deep Learning: TensorFlow/Keras (LSTM)

Model Evaluation: sklearn.metrics

Tools & Platforms

IDE: VS Code, Google Colaboratory (Colab)

CHAPTER 4
METHODOLOGY
4.1 Machine Learning

Machine Learning (ML) is a subfield of Artificial Intelligence (AI) that enables systems
to learn patterns from data and make decisions or predictions without being explicitly
programmed. Instead of writing rules manually, ML algorithms learn from examples
provided through data. The key idea is to develop algorithms that can generalize from
historical data to make predictions on new, unseen data.

Types of Machine Learning

Machine Learning is broadly classified into three main categories based on the kind of
learning signal or feedback available to the system:

4.1.1 Supervised Learning

In Supervised Learning, the model is trained using a labeled dataset, meaning that each
training example is paired with the correct output (label). The goal is to learn a mapping from
inputs (features) to outputs (target labels).

Examples:

• Predicting if a flight will be delayed or not (binary classification)

• Estimating the delay time in minutes (regression)

Subtypes:

• Classification: When the output is a category (e.g., Delay or No Delay)

• Regression: When the output is a continuous value (e.g., number of minutes of delay)

4.1.2 Unsupervised Learning

In Unsupervised Learning, the model is given unlabeled data and must discover patterns,
groupings, or structures on its own. There are no output labels provided.

Examples:

• Grouping passengers based on travel habits

• Anomaly detection in flight behavior

4.1.3 Semi-Supervised Learning

Semi-Supervised Learning is a mix of supervised and unsupervised learning. It uses a small
amount of labeled data and a large amount of unlabeled data. This is useful when labeling
data is expensive or time-consuming.

Example:

Using a small set of labeled flight delay records along with a large set of unlabeled flight data
to improve prediction performance.

4.2 Random Forest Classifier

Random Forest is an ensemble learning method that operates by constructing multiple
decision trees during training and outputting the mode of the trees' predicted classes
(classification) or the mean of their predictions (regression). It is less prone to overfitting
than a single decision tree and works well with tabular data. In our implementation, the
Random Forest model was trained with 100 estimators and default parameters. It achieved a
good balance between bias and variance, providing reliable results on unseen test data.

4.2.1 Working with Random Forest Classifier

In flight delay prediction, the algorithm starts by taking the historical flight data, including
features like departure time, wind speed, weather conditions, and airline information, and
creating multiple random subsets of this data (sampled with replacement). For each subset, a
separate decision tree is trained to learn patterns that could indicate whether a flight is likely
to be delayed.

Unlike a single decision tree that might overfit to the data, Random Forest introduces
randomness both in the data it selects and the features it uses, making the overall model more
generalizable. Once all the trees have made their predictions, the algorithm takes a majority
vote: if most trees predict a delay, the final output is "Delayed"; otherwise, it is "Not
Delayed".

This ensemble approach reduces errors, handles missing data well, and provides a strong
performance even with complex or noisy flight datasets. It is especially effective when the
goal is to make accurate classifications using structured, tabular information.
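The mechanism described above (random subsets of rows, random subsets of features, majority vote) can be made explicit with a hand-rolled ensemble. This is a sketch on synthetic data, not the project's model; the number of trees (25) is an arbitrary choice, and scikit-learn's own RandomForestClassifier averages class probabilities rather than counting hard votes.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for historical flight features (assumption).
X, y = make_classification(n_samples=200, n_features=8, random_state=0)
rng = np.random.default_rng(0)

trees = []
for i in range(25):
    # Randomness in the data: a bootstrap sample (rows drawn with replacement).
    idx = rng.integers(0, len(X), size=len(X))
    # Randomness in the features: each split considers a random feature subset.
    tree = DecisionTreeClassifier(max_features="sqrt", random_state=i)
    trees.append(tree.fit(X[idx], y[idx]))

def majority_vote(fitted_trees, X_new):
    """Return 1 ("Delayed") where more than half of the trees predict 1."""
    votes = np.stack([t.predict(X_new) for t in fitted_trees])  # (trees, rows)
    return (votes.mean(axis=0) > 0.5).astype(int)

pred = majority_vote(trees, X)
print("training accuracy of the ensemble:", (pred == y).mean())
```

Using an odd number of trees avoids tied votes in the binary case.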

4.3 Logistic Regression

Logistic Regression is a linear model for binary classification. It estimates the probability that
a data instance belongs to a particular class using the logistic function. This model was
applied to the dataset after feature scaling. Logistic Regression provides a strong baseline for
classification tasks and showed moderately good performance on the flight delay dataset. It is
computationally efficient and interpretable, making it a reliable choice for initial
benchmarking.

4.3.1 Working with Logistic Regression

Logistic Regression is a straightforward and widely used algorithm for binary classification
tasks like predicting whether a flight will be delayed or not. The process begins by collecting
structured data such as departure time, weather, distance, airline, and delay history. The
features are often scaled to improve learning efficiency. Logistic Regression models the
relationship between the input features and the delay status using a logistic (sigmoid)
function, which transforms the result into a probability value between 0 and 1.

If the predicted probability is greater than 0.5, the model classifies the flight as "Delayed";
otherwise, it predicts "Not Delayed". During training, the algorithm adjusts its internal
coefficients to minimize the log loss between predicted probabilities and actual labels using an
optimization method such as gradient descent.
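The sigmoid function, the 0.5 threshold, and the gradient descent update can be written out in a few lines of NumPy. This is a minimal sketch on six made-up, already-scaled data points; the learning rate and iteration count are arbitrary assumptions, and a library implementation such as scikit-learn's LogisticRegression uses a more sophisticated solver with regularization.

```python
import numpy as np

def sigmoid(z):
    """Map any real value to a probability between 0 and 1."""
    return 1.0 / (1.0 + np.exp(-z))

# Toy, already-scaled features: [departure hour, wind speed]; 1 = delayed.
X = np.array([[-1.2, -1.0], [-0.8, -0.9], [0.9, 1.1], [1.3, 0.8],
              [-1.0, -0.7], [0.8, 0.7]])
y = np.array([0, 0, 1, 1, 0, 1])

w = np.zeros(X.shape[1])  # one coefficient per feature
b = 0.0                   # intercept
learning_rate = 0.5
for _ in range(500):
    p = sigmoid(X @ w + b)
    # Gradient of the mean log loss with respect to w and b.
    grad_w = X.T @ (p - y) / len(y)
    grad_b = np.mean(p - y)
    w -= learning_rate * grad_w
    b -= learning_rate * grad_b

proba = sigmoid(X @ w + b)
labels = np.where(proba > 0.5, "Delayed", "Not Delayed")
print(np.round(proba, 3))
print(labels)
```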

4.4 XGBoost (Extreme Gradient Boosting)

XGBoost is an advanced implementation of gradient boosting that is optimized for speed and
performance. It uses a more regularized model formalization to control overfitting, making it
suitable for structured/tabular data. The model was trained using default hyperparameters and
demonstrated superior predictive performance on the test set compared to Random Forest and
Logistic Regression. XGBoost's boosting mechanism allows it to correct the errors of
previous models iteratively, leading to improved overall accuracy.

4.4.1 Working with XGBoost (Extreme Gradient Boosting)

XGBoost, short for Extreme Gradient Boosting, is a powerful machine learning algorithm
known for its high speed and accuracy in structured data problems like flight delay
prediction. It operates by building decision trees sequentially, where each new tree focuses on
correcting the prediction errors of the previous trees.

The workflow begins by feeding historical flight data into an initial simple model. The
algorithm calculates the errors (residuals) and trains the next tree to predict these errors.

This process repeats over several boosting rounds, with each tree making the model more
accurate. A weighted sum of all the trees’ outputs is used to make the final prediction.
XGBoost also includes regularization, which helps prevent overfitting, and supports missing
value handling natively.
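The residual-fitting loop described above can be illustrated with plain gradient boosting for a regression target. Note that this is not XGBoost itself: it omits XGBoost's regularization terms and second-order optimization, and uses scikit-learn regression trees as the weak learners. The synthetic data, learning rate, tree depth, and number of rounds are assumptions.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.uniform(0, 24, size=(200, 1))            # departure hour (synthetic)
y = 10 + 2.0 * X[:, 0] + rng.normal(0, 3, 200)   # delay in minutes (synthetic)

learning_rate = 0.1
# Initial simple model: predict the mean delay for every flight.
prediction = np.full(len(y), y.mean())
initial_mse = np.mean((y - prediction) ** 2)

trees = []
for _ in range(50):
    # Errors of the current ensemble.
    residuals = y - prediction
    # The next tree is trained to predict those errors.
    tree = DecisionTreeRegressor(max_depth=2).fit(X, residuals)
    trees.append(tree)
    # Weighted sum: each tree contributes a small correction.
    prediction += learning_rate * tree.predict(X)

final_mse = np.mean((y - prediction) ** 2)
print(f"MSE before boosting: {initial_mse:.2f}, after: {final_mse:.2f}")
```

The small learning rate means no single tree dominates, which is one of the ways boosting methods limit overfitting.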

4.5 LSTM (Long Short-Term Memory)

LSTM is a type of Recurrent Neural Network (RNN) capable of learning long-term
dependencies. It is well-suited for time-series and sequence prediction problems. For this
project, LSTM was implemented using sequences of wind speed and delay information to
capture temporal dependencies. The data was reshaped into 3D format to fit the LSTM input
structure. It was trained for several epochs with a batch size of 64. Despite being
computationally intensive, LSTM provided valuable insight into sequential patterns
contributing to flight delays. It performed well, particularly on time-sensitive variables.

4.5.1 Working with LSTM (Long Short-Term Memory)

LSTM is a type of Recurrent Neural Network (RNN) that is specifically designed to handle
sequence and time-series data, making it highly suitable for flight delay prediction based on
temporal patterns. The LSTM algorithm works by analyzing sequences of data points, for
example a time series of wind speeds, airport congestion, or previous flight delays.

The input data is reshaped into a 3-dimensional structure representing samples, time steps,
and features. LSTM networks contain special units called memory cells that can retain or
forget information over long periods using internal gates (input, forget, and output gates).

This memory allows the model to learn how past patterns (e.g., consecutive delays or
worsening weather) influence future delays.
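The 3-dimensional input structure (samples, time steps, features) can be produced from a table of consecutive records with a sliding window. The sketch below uses NumPy only; the make_sequences helper, the window length of 3, and the two illustrative features (wind speed and a delayed flag) are assumptions, not the project's actual configuration.

```python
import numpy as np

def make_sequences(series: np.ndarray, time_steps: int):
    """Slice a (n_records, n_features) array into overlapping windows."""
    X, y = [], []
    for start in range(len(series) - time_steps):
        # One sample: `time_steps` consecutive records with all features.
        X.append(series[start:start + time_steps])
        # Target: the delayed flag (last column) of the record after the window.
        y.append(series[start + time_steps, -1])
    return np.array(X), np.array(y)

# Ten consecutive records with two features each: [wind speed, delayed flag].
records = np.column_stack([np.arange(10, 20), np.tile([0, 1], 5)]).astype(float)

X_seq, y_seq = make_sequences(records, time_steps=3)
print(X_seq.shape)  # (7, 3, 2): samples, time steps, features
print(y_seq)
```

An array of this shape can be passed directly to a Keras LSTM layer whose input shape is (time steps, features).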

4.6 Evaluation Metrics

4.6.1 Accuracy

Accuracy is the simplest and most intuitive metric. It measures the proportion of correct
predictions made by the model out of all predictions:

Accuracy = (TP + TN) / (TP + TN + FP + FN)

where:

•TP (True Positives): Correctly predicted "Delayed" flights.

•TN (True Negatives): Correctly predicted "Not Delayed" flights.

•FP (False Positives): Flights incorrectly predicted as "Delayed".

•FN (False Negatives): Flights incorrectly predicted as "Not Delayed".

Example: If the model makes 90 correct predictions out of 100 total cases, the accuracy is
90%.

Good for: Balanced datasets

Not sufficient when classes are imbalanced (e.g., 90% on-time, 10% delayed), because a model
that always predicts "Not Delayed" would still reach 90% accuracy.

4.6.2 Precision

Precision tells us how many of the flights predicted as “Delayed” were actually delayed. It
answers the question: “When the model says a flight is delayed, how often is it correct?”

Precision = TP / (TP + FP)

Example: If the model predicted 50 flights as delayed and 40 were truly delayed, precision =
40/50 = 80%.

Good for: When the cost of false alarms is high (e.g., unnecessary rescheduling)

4.6.3 Recall

Recall measures how many of the actual delayed flights were correctly predicted by the
model. It answers the question: “Out of all actual delays, how many did the model detect?”

Recall = TP / (TP + FN)

Example: If there are 100 delayed flights and the model correctly identifies 80, recall =
80/100 = 80%.

Good for: When missing a delay is critical (e.g., airport planning or safety)

4.6.4 F1 Score

The F1 Score is the harmonic mean of Precision and Recall. It gives a single score that
balances both concerns. It is especially useful when the class distribution is uneven (e.g., far
more “Not Delayed” than “Delayed” flights).

F1 = 2 × (Precision × Recall) / (Precision + Recall)

Example: If a model has a precision of 75% and a recall of 60%, the F1 Score = 2 × (0.75 ×
0.60) / (0.75 + 0.60) ≈ 0.667 or 66.7%.

Good for: When both detecting delays and minimizing false alerts are important—such as in
real-time flight monitoring systems or automated scheduling adjustments.
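The four metrics of this section can be computed directly from the confusion-matrix counts. The helper below is a minimal sketch; the function name and the example counts are hypothetical, chosen to mirror the worked examples in the text and to show why accuracy alone can mislead on imbalanced data.

```python
def classification_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Compute accuracy, precision, recall, and F1 from confusion counts."""
    total = tp + tn + fp + fn
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {
        "accuracy": (tp + tn) / total,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }

# Hypothetical counts: 50 flights flagged as delayed, 40 of them correctly.
flagged = classification_metrics(tp=40, tn=40, fp=10, fn=10)
print(flagged)

# Imbalanced case: 90 on-time and 10 delayed flights, and a model that
# always predicts "Not Delayed".
always_on_time = classification_metrics(tp=0, tn=90, fp=0, fn=10)
print(always_on_time)

# F1 for the precision/recall pair used in the text (75% and 60%).
print(round(2 * 0.75 * 0.60 / (0.75 + 0.60), 3))
```

In the imbalanced case the accuracy is high while recall and F1 are zero, which is why the later metrics are reported alongside accuracy.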

4.7 Dataset creation methodology

This project aggregates flight data from public sources such as Kaggle and airport
databases. Using available APIs and public datasets, historical flight records were
collected over a span of four years. To build the dataset, flight identifiers, schedules, and
actual performance metrics were extracted, resulting in a consolidated dataset of
nearly 3000 records. These entries capture detailed information about flight timings, delays,
and operational attributes during the specified period. The main fields are described below.

Origin Airport and Destination Airport

These codes indicate where the flight starts and ends, as different airports may have varying
congestion levels or operational efficiency.

Scheduled Departure Time

It is crucial, as delays often vary depending on the time of day, and it can also be used to
derive additional features like "Hour of Day".

Actual Departure Time

It is recorded to calculate the Departure Delay, while the Arrival Delay shows how late the
flight landed compared to the schedule.

Scheduled Arrival Time

It refers to the time at which the flight is officially planned to land at its destination airport.

Actual Arrival Time

It is the real timestamp when the aircraft touches down at the destination. The difference
between Scheduled and Actual Arrival Time determines the arrival delay, which is a key
factor in flight delay prediction models.

Carrier

It refers to the airline company that operates a given flight, such as Indigo, Air India, or
SpiceJet.
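The delay fields described above follow directly from the scheduled and actual timestamps. The sketch below uses only the Python standard library. The timestamp format, the sample flight record, and the 15-minute cut-off for labelling a flight as delayed are assumptions; the report does not state the threshold it used.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M"       # assumed timestamp format
DELAY_THRESHOLD_MIN = 15     # assumed cut-off, not taken from the report

def delay_minutes(scheduled: str, actual: str) -> float:
    """Minutes between the scheduled and the actual time (negative if early)."""
    delta = datetime.strptime(actual, FMT) - datetime.strptime(scheduled, FMT)
    return delta.total_seconds() / 60

# Hypothetical record using the fields listed in this section.
flight = {
    "carrier": "Indigo",
    "scheduled_departure": "2023-03-01 09:00",
    "actual_departure": "2023-03-01 09:10",
    "scheduled_arrival": "2023-03-01 11:00",
    "actual_arrival": "2023-03-01 11:25",
}

dep_delay = delay_minutes(flight["scheduled_departure"], flight["actual_departure"])
arr_delay = delay_minutes(flight["scheduled_arrival"], flight["actual_arrival"])
# Derived feature mentioned above: hour of day of the scheduled departure.
hour_of_day = datetime.strptime(flight["scheduled_departure"], FMT).hour
status = "delayed" if arr_delay > DELAY_THRESHOLD_MIN else "on-time"

print(dep_delay, arr_delay, hour_of_day, status)
```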

CHAPTER 5

SYSTEM IMPLEMENTATION

5.1 MODULE DESCRIPTION:

MODULES:

This project is divided into five modules:

• Data Collection and Preprocessing
• Feature Extraction
• Model Selection and Training
• Model Evaluation
• Model Deployment

5.1.1 Implementation

import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from xgboost import XGBClassifier
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score)
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense, Dropout
from tensorflow.keras.optimizers import Adam

Pandas and numpy:

It starts by importing pandas and numpy, which are foundational libraries for handling data
structures and numerical computations.

Scikit-learn – Model Selection and Preprocessing:

train_test_split from Scikit-learn is used to divide the dataset into training and testing
subsets, which is essential for evaluating a model’s performance on unseen data.

LabelEncoder is used to convert categorical text labels (like airport codes or status labels)
into numerical form, making them suitable for machine learning algorithms.

StandardScaler standardizes numerical features by removing the mean and scaling to unit
variance, which helps improve model training and convergence for algorithms sensitive to
feature scaling.
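A short example shows what the two preprocessing classes produce. The airport codes and distances below are made-up values for illustration.

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder, StandardScaler

# Illustrative origin airport codes (assumed values).
origins = ["MAA", "DEL", "BOM", "DEL"]
encoder = LabelEncoder()
origin_enc = encoder.fit_transform(origins)
print(encoder.classes_)  # classes are stored in sorted order
print(origin_enc)        # each code is replaced by its index in classes_

# Illustrative flight distances in km (assumed values), one feature column.
distances = np.array([[500.0], [1500.0], [1000.0], [1000.0]])
scaler = StandardScaler()
scaled = scaler.fit_transform(distances)
print(scaled.ravel())    # zero mean, unit variance
```

In practice both objects are fitted on the training data only and then reused, via transform, on test data and on new flights, so that the same code-to-integer mapping and the same mean and variance are applied everywhere.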

Scikit-learn – Models:

RandomForestClassifier is an ensemble machine learning algorithm that builds multiple
decision trees and combines their outputs for more accurate and stable predictions. It’s
particularly useful for handling non-linear data and capturing feature importance.

LogisticRegression is a linear classification algorithm used for binary classification tasks,
such as predicting whether a flight will be delayed or on time.

XGBoost (XGBClassifier):

XGBoost is an optimized gradient boosting framework that builds powerful predictive
models by combining multiple weak learners (typically decision trees).

It is well-known for its performance and efficiency in structured data problems and often
outperforms traditional machine learning models in classification tasks like flight delay
prediction.

Scikit-learn Evaluation Metrics:

accuracy_score, precision_score, recall_score, and f1_score are used to evaluate the
performance of classification models. These metrics provide insight into how well the model
is making predictions. For instance, precision and recall help understand the trade-off
between false positives and false negatives, which is important in real-world delay prediction
systems.

TensorFlow Keras:
Sequential is a Keras model type that allows stacking layers in a linear manner. The LSTM
(Long Short-Term Memory) layer is a type of recurrent neural network (RNN) ideal for time-
series or sequential data, such as predicting delays based on a sequence of historical records.
Dense layers are fully connected neural network layers that process information between
neurons. Dropout is a regularization technique that randomly disables neurons during training
to reduce overfitting.

5.1.2 Dataset uploading

import pandas as pd
file_path = "/content/drive/MyDrive/flight delay prediction/Airline Delay (1).csv"
df = pd.read_csv(file_path)
df.head()

This snippet imports pandas, loads a CSV file containing airline delay data from Google Drive,
and stores the data in a DataFrame called df. It then displays the first five rows to give a quick
look at the dataset's structure and contents.

5.1.3 Data preprocessing

Figure No. 5.1 Data Preprocessing

Data preprocessing is a vital step in machine learning that involves preparing and
transforming raw data into a clean and organized format suitable for modeling. Since real-
world data often contains missing values, inconsistencies, and noise, preprocessing helps
improve the quality and reliability of the data. This process typically includes cleaning the
data by handling missing or incorrect entries, normalizing or scaling numerical features to
ensure uniformity, and converting categorical variables into a numerical format through
encoding techniques. Additionally, feature selection or extraction may be applied to identify
the most relevant information for the model, enhancing both accuracy and efficiency.

file_path = '/content/drive/MyDrive/flight delay prediction/Airline Delay (1).csv'
df = pd.read_csv(file_path)

# Function to preprocess numeric columns
def preprocess_column(col):
    if pd.api.types.is_numeric_dtype(col):
        # Replace values that are not non-negative integers with NaN
        col = col.apply(lambda x: x if pd.notnull(x) and x >= 0 and
                        float(x).is_integer() else np.nan)
        # Drop the NaN entries and convert the remaining values to integers
        col = col.dropna().astype(int)
    # Non-numeric columns are returned unchanged
    return col

For numeric columns, the function checks each value:

• Keeps the value if it is not null, non-negative, and a whole number.
• Replaces anything else with NaN.
• Drops any rows with NaN and converts the remaining values to integers.

For non-numeric columns, it returns them unchanged.

5.1.4 Feature Extraction

Figure No. 5.2 Feature Engineering

# Add features to DataFrame

df['origin'] = origins
df['destination'] = destinations
df['scheduled_departure_time'] = scheduled_departure
df['actual_departure_time'] = actual_departure
df['scheduled_arrival_time'] = scheduled_arrival
df['actual_arrival_time'] = actual_arrival
df['flight_status'] = flight_statuses

The goal is to prepare new flight-related features, such as departure and arrival times, origin
and destination airports, and flight status, and add them as new columns to an existing
DataFrame (df). To do this, the code first calculates the number of rows in the DataFrame and
initializes several empty lists to store the new values for each row.

These lists include scheduled_departure, actual_departure, scheduled_arrival, actual_arrival,
origins, destinations, and flight_statuses. However, there is a critical error in the code: the list
flight_statuses is used but never initialized, so it must be created as an empty list alongside
the others before any values are appended to it.

5.1.5 Model selection and Training

Figure No. 5.3 Model Selection and Training

# ✅ New feature list with actual departure
features = [
    'origin_enc', 'destination_enc', 'carrier_enc',
    'sched_dep_min', 'sched_arr_min', 'actual_dep_min',
    'year']

# Train model
self.model = XGBClassifier(eval_metric='logloss')
self.model.fit(X, y)

# Save model and encoders
drive_path = '/content/drive/MyDrive/model.pkl'
with open(drive_path, 'wb') as f:
    pickle.dump({
        'model': self.model,
        'le_origin': self.le_origin,
        'le_dest': self.le_dest,
        # ... remaining entries are not shown in this excerpt
    }, f)

The load_and_train method loads the dataset from a specified CSV file, checks for the
presence of critical columns such as actual_departure_time, and drops rows with missing
values in any essential fields. Then, it performs feature engineering by converting scheduled
and actual departure times to minutes and encoding categorical variables using the label
encoders.

A binary label is created based on the flight_status column, mapping 'on-time' to 0 and
'delayed' to 1. These processed features, such as encoded airports, carrier, departure times,
and the year, are used to train an XGBClassifier, which is a gradient boosting model well
suited for classification tasks. After training, the model and encoders are serialized with the
pickle module and saved as a .pkl file on Google Drive, enabling future reuse without retraining.
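The feature engineering steps described for load_and_train can be sketched on a small DataFrame. This is an illustration only: the to_minutes helper, the "HH:MM" time format, and the sample rows are assumptions, while the column names origin_enc, sched_dep_min, and actual_dep_min and the label mapping follow the text above.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

def to_minutes(hhmm: str) -> int:
    """Convert an 'HH:MM' string (assumed format) to minutes after midnight."""
    hours, minutes = hhmm.split(":")
    return int(hours) * 60 + int(minutes)

# Hypothetical rows; the third one lacks an actual departure time.
df = pd.DataFrame({
    "origin": ["MAA", "DEL", "MAA"],
    "scheduled_departure_time": ["06:30", "18:45", "23:10"],
    "actual_departure_time": ["06:30", "19:20", None],
    "flight_status": ["on-time", "delayed", "delayed"],
})

# Drop rows with missing values in essential fields.
df = df.dropna(subset=["actual_departure_time"]).copy()

# Encode the categorical airport code as an integer.
le_origin = LabelEncoder()
df["origin_enc"] = le_origin.fit_transform(df["origin"])

# Convert scheduled and actual departure times to minutes.
df["sched_dep_min"] = df["scheduled_departure_time"].map(to_minutes)
df["actual_dep_min"] = df["actual_departure_time"].map(to_minutes)

# Binary label: 'on-time' -> 0, 'delayed' -> 1.
df["label"] = df["flight_status"].map({"on-time": 0, "delayed": 1})
print(df[["origin_enc", "sched_dep_min", "actual_dep_min", "label"]])
```

The fitted encoder has to be saved together with the model, as the text describes, because new flights must be encoded with the same airport-to-integer mapping.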

5.1.6 Model Evaluation

Random Forest

from imblearn.over_sampling import SMOTE

X = final_df[['origin_enc', 'destination_enc',
              'scheduled_departure_min', 'scheduled_arrival_min']]
y = final_df['flight_status'].map({'on-time': 0, 'delayed': 1})

# Split the dataset into train and test sets
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Oversample the minority class in the training set only.
# Assumed step: the listing uses X_train_smote / y_train_smote without
# defining them, so SMOTE from imbalanced-learn is used here.
X_train_smote, y_train_smote = SMOTE(random_state=42).fit_resample(
    X_train, y_train)

# Create the Random Forest model with tuned hyperparameters
rf_model = RandomForestClassifier(n_estimators=200, max_depth=10,
                                  random_state=42, class_weight='balanced')

# Fit the model
rf_model.fit(X_train_smote, y_train_smote)

# Predict on the test set
y_pred = rf_model.predict(X_test)

# Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
print(f"✅ Accuracy: {accuracy:.4f}\n")

XGBoost

features = ['Origin_enc', 'Destination_enc',
            'Scheduled_Departure_Min', 'Scheduled_Arrival_Min',
            'Dep_Delay_Min']  # Removed Arr_Delay_Min
X = df[features]
y = df['Flight_Status_Label']

# Train-test split (stratified to preserve the class ratio)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y)

# Ratio of negative to positive samples, used to weight the minority class
scale_pos_weight_value = (y_train == 0).sum() / (y_train == 1).sum()

# Train XGBoost Classifier; scale_pos_weight is passed so the computed
# ratio takes effect
model = XGBClassifier(eval_metric='logloss', random_state=42,
                      scale_pos_weight=scale_pos_weight_value)
model.fit(X_train, y_train)

# Predict and evaluate
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
