The document contains details of 4 students including their names and university roll numbers. It also contains an introduction to machine learning based flight delay prediction. Key steps discussed include problem definition, dataset used containing over 100,000 flight records, modeling approaches including logistic regression, decision tree and random forest, and future work on improving models and predicting taxi delays.
Netaji Subhash Engineering College
• Name: Samir Akhter, Univ Roll: 10900216026
• Name: Ramesh Kumar, Univ Roll: 10900216031
• Name: Saurabh Kumar, Univ Roll: 10900216022
• Name: Payal Kumari, Univ Roll: 10900216037
NETAJI SUBHASH ENGINEERING COLLEGE
Machine Learning
FLIGHT DELAY PREDICTION

Roadmap
• Machine Learning
• Introduction
• Application
• Problem Definition
• Dataset Used
• Our Approach
• Data Visualization
• System Design
• Future Work
• Conclusion

What is Machine Learning?
• An application of artificial intelligence (AI).
• Lets a system automatically learn and improve from experience without being explicitly programmed.
• Needs a large volume of data to find patterns, learn from them, and make predictions based on what it has learned.

What is Flight Delay?
• Delay is one of the most annoying things we face in day-to-day life.
• A flight delay is simply the difference between the scheduled time and the actual time of departure or arrival of a plane.

Application
• Imagine you're scheduled to fly out of Delhi four hours from now and there's a storm on the route. The airline could use this information to determine that the storm is likely to delay your plane. It could then contact you and offer you a seat on an alternate flight. You save time, and the airline doesn't have to accommodate you on a later flight after the delay happens.

Problem Definition
• Flight delay prediction has been studied extensively. Some studies treat it as a regression problem, predicting the delay in minutes, and others as a classification problem, predicting the time interval in which the delay will fall.
• In this project, two types of prediction are considered: regression, where the continuous output is an estimate of the arrival delay, and classification, where the output is a binary prediction of whether the arrival delay is more or less than a predefined threshold.

Dataset Used
The results presented here were obtained using data from www.umetrip.com. To get the flight information, we developed a crawler to collect data from the website. We had access to a large dataset of 119,432 regular commercial passenger flights performed from November 3, 2015 to March 5, 2016. The database provides detailed data for each individual flight by phase of flight, airport weather data, and arrival and departure times.

Our Approach
After reviewing the previous work on this problem, we were encouraged by it and curious to take it further. We implement the same algorithms to build machine learning models that predict flight delay, and we aim for the highest accuracy we can achieve. We'll train the following models:
• Logistic Regression
• Decision Tree
• Random Forest

Logistic Regression
• A classification algorithm.
• Used to predict a binary outcome (1/0, Yes/No, True/False) given a set of independent variables.
• To represent binary and categorical outcomes, we use dummy variables.
• Closely related to linear regression: a generalized linear model for the case where the outcome variable is categorical.

Random Forest Classifier
• A type of ensemble classifier.
• Also a supervised learning algorithm.
• Builds a set of decision trees and combines their predictions.

Decision Tree
• Belongs to the family of supervised learning algorithms.
• Unlike some other supervised learning algorithms, a decision tree can be used for both regression and classification problems.
• The general motive for using a decision tree is to create a model that predicts the class or value of the target variable by learning decision rules inferred from prior data (the training data).

System Design
• The first step is the conversion of the raw data into processed data.
• This starts with feature extraction, where the key attributes are extracted from the whole list of attributes available in the raw dataset.
• Feature extraction is followed by the classification stage, in which the extracted data is split into two distinct segments: training data and test data.
• The split is made so that the training data has a higher proportion than the test data.

Steps for machine learning process

Real life working model

User Interface
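The Problem Definition and System Design slides can be sketched in code roughly as follows. This is a minimal illustration using pandas and scikit-learn, not the project's actual implementation. The column names (`airline`, `dep_hour`, `wind_speed`, `arrival_delay_min`), the 15-minute threshold, the 80/20 split, and the model settings are all assumptions, and the records are synthetic placeholders rather than the umetrip.com data.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Assumed threshold in minutes; the slides do not state a value.
DELAY_THRESHOLD_MIN = 15


def make_synthetic_flights(n=2000, seed=0):
    """Generate placeholder flight records (NOT the umetrip.com dataset)."""
    rng = np.random.default_rng(seed)
    df = pd.DataFrame({
        "airline": rng.choice(["AA", "BB", "CC"], size=n),  # hypothetical codes
        "dep_hour": rng.integers(0, 24, size=n),
        "wind_speed": rng.uniform(0, 30, size=n),
    })
    # Stand-in for (actual arrival time - scheduled arrival time), in minutes.
    df["arrival_delay_min"] = (
        0.8 * df["wind_speed"] + 0.5 * df["dep_hour"] + rng.normal(0, 5, size=n)
    )
    return df


def train_and_score(df, threshold=DELAY_THRESHOLD_MIN, seed=0):
    """Split the data, fit the three classifiers, return test accuracies."""
    # Classification target: 1 if the arrival delay exceeds the threshold.
    y = (df["arrival_delay_min"] > threshold).astype(int)

    # Feature extraction: keep the chosen attributes and turn the
    # categorical one into dummy variables.
    X = pd.get_dummies(
        df[["airline", "dep_hour", "wind_speed"]],
        columns=["airline"],
        dtype=float,
    )

    # Training data gets the larger share (80/20 here, an assumed ratio).
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=seed, stratify=y
    )

    models = {
        "logistic_regression": LogisticRegression(max_iter=1000),
        "decision_tree": DecisionTreeClassifier(max_depth=5, random_state=seed),
        "random_forest": RandomForestClassifier(n_estimators=100, random_state=seed),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = model.score(X_test, y_test)  # accuracy on held-out data
    return scores, len(X_train), len(X_test)


if __name__ == "__main__":
    scores, n_train, n_test = train_and_score(make_synthetic_flights())
    print(f"train rows: {n_train}, test rows: {n_test}")
    for name, acc in scores.items():
        print(f"{name}: accuracy = {acc:.3f}")
```

For the regression formulation, the same split could be reused with `arrival_delay_min` as the target and a regressor such as `DecisionTreeRegressor` in place of the classifiers.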
Future Work
• As further work, we would like to improve our model, perhaps with more training data, a deeper neural network, or both.
• Taxi-delay prediction is a natural progression of this work, considering the amount of fuel wasted while taxiing.
• Accurate taxi-delay prediction requires taking airport runway and taxiway configurations into consideration, an area where very little work exists.

Conclusion
• This project and its analysis will be useful not only from the passengers' point of view, but for every decision maker in the aviation industry.
• Apart from the financial losses incurred by the industry, flight delays also give airlines a negative reputation and decrease their reliability.
• The analysis carried out here not only predicts delays based on previously available data, but also gives a statistical description of airlines, their rankings based on on-time performance, and delays with respect to time, showing the peak hours of delay.

Thank You