Bus Optimization
Bus Optimization
net/publication/330877942
CITATIONS READS
5 3,280
6 authors, including:
Mehmet Dalkilic
Indiana University Bloomington
85 PUBLICATIONS 631 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
DCEM: Clustering for Multivariate and Univariate Data Using Expectation Maximization Algorithm (R package) View project
All content following this page was uploaded by Hasan Kurban on 20 July 2019.
(b)
(c)
Fig. 2: Example of GPS data (a), original APC data (b), and
cleaned APC data (c). We took the data generated from IUCBS
and represented it in a relational model with supporting tables.
Before data reduction there were ~96,000,000 GPS data points
from 2016-04-08 to 2018-04-11 and ~7,000,000 APC data
points from 2017-08-21 to 2018-04-01, afterwards there are Fig. 3: Map of IUCBS routes with a partial satellite image of
16,877,381 GPS data points and 640,597 APC data points. the center. Each route begins and ends at a single point north
of campus. They are composed of a series of stops represented
by the black and yellow dots. Some stops are located in busier
discussion is as follows: background, methodology, results, areas of campus (see Fig. 12) and the travel through that
related work, summary and future work. area can affect the overall time of the bus at certain times of
day. The A route for example is approximately 7.48 km and
II. BACKGROUND circumscribes the campus of 7.82 km2 , and the terminus is
A. IU Campus Bus Service (IUCBS) Memorial Stadium. The yellow arrows point to same location.
Indiana University is located in the city of Bloomington, The satellite image shows the numbers of different paths and
Indiana, USA. Bloomington currently has a population of modes of transportation that students can choose.
around 85K, making it the 7th largest city in Indiana. Indiana
University itself has an enrollment of around 50K students
with an additional 5K faculty and staff. To model our system
for testing we are using data (∆) from IUCBS which is the
management PT arm of Indiana University. ∆GP Si initially is
a 1-2 sec snapshot of a bus’ position shown in Fig. 2a. ∆AP Ci
records passenger events at every bus stop on every route
shown in Fig. 2c. IUCBS has 25 total buses, with 19 buses
operating at peak times running four routes on any given week-
day, with the A route accounting for approximately 45% of
total ridership. Total system ridership is approximately 300K
passengers per month. Additionally, there is a municipality
owned and managed bus service that slightly overlaps with
IUCBS. There is currently not any coordination between the
two and, therefore, we do not include this in our work.
The goal for IUCBS is to increase the quality of service Fig. 4: Graph percentage of transportation options used by IU
available to passengers while minimizing costs. Measuring students on campus [28].
quality directly is prohibitively difficult and falls outside the
current scope of this research. Some remarks, however, should
be made. To move between classes, students have many
choices: walking, biking, driving, public transportation, taxi
service, or some combination these. Furthermore, there are
numerous paths that students can choose as well. Without
consideration of campus registration fees e.g., bike registration
is $20, the choices are relatively indistinguishable with respect
to cost: the city transportation fee is $.50 per day translating
Fig. 5: Measuring goodwill. (Left) Illustrative indifference
into about $40-$50 per semester while the others are free.
curve. The arrow points to maximum utility. (Right) Goodwill
Biking is appealing, since both the campus and city have ded-
is done implicitly through the difference between the posted
icated bike paths (https://tinyurl.com/ybu78rrt). Alternatively,
time and actual arrival time. Currently, we measure goodwill
the campus provides Zipcars (www.zipcar.com), and vehicle
as a number in [0, 1] and is a function of this difference (A).
registration comes with access to student parking. We point
Likely, the actual goodwill is something akin to (B) and will
to two approaches to assessing goodwill: microeconomics via
be part of our future work in addition to studying the degree
utility [26] or directly as a function of some distinguished
of the quality of this model affects the optimization.
factors [21] [31] [24]. For utility (Fig. 5 (Left)), we can
build indifference curves constructed over two axes: choosing
the campus bus and alternative transportation. The utility Challenge Solution
(combination of choices) is determined by, in this case, time–
the student desires to maximize utility using equi-marginal Wide variation in Use ∆ from past to adjust service throughout
traffic density and the day through optimization techniques
principle [26]. Determining these family of curves in this passenger load
scenario, however, is overly complex for our needs, and the throughout the day
benefit is unlikely to exceed the cost of building the curves.
A simpler, more straightforward approach is directly survey- Timetable does not ∆GP S is used to fill in timetable
match actual times of
ing riders. In [18] (Fig. 5 (Right)), the authors, for example, stops
determine goodwill (customer satisfaction) from service, ac-
cess, availability, time, and environment–commonly regarded Schedule does not line Use ∆AP C to determine when students
determinants for goodwill. In our setting, surveying is not up with passenger’s arrive at stops
needs (e.g. stop is not
feasible. We determine goodwill solely by a function of time. close to beginning or
We provide two curves that model goodwill as a real-valued end of classes)
[0, 1] scalar function where 0 means the passenger will not
enter and 1 means the passenger always enters. The curve IUCBS operates with Reduce optimization problem by eliminating
a fixed cost (fixed terms that IUCBS has no control over
is effectively a function of the difference between the posted fleet size, etc.), while
schedule time (expected) and observed bus arrival time. We most frameworks
currently presume a flat curve (5A) that drops to zero when include cost in their
optimization
outside ±[4, 5] minutes, but likely it is somewhat Gaussian in
shape (5B). Our intent is to model goodwill as a probability Passenger demand Implement Ghost Bus service to alleviate
in the future, but the complexity of the optimization lead us to causing platooning demand during troublesome periods. The
believe this simplification would not affect the initial outcome– and overflowed buses Ghost Bus is a bus not assigned to a route that
alleviates issues by assigning it dynamically to
which is consistent with our results. routes that are underserved during times of
Additionally the University has mandated that operations high demand
across the state-wide institution address sustainability; for
IUCBS this means reducing the PT carbon footprint as well. TABLE I: Challenges faced by IUCBS and our solutions.
Before our solution, IUCBS used a schedule developed heuris- Similar challenges exist for other transportation problems, as
tically that did not take into account the wealth of data well as within different domains. The challenges presented
available to them. We will propose a method that takes into will eventually inform the choice of optimization.
account these factors and optimizes the bus service on a
college campus using several data driven algorithms.
to the usual 2-3 so-called rush hour periods that a traditional
B. Previous Work transit system experiences. Indiana University operates on a
IUCBS, like any other college campus bus service [34] [27], fixed class schedule starting at 8:00 a.m., and proceeds in
faces several unique challenges to schedule creation. Campus one hour and five minute increments until approximately 2:30
transportation services can vary wildly, but all are focused p.m.; consequently, the service experiences peak demand and
primarily on serving the student population. [11] The first traffic several times throughout a given service day that can
challenge for non-urban college campuses is the existence of be addressed by algorithms that take into account past data.
several periods of high traffic throughout the day, in contrast Traditional heuristic scheduling practices, with fixed de-
parture intervals, and fixed intervals between stops, do not
adequately address this challenges faced by IUCBS. The result
is a schedule that is sub-optimal for both consumers, and
bus operators. The consumer is unable to trust the schedule,
reducing passenger satisfaction since the bus is liable to be
exceptionally late during times of peak demand or unaccept-
ably early during down times. As well passengers might be
unable to determine if their regularly scheduled bus will be
overfilled. Furthermore, the typical scheduling process does
incorporate local events such as the end of classes, instead
relying on the set schedule hopefully lining up with passenger
demand without actual data to corroborate. The bus operators
are also unable to count on regular breaks which conflicts with
their collective bargaining agreement specifying a minimum
percentage of a scheduled trip be recovery time.
Before the advent of large data sources of GPS or APC data, Fig. 6: Data Flow and Process. A bus collects GPS and pas-
the transit scheduling problem was solved heuristically with a senger data along a route (orange arrows indicate movement)
rigid schedule that could not adapt to the shifting passenger that is stored on the bus (A). At the end of the route, the data is
demand. Data driven methods typically use load profile data, uploaded (B), cleaned, transformed, and reduced (C), driving
from either manual or ∆AP C (Automated passenger count algorithms (E) for schedule production. The subsequent year’s
data) and ∆GP S (GPS data) from the buses. [15] Previous data is used to assess the schedule (F).
work has even explored dynamic scheduling of trips from
∆GP S [13]. Using data generated from bus services can
show that historical methods of generating schedules, without include the bus drivers and supervisors in the process and
incorporating passenger data, are insufficient in modelling communicate with them the results at each step. They then
actual transit times [16]. These traditional methods do not can suggest changes based off of their own personal heuristics
adequately address challenges experienced by IUCBS–and that can improve the scheduling.
likely other similar urban areas. We found no other methods When modeling this kind of transportation, we face an
that incorporated both APC and GPS data as part of their optimization problem whose dimensions are at odds with
model from beginning to end for scheduling. each other. On the one hand, for example, we want to reduce
travel time, the number of buses, and the fuel used. On the
III. M ETHODOLOGY other we want to increase passenger count and goodwill. The
In Fig. 6 we provide an overview of the solution. As a most common approach is to ultimately focus reducing time,
bus moves along a route (6A) its transponder emits a signal bounding other dimensions by physical or optimally required
that, when next to a stationary drum (stationed along routes), values e.g., number of passengers,buses, or travel time. One
triggers a save (locally on the bus computer). A set of dual of the many formulations of the optimization problem for
lasers placed at the entry and exit trigger GPS-time data that designing both routes and timetables can be described as [3] :
is saved as well. Events are then assigned and grouped by
stop (not a physical stop, but a location that has two different Xn X
n X
times with one GPS). A running total of passengers on a M in{c1 [ dij tij ] + c2 [ fk Tk ]} (1)
bus is computed at each stop and cleared at the completion j=1 i=1 all k∈SR
25
Fall 2016 Std Dev Spring 2016: 2.746
Spring 2017 Mean Fall 2016: 0.01 ●
●
● ●
● ●
●
20
●
0.15
●
●
● ●
Frequency
●
0.10
15
●
0.05
10
0.00
−10 −5 0 5 10 67 39 38 37 41 1 4 6 8 10 11 12 13 14 36 30 35
Fig. 8: Bus punctuality on the A route across several semesters. Fig. 9: A Route load profile. Data was taken from APC data
Spring 2016 was the last semester before this project, Fall 2016 accumulated over almost a full year. After removing outliers,
was the initial first pass solution, and Spring 2017 was after the we averaged the ridership on each stop of each route by both
full optimization solution. The most important measure of bus time and day. This figure shows the average of those plots for
punctuality is the standard deviation being low - representing the A Route Monday-Thursday.
a tighter grouping of when the buses actually arrive around
the posted schedule. This shows the improvement we’ve made
over the traditional schedule using ∆GP S and ∆AP C develop our current schedule, we have the opportunity to use
current data to evaluate that same schedule - some of which
is explained here.
used as a cushion in the event of late buses. We were able to To determine the efficiency of the schedule we generated
make these changes without adjusting break times or causing a large set of visualizations that allowed the bus operators
more work for our drivers. Given that the new schedule should to determine how well the schedule was meeting their goals.
accurately predict demand, and transit times, it should further We leveraged ∆AP C to determine the mean passenger load
result in a larger amount of usable break times for drivers, at each stop (See fig. 9), as well as outlier/full bus detection
as the recovery time is needed less as a cushion. As such, it to observe both the quality of the data as well as potential
should be possible to reduce the recovery percentage in the problem areas. Additionally we used ∆GP S to keep track
future to be more in line with the industry standard, without during the semester to figure out if the schedule was correctly
significantly effecting driver break time. estimating bus travel times (See Fig. 10). We also kept track
In addition the new schedule improved the accuracy of of platooning, where 2 buses serving the same route are close
the schedule. Fig. 8 shows the improvement overall from the enough to impact service. (See Fig. 11 ) Platooning causes both
old schedule to the new in relation to when buses actually longer wait times and insufficiently filled buses and can happen
arrive at their stops, i.e. the difference in punctuality has been during periods of intermittent and excessive traffic conditions.
significantly reduced.
level
60000
40000
20000
Fig. 10: Indiana Memorial Union (IMU) (the most shared Fig. 11: A Route platooning heatmap. Data was generated
stop). Actual stop times difference from posted times (A from the GPS data set. We applied a filter where we tracked
Route). This plot shows the difference between the schedule whether buses were within 100 feet of each other going the
and the actual arrival times of buses. We took the set of points same direction on the same route within 5 seconds. Each
of buses on each route within 50 feet of the stop of interest. instance of this was added as an example of platooning
The earliest point for each set of points was selected as the and added to the plot as an average of the two buses GPS
arrival time for that particular bus. We then calculated the coordinates.
difference between the arrival time and the closest scheduled
stop on the timetable. experience viz, if a passenger arrives to the bus stop according
to the schedule, but the bus left 2 minutes early, there could
be an extremely long wait for the next bus. In fact, because
part of the transit problem. One such involves using Constraint of this tandem phenomenon, arriving later is better than early,
Programming, a programming paradigm that codes the con- which is accounted for in our model.
straints of the problem to develop a solution using software VII. F UTURE W ORK
packages [4]. Other ideas involve using Mixed Integer Non-
We are currently researching ways to use the data from a
Linear Programming (MINLP) a group of techniques that
university’s Registrar’s Office to determine the student traffic
solve an objective function and a series of constraints. [22]
on campus more accurately, since using ∆AP C alone is
[33] One interesting approach for scheduling fixes cost of
insufficient to determine the actual times passengers arrive at,
waiting to be identical for all passengers for the model in
or wish to arrive at the bus stop for travel to their destination.
[12]; however, their model includes knowing when passengers’
As shown in Fig. 12 there is a large difference in needs for
desired travel times instead of uniformly distributing them as
buses at particular times on a general campus. We also are
ours (See Future Work). Several surveys of the literature in this
designing an app that allows students to signal either their
area have a complete summary of these methods and others
location or intended location to create a dynamic schedule.
[6] [19] [7].
Lastly, we are working several university towns similar to
Measuring customer satisfaction in regards to scheduling is
Bloomington to help improve PT.
central to many transportation industries, especially airlines.
Acknowledgements The authors thank the anonymous re-
Regularly the “best” airlines for on-time flights are published
viewers for their informative, encouraging comments and
and are a critical determiner of customer satisfaction. There
suggestions. This work was partially supported by NCI Grant
is a concerted effort to have a large percentage of flights
1R01CA213466-01.
being on-time compared to the schedule, which can cause
airlines to schedule later arrivals to meet the deadlines more R EFERENCES
often https://tinyurl.com/ybsckdad. In our model, however, we [1] Luca Allulli, Giuseppe F. Italiano, and Federico Santaroni. Exploiting
established that early arrivals are detrimental to the customer gps data in public transport journey planners. pages 295–306, 2014.
[13] Daniel Delling, Giuseppe F. Italiano, Thomas Pajor, and Federico
Santaroni. Better transit routing by exploiting vehicle gps data. In
Proceedings of the 7th ACM SIGSPATIAL International Workshop on
Computational Transportation Science, IWCTS ’14, pages 31–40, New
York, NY, USA, 2014. ACM.
[14] SE Elias. The use of digital computers in the economic scheduling for
both man and machine in public transportation. 1964.
[15] Reza Zanjirani Farahani, Elnaz Miandoabchi, W.Y. Szeto, and Hannaneh
Rashidi. A review of urban transportation network design problems.
European Journal of Operational Research, 229(2):281 – 302, 2013.
[16] Donatella Firmani, Giuseppe F. Italiano, Luigi Laura, and Federico
level Santaroni. Is timetabling routing always reliable for public transport?
33:15–26, September 2013.
20000 [17] Peter G Furth, Brendon Hemily, Theo HJ Muller, and James G Strath-
man. Using archived avl-apc data to improve transit performance and
10000
management. 2006.
[18] Maria Elisa Alen Gonzalez, Lorenzo Rodriguez Comesana, and Jose
Antonio Fraiz Brea. Assessing tourist behavioral intentions through
perceived service quality and customer satisfaction. Journal of Business
Research, 60(2):153–160, February 2007.
[19] Valrie Guihaire and Jin-Kao Hao. Transit network design and schedul-
ing: A global review. Transportation Research Part A: Policy and
Practice, 42(10):1251 – 1273, 2008.
[20] Knut Haase, Guy Desaulniers, and Jacques Desrosiers. Simultaneous ve-
hicle and crew scheduling in urban mass transit systems. Transportation
science, 35(3):286–303, 2001.
[21] Rabiul Islam, Mohammed S Chowdhury, Mohammad Sumann Sarker,
and Salauddin Ahmed. Measuring customer’s satisfaction on bus
transportation. 2014.
Fig. 12: Heat map of student class sizes on the IU campus. [22] Leise Neel Jansen, Michael Berliner Pedersen, and Otto Anker Nielsen.
Minimizing passenger transfer times in public transport timetables. In
Data was taken from the Indiana University’s Registrar’s 7th Conference of the Hong Kong Society for Transportation Studies,
Office. Different areas of campus have vastly varying class Transportation in the information age, Hong Kong, pages 229–239,
sizes, both aggregated across all times of day as this figure is 2002.
[23] Natalia Kliewer, Taieb Mellouli, and Leena Suhl. A time–space
or per hour. We can use this data to leverage improvements network based exact optimization model for multi-depot bus scheduling.
to the public transportation schedule determined from where European journal of operational research, 175(3):1616–1627, 2006.
and when on campus the student traffic is originating from. [24] Wen-Tai Lai and Ching-Fu Chen. Behavioral intentions of public transit
passengersthe roles of service quality, perceived value, satisfaction and
involvement. Transport Policy, 18(2):318–325, 2011.
[25] ElGeneidy Ahmed M., Horning Jessica, and Krizek Kevin J. Analyzing
[2] American Planning Association. Planning and urban design standards. transit service reliability using detailed data from automatic vehicular
John Wiley & Sons, 2006. locator systems. Journal of Advanced Transportation, 45(1):66–79,
[3] M. Hadi Baaj and Hani S. Mahmassani. An ai-based approach for transit 2011.
route system planning and design. Journal of Advanced Transportation, [26] K. R. MacCrimmon and M. Toda. The Experimental Determination of
25(2):187–209, 1991. Indifference Curves. Review of Economic Studies, 36(4):433–451, 1969.
[4] Alexandre Barra, Luis Carvalho, Nicolas Teypaz, Van-Dat Cung, and [27] James H Miller. Transportation on college and university campuses.
Ronaldo Balassiano. Solving the Transit Network Design problem Number Project J-7, Topic SA-11. 2001.
with Constraint Programming. In 11th World Conference in Transport [28] IU Office of Sustainability. Transportation: Student commuting.
Research - WCTR 2007, pages –, Berkeley, United States, June 2007. http://sustain.iu.edu/overview/indicators/transportation.php, 2012.
[5] Avishai Ceder. Transit scheduling. Journal of advanced transportation, [29] Amer Shalaby and Ali Farhan. Prediction model of bus arrival and
25(2):137–160, 1991. departure times using avl and apc data. Journal of Public Transportation,
[6] Avishai Ceder. Urban transit scheduling: framework, review and exam- 7(1):3, 2004.
ples. Journal of Urban Planning and Development, 128(4):225–244, [30] Jau-Ming Su, Nomungerel Erdenebat, Liang-Hua Ho, and Yu-Ting Zhan.
2002. Integration of transit demand and big data for bus route design in taiwan.
[7] Avishai Ceder and Nigel H.M. Wilson. Bus network design. Trans- pages 19–26, 2016.
portation Research Part B: Methodological, 20(4):331 – 344, 1986. [31] Yannis Tyrinopoulos and Constantinos Antoniou. Public transit user
[8] Partha Chakroborty. Genetic algorithms for optimal urban transit satisfaction: Variability and policy implications. Transport Policy,
network design. Computer-Aided Civil and Infrastructure Engineering, 15(4):260–272, 2008.
18(3):184–200, 2003. [32] Niels van Oort, Daniel Sparing, Ties Brands, and Rob M. P. Goverde.
[9] Partha Chakroborty, Kalyanmoy Deb, and P. S. Subrahmanyam. Data driven improvements in public transport: the dutch example. Public
Optimal scheduling of urban transit systems using genetic algo- Transport, 7(3):369–389, Dec 2015.
rithms. Journal of Transportation Engineering, 121(6), Issue: ob- [33] Quentin K. Wan and Hong K. Lo. A mixed integer formulation
ject: doi:10.1061/jtpedi.1995.121.issue-6, revision: rev:1479464639368- for multiple-route transit network design. Journal of Mathematical
29611:doi:10.1061/jtpedi.1995.121.issue-6, . Modelling and Algorithms, 2(4):299–308, 2003.
[10] Mei Chen, Xiaobo Liu, and Jingxin Xia. Dynamic prediction method [34] Michael E Williams and Kathleen L Petrait. U-pass: A model transporta-
with schedule recovery impact for bus arrival time. tion management program that works. Transportation Research Record,
[11] John Daggett and Richard Gutkowski. University transportation survey: (1404), 1993.
Transportation in university communities. Transportation Research [35] Anthony Wren and David O Wren. A genetic algorithm for public trans-
Record: Journal of the Transportation Research Board, (1835):42–49, port driver scheduling. Computers & Operations Research, 22(1):101–
2003. 110, 1995.
[12] Andr de Palma and Robin Lindsey. Optimal timetables for public trans- [36] Fang Zhao and Xiaogang Zeng. Optimization of transit route network,
portation. Transportation Research Part B: Methodological, 35(8):789 vehicle headways and timetables for large-scale transit networks. Euro-
– 813, 2001. pean Journal of Operational Research, 186(2):841 – 855, 2008.