key: cord-0521889-jnlpfryn authors: Sugishita, Kashin; Masuda, Naoki title: Recurrence in the evolution of air transport networks date: 2020-05-29 journal: nan DOI: nan sha: a83be9d5b763cfec240bb063b384e0a85958b612 doc_id: 521889 cord_uid: jnlpfryn Changes in air transport networks over time may be induced by competition among carriers, changes in regulations on airline industry, and socioeconomic events such as terrorist attacks and epidemic outbreaks. Such network changes may reflect corporate strategies of each carrier. In the present study, we propose a framework for analyzing evolution patterns in temporal networks in discrete time from the viewpoint of recurrence. Recurrence implies that the network structure returns to one relatively close to that in the past. We applied the proposed methods to four major carriers in the US from 1987 to 2019. We found that the carriers were different in terms of speed of network evolution, strength of periodicity, and changes in these quantities across decades. We also found that the network structure of the individual carriers abruptly changes from time to time. Such a network change reflects changes in their operation at their hub airports rather than famous socioeconomic events that look closely related to airline industry. The proposed methods are expected to be useful for revealing, for example, evolution of airline alliances and responses to natural disasters or infectious diseases, as well as characterizing evolution of social, biological, and other networks over time. In the US, the Airline Deregulation Act was established in 1978, which removed the governmental controls over fares, routes, and market entries of new air carriers (carriers for short) 1 . After the establishment of the law, the formerly regional carriers and completely new carriers grew in the market and increased the level of competition among carriers 2 . The deregulation also caused the emergence of low cost carriers (LCCs). In contrast to full service carriers (FSCs), an LCC offers low fares without many traditional passenger services. The growth of LCCs, especially Southwest Airlines in the case of the US, which is the largest LCC in the world in terms of passenger traffic 3 , elicited a dramatic downward pressure on fares 4 . After the deregulation in the US, similar airline deregulations and subsequent elevated competition among carriers have been duplicated around the world including in Europe 5 and Asia 6 . In addition to fierce competition among carriers, the airline industry has been exposed to a highly competitive environment partly owing to various socioeconomic events. For example, the 9/11 attacks in 2001 had significant impacts on the US airline industry. This event resulted in a negative demand shock of more than 30% 7 . The US airline industry lost nearly $35 billion between 2001 and 2005 in total 2 . In fact, a number of carriers filed liquidation bankruptcies (so-called Chapter 7 bankruptcy) or reorganization bankruptcies (so-called Chapter 11 bankruptcy) including major carriers such as American Airlines, United Airlines, and Delta Air Lines 8 . Moreover, the US airline industry has faced massive restructuring with a number of mergers such as the merger of American Airlines and US Airways, and of Delta Air Lines and Northwest Airlines 9 . In Europe, the volcanic eruptions in Iceland in 2010 had a devastating effect on the European air transport network, preventing air travel throughout most of Europe for six days 10 . Overall, business environments in the airline industry have been tough on the global scale. For example, the International Air Transport Association (IATA) announced that year 2019 was the smallest in freight demand in the world since 2009 11 . As of 2020, the airline industry is facing a financial crisis because of the COVID-19 pandemic. The Europe's biggest regional carrier of the time, Flybe, went bankrupt in March 2020, and many carriers may follow 12 . The airline industry is highly susceptible to such socioeconomic and natural events, and these events may influence the structure of air transport networks. Regardless of the reason, air transport networks are temporal networks 13 ; in various types of networks such as social and economic networks, nodes and/or edges change over time, constituting temporal networks [14] [15] [16] [17] . In fact, air transport networks consist of multiple carriers. Each carrier has its own corporate strategies, which may change over time depending on evolution of the market and knowledge that the carrier gains by learning from databases (e.g., about customers) 18 . These time-varying corporate strategies of carriers may affect the evolution of their network structure. A number of studies have analyzed structures of air transport networks. Most of these studies have examined static properties of air transport networks such as the degree distribution, average path length, clustering coefficient, assortativity, and centrality [19] [20] [21] [22] [23] [24] [25] . In recent years, several studies have investigated evolution of air transport networks. For example, in an air transport network in Brazil, the number of nodes and edges decreased from 1995 to 2006, whereas the number of passengers and the amount of cargo increased during the same period 26 . In contrast, the connectivity of an air transport network in China increased from 1930 to 2012 27 . These and other 13, [28] [29] [30] studies are concerned with the evolution of air transport networks at the country, continental, or global level and do not distinguish networks by carriers. In fact, each carrier may be evolving in different manners, reflecting its own corporate strategies. Because carriers compete for a limited traffic demand, some carriers expanding their networks may cause other carriers to shrink their networks. Even if the structure of an entire network combining different carriers is relatively stable 31 , each carrier may be drastically changing its network structure over time. To date, time-dependent structure of air transport networks at the carrier level is not well understood 32 , while a few studies have investigated static properties of airline networks by carriers 33, 34 . In the present study, we propose a novel framework for analyzing evolution of air transport networks. In particular, we examine temporal air transport networks from the viewpoint of recurrence. Recurrence implies that the structure of the network returns to one close to that of the same or different network in the past. Recurrent plots and recurrence quantification analysis are a set of data analysis methods to quantify recurrence, which was originally proposed for time series 35 and recently extended to the case of temporal networks 36, 37 . While we do not employ recurrence plots or recurrence quantification analysis in the exact sense in the present study, we will use the idea of recurrence to examine when air transport networks approach their past structure relatively closely as compared to in other times. We apply the proposed framework to four major carriers in the US, i.e., American Airlines (AA), United Airlines (UA), Delta Air Lines (DL), and Southwest Airlines (WN), from 1987 to 2019. We use a data set available at the US Bureau of Transportation Statistics. The data are based on the reports made by certified carriers that account for at least one percent of the domestic scheduled passenger revenue. By definition, a certified carrier holds an operating certificate issued by the US Secretary of Transportation. The data set contains the origin and destination airports of each flight, departure and arrival times, operating carriers, and so on. We downloaded monthly data between October 1987 and August 2019 38 . We focus on air transport networks for the four most major carriers in the US in terms of revenue, i.e., AA, UA, DL, and WN. According to the US Bureau of Transportation Statistics, domestic revenue passenger-miles of each of these four carriers in 2019 is over 100 billions. Alaska Airlines, ranked fifth, earned domestic revenue passenger-miles less than 50 billions in 2019. For each carrier, we construct monthly snapshot networks from October 1987 to August 2019, in which a node is an airport. We connect two nodes by an edge if there is at least one direct commercial flight between the two airports in the month. For simplicity, we assume that each snapshot network is unweighted and undirected. To measure the dissimilarity between two snapshot networks, we define the normalized network distance (network distance for short) between two networks G and G as where M(G) and M(G ) are the numbers of edges in G and G respectively, and M(G ∩ G ) is the number of edges that G and G have in common. Network distance d ranges between 0 and 1. The distance matrix for a carrier is a t max × t max symmetric matrix of which the (t,t )th entry is given by d(G t , G t ), where G t is the network at month t and t max is the number of months observed. To measure differences in the distance matrices, we conduct statistical tests on distributions of d. First, we use the Kruskal-Wallis test, which is an nonparametric statistical test to assess the differences among three or more groups 39 . We used "Kruskal" function in R (ver. 3.6.3). Second, as a post-hoc test, we use the Bonferroni-corrected Mann-Whitney U test 40 to compare all pairs of groups. We used "wilcox.test" function in R (ver. 3.6.3). In addition to the p values, we measure its effect size r, which is the Z value obtained from the "wilcox.test" function divided by the square root of the number of observations 41 . According to a standard, the effect size is said to be strong, medium, or small when r > 0.5, r > 0.3, or r > 0.1, respectively 42 . To measure the similarity between air transport networks across time, we define an autocorrelation function (ACF) for a given carrier by where τ is the time lag. Note that Eq. (2) is not a proper autocorrelation function. However, in analogy to the Wiener-Khinchin theorem 43 , we define a power spectral density (PSD) by the Fourier transform of the ACF as follows: where f is the frequency. The PSD shows the power of periodic variations. We identify states of the temporal air transport networks for each carrier based on its distance matrix. Consider a sequence of t max static networks. To assign a state to each of the t max snapshot networks, we apply a hierarchical clustering algorithm to the t max × t max distance matrix 36 . We used "linkage" and "fcluster" functions in "scipy" module in Python. To define the distance between clusters, we used "ward" in the "linkage" function. The hierarchical clustering provides partitions of the monthly networks into C discrete states, where the number of states, C, ranges between 1 and t max . We adopt the value of We show the evolution of the numbers of nodes and edges for each carrier in Figure 1 . The three FSCs (i.e., AA, UA, and DL) experienced the Chapter 11 bankruptcies. This is a form of bankruptcy that permits reorganization under the bankruptcy laws. A debtor proposes a plan of reorganization, and it can exit from the bankruptcy if it completes the proposed plan. The dashed lines in Figure 1 represent the times of the Chapter 11 bankruptcies (shown in red) and exits from them (shown in yellow). Additionally, we show the times of the 9/11 attacks and mergers by the dotted-dashed lines and dotted lines, respectively. The three FSCs show some common patterns. Between the middle of the 1990s and around 2010, the numbers of nodes and edges largely decreased. After the 9/11 attacks, the three carriers experienced the bankruptcies. Then, after they exited from the bankruptcy, they merged with other carriers. Upon the mergers, the number of edges almost doubled suddenly. There was no notable change in the numbers of nodes and edges in response to the bankruptcies and the exits from them. In contrast, WN, which is an LCC, has never experienced bankruptcy. The numbers of nodes and edges in the WN network have roughly monotonically increased over the two decades despite the 9/11 attacks and the merger. In the next sections, we further study the structural evolution of the four carriers. In Figure 2 , we show the distance matrices for the four carriers. The figure suggests that the three FSCs considerably changed their network structure upon the mergers after they exited from the bankruptcies (i.e., the merger between AA and US in 2015 in Figure 2 (a), that between UA and CO in 2012 in Figure 2 (b), and that between DL and NW in 2010 in Figure 2 (c)). These results are consistent with the sudden increases in the numbers of edges shown in Figures 1(a), (b) , and (c). However, the distance matrices additionally indicate that the network structure after the merger is considerably different from that before the merger for each FSC. For example, the DL network after the merger with NW in 2010 is considerably different from that in the late 1990s, although the numbers of nodes and edges are similar at the two time points (Figure 1(c) ). We also notice that the distance matrix for WN (Figure 2 (d)) has many bright horizontal and vertical stripes after 1995. They suggest that WN has frequently adapted their network, but for a short duration each time such as a single month. In Figure 3 , we show the box plots of the distribution of the network distance between pairs of months. Each box plot summarizes each distance matrix shown in Figure 2 . To be quantitative, we carry out statistical analyses on these distributions. First, we carry out the Kruskal-Wallis test to find that there is a significant difference among the four distributions shown in Figure 3 (p < 0.001). Second, as a post-hoc test, we run the Bonferroni-corrected Mann-Whitney U test to compare all six pairs of carriers. We find that each pair of carriers is significantly different, partly owing to large sample sizes (AA vs UA: Figure 2 , i.e., WN shows unique patterns and has kept changing the network structure more considerably than the other three carriers. Additionally, DL changed its structure significantly more than AA and UA did over time. In Figure 4 (a), we show the ACFs for the four carriers, calculated from the distance matrices shown in Figure 2 . This figure indicates that networks closer in time are more similar to each other. We also observe that the ACF for WN is the smallest for all the lag τ > 0, confirming that the WN network tends to vary over time to a larger extent than the other carriers. In addition, Figure 4 (a) reconfirms that the DL network tends to vary over time more than the AA and UA networks do. The ACFs for the FSCs have local maxima at multiples of 12 months, suggesting annual periodic patterns. To quantify the strength of annual cycles, we show the PSDs for the four carriers in Figure 4 A value of s larger than 1 suggests some periodicity with frequency f = 1/12. We show s for each carrier in Figure 5 . The figure suggests that AA has the strongest annual periodicity, followed by UA and DL. The value of s for WN is less than 1, suggesting the lack of periodicity. (a) (b) In this section, we examine states of temporal air transport networks for each carrier. In Figures 1 and 2 , we observed that the network structure for the three FSCs drastically changed upon the mergers after they exited from the bankruptcy. If we apply our algorithm to detect states of temporal networks to the entire observation, we detect two states for each FSC, one state corresponding to the time window before the merger and another state after the merger. However, a visual inspection of Figure 2 suggests that we may benefit from distinguishing multiple states for the time period before the merger. Therefore, for each FSC, here we apply the hierarchical clustering algorithm to detect state dynamics to the distance matrix of the temporal 6/15 network restricted to before the post-bankruptcy merger. We regard the snapshot networks after the post-bankruptcy merger as one distinct state. For WN, we applied the algorithm to detect system states to the entire observation window because the WN network has not considerably changed upon the merger with FL in 2015 up to our visual inspections of Figures 1(d) and 2(d) . We show the detected state dynamics for the four carriers in Figure 6 . For all the four carriers, we detected a change point at January 1995. We identified three states for AA and UA, and five states for DL. The detected changes for DL from state 2 to 3 and state 3 to 4 are located at January 2001 and December 2005, respectively. For WN, we detected two states. Differently from the other carriers, state 1 for the WN network is not consecutive in time, i.e., state 1 reappears in October 1995 for a month after the state transition on January 1995. We notice that the detected change point at December 2005 for DL is three months after the bankruptcy. However, except for this instance and the state after the post-bankruptcy mergers for the FSCs, the detected change points coincide with neither the 9/11 attacks (i. To understand differences in the network structure across the states, we visualize deleted and added edges for each pair of consecutive states. We define deleted edges as the edges that exist at least once in the older state and never exist in the newer state. The added edges are defined as edges that never exist in the older state and exist at least once in the newer state. We show the results in Figure 7 . The deleted and added edges are shown in red and blue, respectively. Figure 7 (a) indicates that in transition to state 2, AA withdrew edges especially at BNA (Nashville International Airport) and RDU (Raleigh-Durham International Airport) and added edges at STL (St. Louis Lambert International Airport) and LAX (Los Angeles International Airport). These results are consistent with the fact that AA closed two hub airports, BNA and RDU, in the middle of the 1990s and merged with TW in 2002 whose major airports were STL and LAX. Figure 7(b) indicates that, in transition to state 3, AA withdrew edges by the largest number at STL and added many edges at CLT (Charlotte Douglas International Airport), PHX (Phoenix Sky Harbor International Airport), and PHL (Philadelphia International Airport). These results are consistent with the fact that AA ceased to use STL as a hub around 2010 and merged with US whose major airports were CLT, PHX, and PHL. Figure 7 (c) indicates that, in transition to state 2, UA changed many routes connecting to IAD (Washington Dulles International Airport), which has been a hub airport for UA. Figure 7(d) indicates that, in transition to state 3, UA added edges by the largest numbers at IAH (George Bush Intercontinental Airport) and EWR (Newark Liberty International Airport). This event originated from the merger with CO whose major airports had been IAH and EWR. Figure 7 (e) and 7(f) indicate that, in transition from state 1 to 2 and from state 2 to 3, DL changed many edges connecting to some airports such as MCO (Orlando International Airport), CVG (Cincinnati/Northern Kentucky International Airport), and ATL (Hartsfield-Jackson Atlanta International Airport). Figure 7(g) shows that, in transition to state 4, DL withdrew edges by the largest number at DFW (Dallas/Fort Worth International Airport). Figure 7 (h) is consistent with the merger with NW whose major airports were MSP (Minneapolis-Saint Paul International Airport) and DTW (Detroit Metropolitan Airport). Figure 7 (i) indicates that, in transition to state 2, WN expanded the network by adding many edges at some airports such as MDW (Midway International Airport), DEN (Denver International Airport), and MCO. In this section, we examine the speed and possible annual periodicity of network changes, which themselves may change on the timescale of decades. We measure the ACF and PSD for each state and each carrier. Because state 1 for WN is not consecutive in time, we exclude the single isolate month in state 1 (i.e., October 1995) in the following analysis. We show the ACFs and PSDs for each state and each carrier in Figure 8 . For AA and UA, the ACF is the largest in state 2, followed by state 1, and then by state 3, for all the lag values (Figures 8(a) and 8(c) ). This result indicates that the speed of network change is slower in state 2, which roughly spans form 1995 to 2013 for both carriers. In contrast, the ACF for DL and WN tends to be smaller at later times, at least up to τ = 2 years of the lag (Figures 8(e) and 8(g) ). This result indicates that network evolution accelerates over time for DL and WN. The PSDs for some combinations of the state and carrier have a local maximum at f = 1/12. We measure the strength of periodicity in the same manner as Eq. (5), i.e., by comparing PSD(1/12) with the middle point of the two neighbors, i.e., PSD(1/18) and PSD(1/9). It should be noted that the largest lag assumed in Figure 8 is 30 as opposed to 50 in Figure 4 , which results in different f values for the two neighbors of f = 1/12 (i.e., f = 1/18 and 1/9) as compared to those for Figure 4 . We show the strength of periodicity for the different states and carriers in Figure 9 . The figure indicates that all the four carriers did not have a clear periodicity before 1995 (corresponding to state 1 for each carrier). After 1995, all carriers strengthened the periodicity. In the latest states, UA and DL have stronger periodicity than AA and WN. In this section, we examine similarity in the network structure across different carriers. We calculate the distance matrix for each pair of carriers by measuring the normalized network distance between a network at month t for a carrier and a network at month t for another carrier, where 1 ≤ t, t ≤ t max . We show the distance matrices in Figure 10 . The figure indicates that the AA and UA networks are apparently more similar to each other than the other carrier pairs are on average. The box plots for the entries of the network distance matrix for each pair of carriers are shown in Figure 11 . Consistent with Figure 10 (a), the AA and UA networks seem to be more similar to each other on average than other pairs. We then carried out the Kruskal-Wallis test and found that there was a significant difference in these distributions (p < 0.001). The Bonferroni-corrected Mann-Whitney U test as a post-hoc test revealed that there were significant differences in the p-values for all 15 In particular, the network distance values for AA-UA are smaller than those for any other carrier pairs with strong effect sizes (r > 0.5). The network distance values for AA-DL are also smaller than the other carrier pairs except for the AA-UA pair with strong effect sizes (r > 0.5). These results indicate that these two carrier pairs are more similar than the other carrier pairs, although they are still dissimilar because the network distance values are around 0.8. To search for the origin of the observed similarities and dissimilarities between carrier pairs, we show the time series of the total number of edges shared by two carriers in each month and the number of such shared edges connecting to some major airports in Figure 12 . Figure 12 (a) indicates that the relatively large similarity in the entire observation period between AA and UA originates from the shared edges connecting to ORD (O'Hare International Airport), which has been a major hub for both AA and UA. Figure 12 (b) indicates that AA and DL became more dissimilar around 2005 because DL closed a hub at DFW, which we also observed in Figure 7 (g). Figure 12 (c) indicates that AA and WN became more similar around 2015. This is because AA merged with US, one of whose major airports had been PHX, as we also observed in Figure 7 (b). Figure 12(d) indicates that most of the edges that UA and DL share originate from the operations at LAX. Figures 12(e) and 12(f) indicate that WN became more similar to UA and DL because WN increased the number of edges at DEN and ATL, which we also observed in Figure 7 (i). These results indicate that large part of the similarity in each carrier pair originates from operations at a single airport shared by the two carriers. We proposed a framework for analyzing evolution patterns in temporal air transport networks. We applied the proposed methods to the four largest carriers in the US, i.e., AA, UA, DL, and WN, between 1987 and 2019. We found that each carrier experienced abrupt changes in their network structure. We also found that the carriers were different in terms of the speed of network evolution, the strength of periodicity, and changes in these quantities across decades. All the four carriers drastically changed their network structure in January 1995. In addition, they strengthened the periodicity after 1995. We also showed that the three FSCs (i.e., AA, UA, and DL) substantially changed their network structure upon the post-bankruptcy mergers. On the other hand, we did not detect notable changes in the network structure upon the 9/11 attacks, the bankruptcies, and the exits from the bankruptcies, except for a change point for DL three months after the bankruptcy. There are different possible mechanisms that may have produced these results. First, these events may have triggered drastic changes in the network structure, but on a shorter time scale than a month. Analysis of weekly or daily snapshot networks may be fruitful; our methods are not selective of the time scale, and detailed flight data for the US carriers with a time resolution of minutes is available at the US Bureau of Transportation Statistics 38 . Second, only the number of flights between airports may have primarily changed in response to such events. Although we have assumed unweighted networks in the present article, if one modifies the network distance measure to accommodate weighted networks, the methods are directly applicable to the case of weighted temporal networks. These research directions warrant future work. We showed that WN, which is an LCC, has shown particularly distinct patterns of evolution over years as compared to the FSCs, such as a consistent growth trend in terms of the number of nodes and edges, relatively fast changes in the network structure, and lack of periodicity when the data from all years are combined. To generalize these results to differences between LCCs and FSCs, one needs more thorough analyses of different carriers, such as LCCs and FSCs in Europe and Asia, where LCCs have been rapidly growing. We opted to study the US case in the present study because of the availability of the data. Our data-analytic framework is also applicable to different contexts. For example, an application to airline alliances (e.g., Oneworld, SkyTeam, and Star Alliance) may reveal influences of entries and exits of carriers on the networks of airline alliances over years. Another direction is to study influences of natural disasters on the structure of air transport networks. For example, ash cloud caused by the volcanic eruption in Iceland in 2010 affected the structure of the European air transport networks 45, 46 . Another possible application is to study influences of epidemic outbreaks including the COVID-19 pandemic, which is ongoing as of mid 2020. Obviously, carriers have been slashing down flights (corresponding to the edge weight) and edges due to the drop in the demand. In addition to this, some carriers may be strategically maintaining their network structure to be as similar as possible, or perhaps the converse, to that before COVID-19. Some carriers may also be changing the similarity of their network to other carriers' networks. Finally, the proposed methods including the analysis of periodicity are not specific to air transportation networks such that they are applicable to temporal networks in general. The Politics of Airline Deregulation The good, the bad, and the ugly: 30 years of us airline deregulation Fare impacts of southwest airlines: A comparison of nonstop and connecting flights Airline competition and domestic us airfares: A comprehensive reappraisal Competition in the european aviation market: the entry of low-cost airlines Low-cost carriers in asia: Deregulation, regional liberalization and secondary airports Assessing the impact of the september 11 terrorist attacks on us airline demand Bankruptcy and product-market competition: Evidence from the airline industry Mergers and service quality in the airline industry: A silver lining for air travelers? Eyjafjallajökull volcano eruption-a brief approach IATA: Worst year for air cargo since 2009 financial crisis Flybe's collapse could be 'first of many' airlines Dynamics of air transport networks: A review from a complex systems perspective Modern temporal network theory: A colloquium A Guide to Temporal Networks Temporal Network Theory Airline strategy: From network management to business models Structural properties of us flight network Modeling the world-wide airport network The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles Topology of the italian airport network: A scale-free small Analysis of the airport network of india as a complex weighted network Exploring the structure of the us intercity passenger air transportation network: a weighted complex network approach A complex network analysis of the united states air transportation Structural evolution of the brazilian airport network Evolution of air transport network of china 1930-2012 The evolving structure of the southeast asian air transport network through the lens of complex networks Temporal evolution analysis of the european air transportation system: air navigation route network and airport network The accelerated growth of the worldwide air transportation network The evolving network structure of us airline system during 1990-2010 Study of the topology and robustness of airline route networks from the complex network approach: a survey and research agenda Network topology and correlation features affiliated with european airline companies Network measures in civil air transport: a case study of lufthansa Recurrence plots for the analysis of complex systems Detecting sequences of system states in temporal networks Recurrence quantification analysis of dynamic brain networks Airline on-time performance databases Use of ranks in one-criterion variance analysis On a test of whether one of two random variables is stochastically larger than the other Parametric measures of effect size Statistical Power Analysis for the Behavioral Sciences Cybernetics or Control and Communication in the Animal and the Machine A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters The vulnerability of the european air traffic network to spatial hazards Increasing the resilience of air traffic networks using a network graph theory approach