key: cord-0542403-72pfwmu4
authors: Ruijter, Arjan de; Cats, Oded; Alonso-Mora, Javier; Hoogendoorn, Serge
title: Ride-Pooling Matching with a Compensatory Cost Function: Implications for Adoption, Efficiency and Level of Service
date: 2021-07-14
journal: nan
DOI: nan
sha: 211b321b90911631307efcb6c94d1efa7abe974b
doc_id: 542403
cord_uid: 72pfwmu4

By utilising vehicle capacity more efficiently, ride-pooling platforms can potentially lead to reduced congestion levels without adversely prolonging travel times. While previous studies concluded that shared rides can offer substantial benefits, initial evidence suggests low adoption levels. We postulate that previous studies that investigated the potential of ride-pooling failed to account for the trade-off that users are likely to make when considering a shared ride. We address this shortcoming by formulating user net benefit stemming from sharing as a compensatory function where the additional travel time and on-board discomfort need to be compensated by the price discount for a traveller to choose a shared ride over a private ride. The proposed formulation is embedded in a method for matching travel requests and vehicles. We conduct a series of experiments investigating how the potential of ride-pooling services depends on travel demand characteristics, user preferences and the pricing policy adopted by the service provider. In particular, the impact of various behavioural settings in terms of users' willingness to share their ride and delay aversion on service adoption and its operational efficiency is assessed. Our results suggest that the total vehicle mileage savings found by previous studies is only attainable when users are very willing to share their ride (i.e. attach low premium to private rides) and are offered a 50% discount for doing so. We find ride-pooling transportation distance savings as low as 15% in less favourable behavioural scenarios.

Recent developments in communication and information technologies have led to the rise of real-time, on-demand ride-pooling platforms like UberPool and ViaVan. Pre-pandemic, more than 100,000 trips per day on average were booked using a ride-pooling application in New York alone (Schneider, 2021). Users of those platforms consent to sharing a vehicle with travellers heading in a similar direction, and accept small route deviations to pick up and drop off co-riders. By utilising vehicle capacity more efficiently, ride-pooling platforms can potentially lead to reduced congestion levels (Cici et al., 2013; X. Wang et al., 2016).

The majority of the literature that quantified the societal benefits of ride-pooling indeed concluded that its introduction is expected to yield promising results (H. Wang & Yang, 2019). A study by Ma et al. (2013), for example, stated that if the fleet of taxis in Beijing had allowed for shared rides in 2011, with users assumed to accept a maximum extra travel time of 5 minutes for their ride, 25% more users could have been served and 13% of the total vehicle distance could have been avoided. Another study analysed ride-pooling for up to three pooled requests using a graph representation termed a shareability graph and concluded that the total vehicle distance of taxis in New York could have been reduced by 32% (Santi et al., 2014). Concurring evidence is offered by Qian et al. (2017), who show that with appropriate incentives total taxi vehicle mileage in New York can be reduced by 47%, while a 46% and 29% mileage reduction is feasible in central areas of Wuhan and Shenzhen, respectively. When ride-pooling is offered with high-capacity vehicles of up to ten seats, less than one fifth of the size of the current taxi fleet of New York City can serve 98% of the original requests with a maximum delay of 3.5 minutes per passenger (Alonso-Mora et al., 2017). While the previously mentioned studies focused on ride-pooling in high-density metropolitan areas, Tachet et al. (2017) assert that cities with lower densities also have the potential to obtain substantial efficiency gains by substituting individual taxi rides with ride-pooling.

Whether ride-pooling can live up to the potential indicated by the above studies is highly uncertain. For example, the aforementioned studies have strictly analysed the existing demand for taxi services and assumed that ride-pooling services will substitute single-person taxi rides only. However, ride-pooling is also found to substitute other modes, like public transit and active modes (Rayle et al., 2016). This may lead to increased road traffic volumes. Moreover, as a result of low ride fares, the operation of a ride-pooling service may not be viable or may require excessive subsidisation, and thus never fully materialise or scale. At the same time, the inconvenience associated with sharing a vehicle with strangers can be an important deterrent for potential users and therefore hinder a wide-scale adoption of ride-pooling services. In New York for example, prior to its shutdown in the wake of the COVID-19 pandemic, only 7% of Uber's rides were made using its ride-pooling service UberPool (Schneider, 2021). User inconvenience may arise from a lack of privacy (Dueker et al., 1977; Teal, 1987), a feeling of dependence and a fear of having negative social interactions with other users (Correia & Viegas, 2011; Morales Sarriera et al., 2017). In the recent pandemic, virus exposure has emerged as an additional concern for shared mobility (Hensher, 2020). Given that ride-pooling efficiency depends on mutual compatibility of trip requests, a low willingness to share is highly detrimental to the societal benefits of a ride-pooling system. In Toronto for example, a rider was successfully matched to another rider in only 18% of all ride-pooling trips that took place in September 2018 (City of Toronto, 2019).

A possible explanation for the large discrepancy between the societal benefits of ride-pooling in theory and what has so far been observed in practice is that previous ride-pooling studies fail to account for the complex trade-off that users are likely to make when considering a shared ride. In all of the aforementioned studies, users are assumed to opt for the ride-pooling service as long as the expected delay, i.e. later pick-up time and en-route detour, does not exceed a certain pre-specified threshold. Hence, no inherent deterrence from using ride-pooling was assumed, all else being equal. In other words, in previous studies travellers were assumed to be intrinsically motivated to share a ride, without requiring any form of compensation and even accepting a delay. The objective of this study is to explicitly account for the trade-off users can make between the discount offered for sharing their ride and the travel impedance that it induces. We formulate user net benefit due to sharing as a compensatory function where the additional travel time and on-board discomfort must be compensated by the shared ride discount for a traveller to choose a shared ride over a private one. This allows us to assess the potential adoption of ride-pooling by considering the two key barriers (Lavieri & Bhat, 2019). In order to assess service performance and level of service, we embed our user benefit formulation in the method for matching travel requests and vehicles introduced by Alonso-Mora et al. (2017).

The potential of ride-pooling services is expected to vary greatly across markets, depending on travel demand characteristics, user preferences and the pricing policy adopted by the service provider. Our approach enables the analysis of system performance under various settings while accounting for user preferences and their possible variation. We conduct a series of experiments that includes investigating the impact of various behavioural settings in terms of (a) users' willingness to share their ride and (b) their delay aversion on service adoption and its operational efficiency. The incorporation of a cost-benefit trade-off at the individual passenger level also allows us to outline implications for the design of an effective discount structure to boost ride-pooling adoption and consequently reduce the total vehicle distance on the road.

The remainder of the paper is structured into four sections. Section 2 provides a detailed description of our methodology. This is followed by details on the design of the numerical experiment in Section 3. The results for the experiments are presented and discussed in Section 4. The paper is concluded by stating the main conclusions that can be drawn in relation to the effect of users' behavioural preferences, the spatial distribution of demand and the pricing mechanism on the performance of a ride-pooling service (Section 5).

The assignment of passenger requests to ride-pooling vehicles over a period of time enables the assessment of the total vehicle movement and service quality obtained by a ride-pooling service. There are several approaches for the real-time assignment of requests to vehicles. As a way of dealing with the large solution space in ride-pooling assignment, in early assignment approaches, such as the one developed by Ma et al. (2013), incoming requests were individually allocated to vehicles using a greedy algorithm. Santi et al. (2014) introduced the concept of shareability graphs to systematically analyse the mutual compatibility of two requests using a graph representation so that assignment can be performed with traditional graph-solving optimisation methods. A follow-up study extended this graph-based approach by introducing additional graph representations to allow for bundling requests and therefore compose high-occupancy ride-pooling trips (Alonso-Mora et al., 2017). Their request-group-vehicle (RGV) graph constitutes the composition of request groups and the vehicle that may serve each request group, representing the assignment problem as an Integer Linear Problem (ILP). A subsequent follow-up study by Simonetto et al. (2019) further reduced the complexity of graph-based assignment approaches, without a significant loss of service performance. Agent-based models (ABM) have also been used to study ride-pooling, whereby users and vehicles are modelled as agents that dynamically interact (Fiedler et al., 2018; Winter et al., 2018).

In this study we adopt the graph-based approach of Alonso-Mora et al. (2017), which offers the capability to model real-time ride-pooling with more than two passengers on board the same vehicle. Contrary to a greedy approach, in which travellers are instantly assigned to an available vehicle when making a request, trip requests are collected over a short period of time (in the order of seconds or minutes) and only assigned at the end of this interval. When given enough computation time, each assignment iteration will yield an optimal solution for the current set of requests, meaning that although the initial waiting time for travellers is longer, the approach can yield a shorter total travel time.

The approach contains a few implicit assumptions about ride-pooling operations. Firstly, supply is assumed to be centrally controlled: drivers are fully compliant with central instructions regarding which requests to serve, the order of pick-ups and drop-offs in a pooled ride, and the route to follow. Private and pooled services use the same road infrastructure, i.e. there are no HOV lanes for ride-pooling vehicles. To limit computational complexity in the approach, it is assumed that ride-pooling operations have no effect on the traffic conditions in the network. Travel times in the network are fully predictable and can thus be precomputed. On the user side, the approach assumes that requests can be subject to assignment in consecutive iterations. Assigned travellers that have not yet been picked up can be reassigned if it improves their level of service. Travellers with unsatisfied requests will be available for assignment until a certain time threshold is reached, i.e. when they run out of patience.

Before providing more details on our methodological contribution to the ride-pooling assignment procedure, we first describe the graph-based assignment process of Alonso-Mora et al. (2017), which forms the basis for our work and is subject to adaptations as highlighted in the subsequent sections.

In the graph-based approach of Alonso-Mora et al. (2017), which is visualised in Fig. 1, requests are matched with other requests as well as with vehicles at fixed intervals. Each assignment iteration consists of the following nine steps:

1. Establish the status of vehicles as well as the pool of requests to be matched to a vehicle. The latter includes new trip requests and unassigned requests from the previous iteration. Pending requests are removed when assignment is no longer feasible. Illustration in Fig. 1 : two empty vehicles, three requests.

2. Check which pairs of requests can be matched, assuming there is a vehicle located at the origin of one of the two requests.

Requests 1 and 2, and 2 and 3 form feasible request pairs.

3. Check which requests can be matched to which vehicles, considering vehicles' current location and on-board occupancy.

All requests can be served by vehicle 1, only request 2 can be served by vehicle 2.

4. Create a request-vehicle (RV) graph that stores the results of steps 2 and 3, with edges indicating that two requests, or a request and a vehicle, can be matched.

5. Check which groups of requests can be matched to which vehicles. To do so, first identify potentially feasible request group-vehicle matches based on cliques in the RV-graph, and then find whether there is a feasible route with which a vehicle can satisfy all requests in the group as well as passengers already on board. From the RV-graph it follows that a group consisting of requests 1 and 2, and a group consisting of requests 2 and 3 can potentially be served by vehicle 1. Besides, all 'groups' with only one request can potentially be served by vehicle 1. Finally, a 'group' consisting only of request 2 can potentially be served with vehicle 2. When checking the feasibility of these group-vehicle combinations, we find that there is no feasible route for vehicle 1 to satisfy the group consisting of requests 1 and 2.

6. Create a request-group-vehicle (RGV) graph: a graph with request, group and vehicle nodes, where an edge between a request and a group node indicates that a request is included in a request group, and an edge between a group and a vehicle node implies that a request group can be matched to a vehicle (as found in step 5). A label can be added to edges between group and vehicle nodes to represent the attractiveness or 'value' of the match. RGV-graph as shown in Fig. 1, with request (left), group (middle) and vehicle (right) nodes.

7. Decide which groups are assigned to which vehicles by translating the RGV-graph into an Integer Linear Problem. Multiple objective functions are possible, such as a maximisation of the number of assigned requests. The group consisting of requests 2 and 3 is assigned to vehicle 1. Vehicle 2 cannot serve request 1 and is left unassigned.

8. Decide whether idle vehicles will rebalance, and if so, where to.

(Unassigned) vehicle 2 moves in the direction of (unassigned) request 1.

9. Update vehicle schedules according to the assignment (step 7) and rebalancing (step 8) result. Vehicle 1 will pick up requests 2 and 3, vehicle 2 will rebalance towards the origin of request 1.

The next two subsections provide a more detailed description of the method, focusing on the steps where this study differs from previous work. In the following, we present our version of the request-group-vehicle (RGV) matching procedure for finding all feasible request group-vehicle combinations (Subsection 2.2, steps 2-6 in Fig. 1), and the subsequent procedure of assigning vehicles optimally to requests (Subsection 2.3, steps 7 and 8 in Fig. 1).

The matching procedure involves the creation of an RGV-graph to identify which group-vehicle combinations are feasible, given vehicle capacity and travel cost constraints. The matching process of Alonso-Mora et al. (2017) is divided into three steps to ensure that not all group-vehicle combinations have to be enumerated and checked for feasibility, which significantly reduces the complexity of the matching algorithm. In this subsection, after we introduce the functionality of each of these three steps along with our modifications thereof, we provide a detailed formulation of the cost function and explain how it is used in the algorithm that checks the feasibility of group-vehicle pairs.

The first of the matching steps (step 2 in Fig. 1) establishes whether two requests in the pool of available requests $R$ can share a ride in the most fortunate scenario in which there is an empty vehicle readily available at the location of one of those requests. With this step, the set of potentially feasible request groups $G$ can already be significantly reduced. The next step (step 3 in Fig. 1) checks whether a vehicle $v$ in the fleet of vehicles $V$ can serve a single request $r \in R$ given its current location and residual capacity (i.e. the number of available seats). The results of both steps can be combined and stored in an RV-graph (step 4) where edges indicate that two requests, or a request and a vehicle, can be matched. Each clique in the RV-graph represents a potentially feasible group-vehicle combination.
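As a minimal sketch of how the RV-graph narrows the candidate groups, the snippet below enumerates the request groups that form a clique with a given vehicle. The function name `candidate_groups` and the edge-set representation are illustrative assumptions, not the original implementation; the edges reproduce the Fig. 1 example as it is described in the text.

```python
from itertools import combinations

def candidate_groups(requests, rr_edges, rv_edges, vehicle, max_size):
    """Enumerate request groups that form a clique with `vehicle` in the RV-graph.

    rr_edges: set of frozensets {r1, r2} for feasible request pairs (step 2).
    rv_edges: set of (request, vehicle) tuples for feasible matches (step 3).
    """
    # Only requests that the vehicle can serve on their own can join a group.
    servable = [r for r in requests if (r, vehicle) in rv_edges]
    groups = []
    for size in range(1, max_size + 1):
        for group in combinations(servable, size):
            # A clique requires every pair of requests in the group to be compatible.
            if all(frozenset(pair) in rr_edges for pair in combinations(group, 2)):
                groups.append(group)
    return groups

# Fig. 1 example as described in the text: pairs (1, 2) and (2, 3) are feasible,
# vehicle 1 can serve all requests, vehicle 2 can serve only request 2.
rr = {frozenset((1, 2)), frozenset((2, 3))}
rv = {(1, "v1"), (2, "v1"), (3, "v1"), (2, "v2")}
groups_v1 = candidate_groups([1, 2, 3], rr, rv, "v1", max_size=3)
groups_v2 = candidate_groups([1, 2, 3], rr, rv, "v2", max_size=3)
```

These cliques are only candidates: whether a feasible route exists for each of them is still to be checked in step 5.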

Step 5 checks whether there exists a feasible sequence of stops $S_v$ for a vehicle $v$ to satisfy a group of requests $g \in G$, considering capacity and user level of service constraints. The RGV-graph consists of nodes representing the set of available requests $R$, the set of feasible request groups $G$ and the set of vehicles $V$. Each edge between a request $r \in R$ and a request group $g \in G$ has a label $a_{rg}$ indicating whether $r$ is included in $g$ ($a_{rg} = 1$) or not ($a_{rg} = 0$). Moreover, each edge connecting a request group $g \in G$ and a vehicle $v \in V$ has a label $b_{gv}$ indicating the sum of net sharing benefits of all requests in $g$ and passengers in $P_v$ given the optimal sequence of stops $S^{*}_v$. If a group-vehicle combination $gv$ is not feasible, $b_{gv}$ is assigned a very large penalty (a so-called big M) to ensure that this combination is not chosen during the assignment process.

Our approach differs from the original one by Alonso-Mora et al. (2017) in that group-vehicle edge labels in the RGV-graph contain sharing benefits instead of delay costs. Similarly, we replace the cost-based user constraints in the matching algorithm with benefit-based user constraints. In the remainder of this subsection we describe our modifications to the original work by Alonso-Mora et al. (2017) in more detail.

We formulate and quantify the individual benefits stemming from ride-pooling to represent the trade-off that travellers encounter when choosing between a private and a shared ride. In the following, we define the notion of net sharing benefits. It is used not only for identifying which requests and vehicles can be assigned to each other (i.e. in group generation and establishing group-vehicle pair feasibility), but also in deciding which requests are to eventually be assigned to which vehicles (i.e. in assignment), as explained in this and subsequent subsections.

We formulate the benefits and costs of ride-pooling relative to a private ride. The benefit associated with ride-pooling for a traveller with request $r$ corresponds to the total fare discount offered by the service provider for shared rides and is thus dependent on the discount rate $\pi_r$ that is applied to the ride fare $c_r$. $\pi_r$ can be set either as a fixed rate or as a function of the level of service offered by the shared ride. We examine both cases in our experiments.

In contrast, the disbenefits of a shared ride relative to a private ride relate to extra travel time imposed by sharing and an additional discomfort associated with sharing a vehicle, all other things being equal. The total disbenefit of a ride thus depends on how users perceive both attributes when expressed in monetary terms, in this study expressed as delay aversion $\beta_r$ and reluctance to share $\gamma_r$.

These parameters indicate what fare discount users require for an hour of delay and for sharing a vehicle with other riders, i.e. the value of time and the ride-pooling alternative-specific constant, respectively. To account for a different valuation of waiting time and in-vehicle delay, we introduce the parameter $\alpha_r$. It represents the additional user cost of waiting one minute for pick-up as opposed to having one minute of in-vehicle delay. The waiting time of a request $t^{\mathrm{wait}}_r$ is calculated as the difference between the pick-up time $t^{\mathrm{pu}}_r$ and the time at which the request was made $t^{\mathrm{r}}_r$. The total delay $t^{\mathrm{delay}}_r$ is the difference between the actual time of drop-off $t^{\mathrm{d}}_r$ and the earliest possible time of drop-off $t^{*}_r = t^{\mathrm{r}}_r + tt_{o_r, d_r}$. The latter is calculated for an immediate pick-up and the shortest route, with travel time $tt_{o_r, d_r}$ between origin $o_r$ and destination $d_r$.

The total net benefit of r is thus formulated as:
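The equation itself did not survive in this version of the text. A reconstruction that is consistent with the definitions above, assuming that all terms enter additively and that $\alpha_r$ is charged on the waiting time in addition to $\beta_r$, would read:

```latex
\begin{equation}
b_r = \pi_r \, c_r
      - \beta_r \, t^{\mathrm{delay}}_r
      - \alpha_r \, t^{\mathrm{wait}}_r
      - \gamma_r ,
\qquad
t^{\mathrm{wait}}_r = t^{\mathrm{pu}}_r - t^{\mathrm{r}}_r ,
\quad
t^{\mathrm{delay}}_r = t^{\mathrm{d}}_r - t^{*}_r
\tag{1}
\end{equation}
```

Under this reading, the first term is the fare discount, the second and third terms are the monetised delay and the extra waiting cost, and the last term is the fixed discomfort of sharing.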

The three main matching steps in the RGV-approach (steps 2, 3 and 5 in Fig. 1) involve applying the same algorithm for establishing whether a specific combination of requests and/or vehicles yields a feasible match. The steps differ only in their input:

• Pairwise request matching (step 2): two requests and a hypothetical vehicle located at the origin of one of the requests
• Request-vehicle matching (step 3): a single request and a specific vehicle
• Group-vehicle matching (step 5): a group of requests and a specific vehicle

The algorithm can therefore be generalised by stating that it finds whether a vehicle $v$ in fleet $V$, with passengers $P_v$ on board, can serve a group of unassigned requests $U$. The vehicle can be either virtual or specific, and the group of requests may consist of a single request or of multiple requests.

In the first step of the algorithm, the complete set of stop sequences $K_v$ is identified with which $v$ can potentially satisfy $U$. Each stop sequence $S_v \in K_v$ is then checked for feasibility based on vehicle and user constraints. The vehicle constraint ensures that the vehicle capacity is not exceeded, i.e. $v$ cannot serve $U$ with stop sequence $S_v$ if the vehicle occupancy $O_s$ exceeds the vehicle capacity $\kappa$ after any stop $s \in S_v$, similar to the approach of Alonso-Mora et al. (2017). As we assume that drivers strictly follow the shortest path between two nodes in a network with static travel times, for each stop sequence $S_v$ we only need to check the feasibility constraints for a single route, i.e. the route made up of the shortest paths between the consecutive vehicle stops.

The user constraint proposed in this study is modelled with the net sharing benefit. Stop sequence $S_v$ is assumed to satisfy the user level of service constraint only if the net benefit $b_r$ of each request in $U$ and $P_v$ is positive. In other words, all riders, whether already picked up or not, must prefer a shared ride provisioned with this specific route over a private ride.

If both types of constraints are satisfied, the benefit of the stop sequence $b_{S_v}$ is computed by summing the net benefits of all individual requests in $U$ and $P_v$. If there exists at least one feasible stop sequence $S_v \in K_v$ to serve $U$, then $U$ and $v$ form a feasible match. When there is more than one feasible stop sequence, we also need to determine which stop sequence is optimal. Therefore, we search for the stop sequence in $K_v$ with maximum benefit $b_{S_v}$, which we define as the total benefit of the particular group-vehicle combination $b_{Uv}$. However, if no feasible stop sequence exists, $b_{Uv}$ is set to "Invalid". The complete procedure for checking the feasibility of group-vehicle combinations and finding the optimal stop sequence for feasible combinations is specified in the pseudocode shown in Algorithm 1.
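The user constraint can be illustrated with a short sketch. It assumes an additive net benefit (fare discount minus monetised delay, extra waiting cost and a fixed sharing discomfort), with times in hours and valuations in euro per hour; the class and function names are hypothetical and the numbers are illustrative only.

```python
from dataclasses import dataclass

@dataclass
class Request:
    fare: float                # c_r, private ride fare in euro
    discount_rate: float       # pi_r, share of the fare discounted for sharing
    delay_aversion: float      # beta_r in euro per hour
    wait_penalty: float        # alpha_r in euro per hour, on top of beta_r
    sharing_reluctance: float  # gamma_r in euro per shared ride

def net_benefit(req, wait_h, delay_h):
    """Net sharing benefit b_r in euro; wait_h and delay_h are in hours."""
    discount = req.discount_rate * req.fare
    time_cost = req.delay_aversion * delay_h + req.wait_penalty * wait_h
    return discount - time_cost - req.sharing_reluctance

def satisfies_user_constraint(riders_with_times):
    """True only if every rider strictly prefers the shared ride (b_r > 0)."""
    return all(net_benefit(r, w, d) > 0 for r, w, d in riders_with_times)

# Illustrative rider: a 13 euro fare with an assumed 30% discount.
rider = Request(fare=13.0, discount_rate=0.3, delay_aversion=30.0,
                wait_penalty=15.0, sharing_reluctance=1.0)
b_short = net_benefit(rider, wait_h=2 / 60, delay_h=4 / 60)   # small detour
b_long = net_benefit(rider, wait_h=5 / 60, delay_h=10 / 60)   # large detour
```

With these assumed values the small detour leaves the rider with a positive net benefit of about 0.40 euro, whereas the large detour yields about -3.35 euro, so a stop sequence that imposes the large detour on any rider would be rejected.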

Algorithm 1 Matching request group U to vehicle v: Establishing feasibility and optimality of potential stop sequences. 
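The pseudocode of Algorithm 1 is not reproduced in this version of the text. The following is a minimal sketch of the procedure as described above, not the authors' implementation: it enumerates stop sequences by brute force, applies the capacity and benefit constraints, and returns the best sequence. The callback `rider_benefits` is a hypothetical stand-in for the evaluation of the net benefit of every rider along a route, and `None` plays the role of "Invalid".

```python
from itertools import permutations

def stop_sequences(onboard, unassigned):
    """Yield every ordering of stops in which each pick-up precedes its drop-off.

    Stops are (kind, rider) tuples with kind "pu" (pick-up) or "do" (drop-off).
    """
    stops = [("do", p) for p in onboard]
    for r in unassigned:
        stops += [("pu", r), ("do", r)]
    for seq in permutations(stops):
        if all(seq.index(("pu", r)) < seq.index(("do", r)) for r in unassigned):
            yield seq

def capacity_ok(seq, n_onboard, capacity):
    """Vehicle constraint: occupancy may not exceed capacity after any stop."""
    occupancy = n_onboard
    for kind, _ in seq:
        occupancy += 1 if kind == "pu" else -1
        if occupancy > capacity:
            return False
    return True

def match(onboard, unassigned, capacity, rider_benefits):
    """Return (best_benefit, best_sequence), or (None, None) if infeasible."""
    best_benefit, best_seq = None, None
    for seq in stop_sequences(onboard, unassigned):
        if not capacity_ok(seq, len(onboard), capacity):
            continue
        benefits = rider_benefits(seq)
        if any(b <= 0 for b in benefits):   # user constraint: all b_r positive
            continue
        total = sum(benefits)
        if best_benefit is None or total > best_benefit:
            best_benefit, best_seq = total, seq
    return best_benefit, best_seq

def toy_benefits(seq):
    # Toy stand-in: a rider's benefit shrinks the later they are dropped off.
    return [2.0 - 0.5 * i for i, (kind, _) in enumerate(seq) if kind == "do"]

best, best_seq = match(["p"], ["a"], capacity=2, rider_benefits=toy_benefits)
```

Enumerating all permutations is only workable for the small groups that fit in a car; it is used here for clarity rather than efficiency.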

In this part of the method, requests are assigned to vehicles based on the RGV-graph (corresponding to step 7 in Fig. 1). The group-vehicle assignment is treated as an Integer Linear Problem (ILP) with binary decision variables $x_{gv}$ indicating whether a group-vehicle combination with total net sharing benefit $b_{gv}$ is chosen or not. The ILP is defined as follows:
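The formulation itself is missing from this version of the text. A reconstruction that is consistent with the description of Equations 2 to 5 given in the text, and that assumes $b_{gv} = -M$ for infeasible combinations, would be:

```latex
\begin{align}
\max_{x} \quad
  & \sum_{g \in G} \sum_{v \in V}
    \Big( b_{gv} + \sqrt{M} \sum_{r \in R} a_{rg} \Big) \, x_{gv} \tag{2} \\
\text{s.t.} \quad
  & \sum_{g \in G} x_{gv} \le 1
    \qquad \forall v \in V \tag{3} \\
  & \sum_{g \in G} \sum_{v \in V} a_{rg} \, x_{gv} \le 1
    \qquad \forall r \in R \tag{4} \\
  & x_{gv} \in \{0, 1\}
    \qquad \forall g \in G, \; v \in V \tag{5}
\end{align}
```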

The objective function (Equation 2) aims at maximising the total benefit for accepted requests and passengers, while prioritising the acceptance of a maximum number of requests by adding a very large reward for each request $r$ in an assigned request group $g$. The sum of these rewards should however not exceed the big M penalty assigned to infeasible group-vehicle combinations in the objective function. Therefore, the reward per request is set to $\sqrt{M}$. The total benefit associated with a group-vehicle combination $gv$ thus consists of the summed net benefit for all requests and passengers in this group plus a large reward $\sqrt{M}$ for each request that is a member of this group.

The Integer Linear Problem contains three types of constraints guaranteeing respectively a maximum assignment of one request group g to each vehicle v (Equation 3), that each request r is not part of multiple assigned request groups in G (Equation 4), and that each decision variable is binary (Equation 5).
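For a small RGV-graph the assignment can be checked by hand. The sketch below replaces the ILP solver with exhaustive enumeration of the binary variables, which is only viable for toy instances; the function name and the benefit values are hypothetical, while the feasible group-vehicle combinations follow the Fig. 1 example.

```python
from itertools import product

def assign(group_vehicle_benefits, big_m=1e6):
    """Exhaustively solve the assignment for a small RGV-graph.

    group_vehicle_benefits maps (group, vehicle) to b_gv, where group is a
    tuple of request ids. Only feasible combinations need to be listed.
    """
    reward = big_m ** 0.5                          # sqrt(M) per served request
    edges = list(group_vehicle_benefits.items())
    best_value, best_choice = 0.0, []
    for x in product((0, 1), repeat=len(edges)):   # binary x_gv
        chosen = [edge for edge, flag in zip(edges, x) if flag]
        vehicles = [v for (_, v), _ in chosen]
        requests = [r for (g, _), _ in chosen for r in g]
        if len(set(vehicles)) < len(vehicles):     # at most one group per vehicle
            continue
        if len(set(requests)) < len(requests):     # each request in one group only
            continue
        value = sum(b + reward * len(g) for (g, _), b in chosen)
        if value > best_value:
            best_value, best_choice = value, [edge for edge, _ in chosen]
    return best_choice

# Feasible combinations of the Fig. 1 example with made-up benefit values.
fig1_benefits = {
    ((1,), "v1"): 1.0,
    ((2,), "v1"): 1.0,
    ((3,), "v1"): 1.0,
    ((2, 3), "v1"): 2.0,
    ((2,), "v2"): 0.5,
}
assignment = assign(fig1_benefits)
```

With these values the group of requests 2 and 3 is assigned to vehicle 1 and vehicle 2 stays unassigned, which matches the illustration of step 7.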

Vehicles that are not assigned to a pick-up can still be assigned to move in the direction of unassigned requests, in anticipation of new requests appearing in areas that currently are undersupplied (step 8 in Fig. 1). In this study, the rebalancing procedure of Alonso-Mora et al. (2017) is adopted. Its objective is to minimise the total empty vehicle rebalancing distance while ensuring that the maximum number of vehicles is assigned to rebalance.

After vehicles are assigned to pick up requests, rebalance or remain idle, vehicle schedules are updated and the simulation prepares for the next assignment phase (step 9 in Fig. 1).

We measure the performance of the ride-pooling service using a series of Key Performance Indicators (KPIs) designed to capture the level of service (LoS) offered to users as well as its operational efficiency, which is relevant for authorities and service providers. If ride-pooling services are assessed by examining the same aspects that public transport users consider to be most important (Bates et al., 2001; Edvardsson, 1998; Hensher et al., 2003; König & Axhausen, 2002; Friman & Gärling, 2001), then the LoS KPIs of shared rides are reliability, comfort, travel time and fare level. Ride fares in this case are not considered as a KPI, since they are directly dependent on $\pi_r$ and are thus determined by model input. The main LoS KPIs in this study include the acceptance rate (i.e. the percentage of fulfilled requests out of the total demand, thereby an indicator for coverage and reliability), the delay as a percentage of the direct travel time (indicating travel time), the average number of stops per passenger (pertaining to comfort), and the share of passenger time with a specific number of co-riders on board (also related to comfort).

In addition to the quality of service delivered by the ride-pooling service, an authority is also interested in the share of the vehicle distance that can be reduced through ride-pooling. A suitable KPI to express distance efficiency is the gross effective vehicle transportation distance ratio, which is defined as the sum of the shortest OD-distance of accepted requests (= 'effective vehicle distance') divided by the total vehicle movement distance (Ehsani et al., 2018). The total vehicle movement distance (or vehicle mileage) consists of the transportation distance (the vehicle distance with at least one passenger on board) and the deadheading distance (the total empty vehicle distance for accessing requests and rebalancing). Further, a net effective vehicle transportation distance ratio is defined. This ratio accounts for the fact that the summed shortest distance of accepted requests, which represents the distance needed when sharing is not allowed, excludes deadheading. For a fairer comparison, the deadheading distance is therefore subtracted from the total vehicle movement in the net effective vehicle transportation distance ratio. For operators, the average vehicle occupancy while a vehicle is transporting passengers is an important efficiency KPI.
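The two distance ratios can be computed as in the following sketch. The function name and the distances are hypothetical and serve only to illustrate the definitions above.

```python
def distance_ratios(direct_km, transport_km, deadhead_km):
    """Return (gross, net) effective vehicle transportation distance ratios.

    direct_km: shortest OD distances of the accepted requests.
    transport_km: vehicle distance driven with at least one passenger.
    deadhead_km: empty distance for reaching requests and rebalancing.
    """
    effective = sum(direct_km)
    total_movement = transport_km + deadhead_km
    gross = effective / total_movement
    net = effective / (total_movement - deadhead_km)   # deadheading removed
    return gross, net

# Made-up example: three accepted requests served in one pooled trip.
gross, net = distance_ratios([4.0, 6.0, 5.0], transport_km=10.0, deadhead_km=2.5)
```

In this made-up case 15 km of direct travel is delivered with 12.5 km of vehicle movement, giving a gross ratio of 1.2 and a net ratio of 1.5; a ratio above 1 indicates that pooling saves distance relative to private rides.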

A series of experiments is constructed to test the effect of users' behavioural preferences, the discounting policy and the spatial distribution of demand on ride-pooling performance in an urban context. Before specifying the scenarios that have been designed to test the effect of these variables (Subsection 3.2), we introduce the general set-up of the experiment in terms of road network, demand and vehicle fleet characteristics in Subsection 3.1. Subsection 3.3 concludes this section with a description of the model implementation.

The assumed grid network consists of 121 nodes with a link distance of 500 metres, thereby leading to a maximum trip distance of 10 kilometres and a surface area of 25 km², comparable to the area inside the Ring Road of Amsterdam or the Inner Ring of Berlin. The intermediate stop distance is relatively large, whereby we implicitly assume that vehicles cannot stop at all road intersections and users are willing to walk to a pick-up location (and/or from a drop-off location). The assumed speed on the roads, 36 km/h, is slightly higher than in an average European city (Kfzteile24, 2019). The total demand for trips is set to 1,210 requests per hour, an average of 10 requests per hour per node. The way trips are distributed over the network is scenario-specific, since we are explicitly interested in investigating the impact of demand distribution on system performance. In all cases trips with a ride distance of 2 kilometres or shorter are excluded, as such rides are uncommon (T. as well as undesirable in the context of a ride-pooling service. A gravity model (Erlander & Stewart, 1990) is applied to create a list of origin-destination pairs. Demand generation is assumed to follow a random process with Poisson distribution. Each request $r$ is assigned a request time $t^{\mathrm{r}}_r$ by sampling from an exponential distribution based on the expected interval $\lambda$ between two successive requests with a specific OD-combination, which follows again from the (scenario-specific) demand distribution. The fleet of the investigated ride-pooling service consists of 150 vehicles with a capacity of $\kappa = 3$, that of a normal car, initially evenly distributed over the network. Ride fares are set based on the regulated maximum taxi fares for the city of Amsterdam in 2019: a base fee of €3 and a kilometre fee of €2 (Gemeente Amsterdam, 2019).
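The set-up can be sketched in a few lines. The snippet below builds the 11 x 11 grid, computes shortest-path distances and samples request times for a single OD pair from exponential inter-arrival times. The function names, the seed and the rate of 10 requests per hour are illustrative assumptions; the gravity model that distributes demand over OD pairs is left out. Note that `expovariate` takes the arrival rate, i.e. the inverse of the expected interval $\lambda$.

```python
import random

def grid_nodes(side=11):
    """Nodes of a square grid; 11 x 11 = 121 nodes as in the set-up above."""
    return [(x, y) for x in range(side) for y in range(side)]

def grid_distance_km(a, b, link_km=0.5):
    """Shortest-path distance between two nodes on the grid (Manhattan metric)."""
    return (abs(a[0] - b[0]) + abs(a[1] - b[1])) * link_km

def request_times(rate_per_hour, horizon_h, rng):
    """Sample request times (in hours) of a Poisson process for one OD pair."""
    times, t = [], 0.0
    while True:
        t += rng.expovariate(rate_per_hour)   # mean inter-arrival = 1 / rate
        if t >= horizon_h:
            return times
        times.append(t)

nodes = grid_nodes()
longest_trip = max(grid_distance_km(nodes[0], n) for n in nodes)
rng = random.Random(42)   # fixed seed so that the sketch is reproducible
times = request_times(rate_per_hour=10.0, horizon_h=2.0, rng=rng)
# Travel time in minutes of the longest trip at the assumed 36 km/h.
travel_time_min = grid_distance_km((0, 0), (10, 10)) / 36.0 * 60.0
```

The longest corner-to-corner trip is 10 km, which takes roughly 16.7 minutes at 36 km/h.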

For computational reasons, the total duration of the simulation is limited to two hours, with request groups being assigned to vehicles every minute (120 times in total). An additional warm-up period of 15 minutes is applied to minimise the impact of each of the starting conditions.

A total of fourteen scenarios are constructed for investigating the effect of previously introduced behavioural attributes, the platform's pricing policy and the spatial distribution of demand in the network. We limit the number of scenarios in the experiment by designing scenarios that differ only in one variable at a time from a reference scenario. In this subsection, we provide the motivation for the specification of scenarios, summarised in Table 1, based on the four experimental variables: delay aversion $\beta_r$, reluctance to share $\gamma_r$, sharing discount $\pi_r$, and directionality in demand.

Detours for picking up or dropping off additional travellers induce a cost for travellers already on board the vehicle. Thus, a potentially crucial behavioural factor for the potential to pool rides is the amount of disbenefit that travellers allocate to a single unit of delay, which is represented in this work by delay aversion $\beta_r$ (Equation 1). Several recent studies (Krueger et al., 2016; Lavieri & Bhat, 2019; Y. Liu et al., 2019) have estimated the value of in-vehicle time for ride-pooling, providing empirical indications for the range of travellers' delay aversion values. Their estimates vary considerably, ranging from 8 to 23 €/h, which implies that the value of in-vehicle time for ride-pooling may be highly context-specific, depending on factors like the type of vehicle (autonomous or human-driven) and socio-economic characteristics. Since the specification of delay aversion in ride-pooling is not yet well-established, we decide to test a relatively wide range of values in order to get a more complete picture of how travellers' perception of delays can possibly affect ride-pooling potential. For the base scenario, we assume a delay aversion $\beta_r$ of €30/h, while alternative scenarios test values that are 6 and 12 €/h higher or lower than the base value (scenarios 1-5 in Table 1).

At the same time, taste variations may play an important role in determining the scalability and efficiency of ride-pooling systems when users are able to choose between private and shared rides, as modelled in this study. Alonso-González et al. found a significant heterogeneity in the value of in-vehicle time of potential ride-pooling users. We therefore introduce a scenario with taste heterogeneity in delay aversion β r . Since there is limited empirical knowledge on the variation of delay aversion amongst the population, we need to make an assumption about the shape of this distribution. We specify heterogeneity in delay aversion (scenario 10 in Table 1) as a normal distribution with a mean value of €30/h (the base value from scenario 1) and a standard deviation of €10/h: N(30, 10).

In all of the experiments, the value of α r , representing the extent to which waiting time is perceived more negatively than in-vehicle time, is set to half of β r , which is in line with the waiting time multiplier found in stated preference studies on potential ride-pooling users (Y. ) and a revealed preference study in urban public transit (Yap et al., 2018) .

Recently, the concept of willingness to share, also referred to as the reluctance to share or the willingness to pay to not have to share a ride with other travellers, has started to gain more attention in the literature. Lavieri and Bhat (2019) concluded that travellers' reluctance to share represents a fixed cost, independent of the duration of the ride. Alonso-González, Cats, et al. (2020) confirm this principle, yet with the caveat that it applies only to ride-pooling with one or two co-riders, and not to all travellers in the population. When travelling with more than two co-riders, which represents microtransit rather than ride-pooling, they find a travel time dependent willingness to share. For small-scale ride-pooling however, both studies find a relatively small fixed average reluctance to share, in the range of €0.50 to €1. Alonso-González, Cats, et al. (2020) also observe that the willingness to share is highly context-specific, depending on for example geographical characteristics and familiarity with ride-pooling services.

All in all, empirical research on travellers' reluctance to share is still scarce, especially when considering different contexts. With a first indication that the reluctance to share a vehicle with one or two co-riders is represented by a fixed cost, we decide to test a relatively large range of fixed values for reluctance to share γ r in our numerical experiments: from €1 to €5 (scenarios 1 and 6-9 in Table 1). In all of the other scenarios, we assume a median value of γ r = €3. Again, when lacking full understanding of the specification of behavioural attributes related to sharing, covering a large range of values allows examining the implications of these attributes for the potential of ride-pooling.

As mentioned earlier, Alonso-González, Cats, et al. (2020) state that different classes of potential ride-pooling users have a different specification of their reluctance to share. In fact, preferences related to fellow passengers in ride-pooling are found to exhibit more pronounced heterogeneity than preferences towards travel time (Zhang & Zhao, 2018). A possible explanation for this is that some users enjoy the social interactions that come along with sharing a ride while others are reluctant to share their ride with strangers. This implies that for some users the willingness to share γ r might in fact be positive, or in other words that the fact that a vehicle is shared induces an additional benefit next to a reduced ride fare. Again, lacking empirical evidence on the distribution of preferences across the population, a normal distribution is assumed to capture heterogeneity. Since heterogeneity in reluctance to share γ r was found to be (likely) larger than heterogeneity in delay aversion β r (with its assumed standard deviation of one third of the mean), the standard deviation relative to the mean for the reluctance to share γ r is set higher than for the delay aversion β r in scenario 10. We specify a mean reluctance to share of €3 and a standard deviation of €2: N(3, 2) (scenario 11).
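The two heterogeneity scenarios can be illustrated with a minimal sampling sketch. It assumes that each user receives an independent draw from the stated normal distribution and that draws are not truncated; the source does not say how negative values are handled, so that is an assumption of this example, as are the function name, sample size and seeds.

```python
import random

def sample_preferences(n_users, mean, std, seed):
    """Draw one preference value per user from a normal distribution.

    Assumption: independent, untruncated draws (not confirmed by the source).
    """
    rng = random.Random(seed)
    return [rng.gauss(mean, std) for _ in range(n_users)]

# Scenario 10: heterogeneous delay aversion, mean 30 EUR/h, std 10 EUR/h.
beta = sample_preferences(10_000, mean=30.0, std=10.0, seed=1)

# Scenario 11: heterogeneous reluctance to share, mean 3 EUR, std 2 EUR.
gamma = sample_preferences(10_000, mean=3.0, std=2.0, seed=2)

# Share of users for whom sharing is experienced as a benefit (negative draw).
share_sharing_is_benefit = sum(g < 0 for g in gamma) / len(gamma)
```

Under an untruncated N(3, 2), a value below zero lies 1.5 standard deviations below the mean, so roughly 7% of users would experience sharing as a benefit rather than a cost. For N(30, 10) a negative draw is three standard deviations away and therefore rare.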

An additional scenario (scenario 12 in Table 1) has been devised to test the effect of a ride-pooling provider's pricing mechanism. All scenarios except scenario 12 assume a fixed 50% discount for all ride-pooling rides, independent of whether sharing actually occurs throughout the ride. This is in line with services offered by ride-pooling platforms, e.g. UberPool, that typically offer discounts between 25 and 60% of the fare of a private ride (Shaheen & Cohen, 2019). We are interested in examining the impacts of an alternative pricing mechanism that reflects the actual extent of sharing experienced by the user. We therefore specify an alternative scenario where a similar discount of 50% is given to the user even if he or she ends up being served privately (same as for all other scenarios, whereby the discount is basically a compensation for the risk of having to share), while an additional 7.5% discount is given for each co-rider n pax on-board the vehicle during the part of the ride with highest vehicle occupancy. The aim of this additional scenario is to explore whether occupancy-dependent discounts in ride-pooling can be effective in reducing vehicle mileage by facilitating the matching process, as well as at what user cost. It is not meant to give a specification of the (socially) optimal pricing strategy, which, depending on the results of this study, may however be an interesting topic for future research.
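The two discount structures can be written as a single fare rule. The sketch below is illustrative only: the function name, the argument names and the €10 private fare are assumptions of this example, while the 50% base discount and the 7.5% discount per co-rider are taken from the scenario description.

```python
def shared_fare(private_fare, max_co_riders,
                base_discount=0.50, per_co_rider_discount=0.0):
    """Fare of a ride-pooling trip relative to the private-ride fare.

    max_co_riders: number of co-riders on board during the part of the
    ride with the highest vehicle occupancy (0 if served privately).
    """
    discount = base_discount + per_co_rider_discount * max_co_riders
    return private_fare * (1.0 - discount)

private_fare = 10.0  # hypothetical private-ride fare in EUR

# Flat policy (all scenarios except 12): the fare ignores actual sharing.
flat = [shared_fare(private_fare, n) for n in range(3)]

# Occupancy-dependent policy (scenario 12): 7.5% extra per co-rider.
occupancy_based = [
    shared_fare(private_fare, n, per_co_rider_discount=0.075) for n in range(3)
]
```

With these inputs the flat policy charges €5.00 regardless of occupancy, whereas the occupancy-dependent policy charges about €5.00, €4.25 and €3.50 for zero, one and two co-riders. The user therefore knows the maximum fare in advance but not the exact fare.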

The effect of directionality in demand on the performance of ride-pooling is tested using three different scenarios. In the base scenario (1 in Table 1 ) demand is perfectly uniform, with equal production and attraction in each of the nodes. In two additional scenarios (scenarios 13 and 14 in Table 1 ), we specify a demand distribution that represents an increasingly concentrated demand pattern, with more production in the outer nodes of the network and more attraction in the central nodes, intended to mimic a morning peak pattern.

The simulation model is implemented in Python from scratch, using the open-source library NumPy to enable efficient operations on large data structures in the model, such as creating and storing the edges of RGV-graphs when many requests and vehicles are considered. The NetworkX package is used to compute the shortest path between a pair of locations in the road network, after which the corresponding travel time is stored in a look-up table.
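The idea of precomputing a travel-time look-up table can be sketched as follows. This is a dependency-free stand-in using Dijkstra's algorithm from the Python standard library, not the authors' NetworkX-based code; the toy network, its travel times and all function names are assumptions of this example.

```python
import heapq

def shortest_times_from(adjacency, source):
    """Dijkstra: shortest travel time from `source` to every reachable node."""
    dist = {source: 0.0}
    queue = [(0.0, source)]
    while queue:
        d, node = heapq.heappop(queue)
        if d > dist.get(node, float("inf")):
            continue  # stale queue entry, a shorter route was already found
        for neighbour, t in adjacency[node].items():
            candidate = d + t
            if candidate < dist.get(neighbour, float("inf")):
                dist[neighbour] = candidate
                heapq.heappush(queue, (candidate, neighbour))
    return dist

def build_travel_time_table(adjacency):
    """Look-up table: table[origin][destination] -> shortest travel time."""
    return {node: shortest_times_from(adjacency, node) for node in adjacency}

# Hypothetical three-node network with link travel times in minutes.
adjacency = {
    "A": {"B": 4.0, "C": 10.0},
    "B": {"A": 4.0, "C": 3.0},
    "C": {"A": 10.0, "B": 3.0},
}
table = build_travel_time_table(adjacency)
time_a_to_c = table["A"]["C"]  # route via B (4 + 3) beats the direct link (10)
```

Once the table is built, every travel-time query during matching is a dictionary look-up, which matters because the same origin-destination pairs are evaluated repeatedly when stop sequences are enumerated.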

We compute the complete RV- and RGV-graphs without imposing a time budget or limits on the number of edges. An exhaustive search is performed to find the optimal stop sequence with which a vehicle can serve one or more requests (Algorithm 1). The optimisation problems that are part of the group-vehicle assignment and rebalancing procedure are, for all experiments, solved to optimality using the MOSEK Optimizer API. Consequently, each assignment iteration is guaranteed to yield an optimal result.
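An exhaustive stop-sequence search can be sketched as below. This is not a reproduction of Algorithm 1: it assumes that the objective is total vehicle travel time, it only enforces that each pick-up precedes the corresponding drop-off, and it leaves out capacity and user-acceptance checks. The function name, the data layout and the toy line network are assumptions of this example.

```python
from itertools import permutations

def best_stop_sequence(vehicle_pos, requests, travel_time):
    """Enumerate all stop orders and return the cheapest valid one.

    requests: dict mapping request id -> (origin, destination)
    travel_time: nested dict, travel_time[a][b] -> time from a to b
    """
    stops = [(rid, "pickup") for rid in requests]
    stops += [(rid, "dropoff") for rid in requests]
    best_seq, best_cost = None, float("inf")
    for seq in permutations(stops):
        # Precedence constraint: a request is picked up before it is dropped off.
        if any(seq.index((rid, "pickup")) > seq.index((rid, "dropoff"))
               for rid in requests):
            continue
        cost, pos = 0.0, vehicle_pos
        for rid, kind in seq:
            node = requests[rid][0] if kind == "pickup" else requests[rid][1]
            cost += travel_time[pos][node]
            pos = node
        if cost < best_cost:
            best_seq, best_cost = seq, cost
    return best_seq, best_cost

# Hypothetical line network: nodes 0..4, travel time equals node distance.
travel_time = {i: {j: float(abs(i - j)) for j in range(5)} for i in range(5)}
requests = {"r1": (1, 4), "r2": (2, 3)}
sequence, cost = best_stop_sequence(0, requests, travel_time)
```

In this toy case the cheapest order picks up r1 and r2, then drops off r2 and r1, for a total travel time of 4. With only the precedence constraint, a group of k requests has (2k)!/2^k valid stop orders, that is 6, 90 and 2,520 for two, three and four requests, which indicates why larger feasible groups quickly dominate the run time.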

With these settings, the majority of scenarios could be run within 30 minutes on a single-core 2.30GHz processor. Two notable exceptions are the scenarios with the lowest delay aversion β r and lowest reluctance to share γ r , with run times of approximately five hours. In these scenarios, as a result of more favourable preferences towards sharing, larger request groups are potentially feasible, hence increasing the solution space. Consequently, testing those groups requires significant computational time, as the set of possible stop sequences to satisfy them is significantly (i.e. more than exponentially) larger than for small request groups. The scenario with an occupancy-dependent additional discount (U 30 3 D) also enlarges the solution space, and consequently its computational complexity is relatively high compared to most other scenarios (i.e. a run time of nearly one hour with the same processor).

Even though we investigate real-time ride-pooling, given the offline evaluation nature of this work we do not focus on the computational efficiency of the algorithm in this study. Notwithstanding, the approach adopted in this study is in principle suited for simulation in real-time, as long as we manage to limit the computational load of each assignment iteration. We can think of several changes to the current implementation to do so, including a timeout or a constraint on the number of edges in the development of RV- and RGV-graphs, a timeout when seeking the optimal stop sequence for a group-vehicle combination, and a timeout in the ILP assignment of vehicles to requests. As these implementation changes directly touch upon the core of the work by Alonso-Mora et al. (2017), we refer to their work for more details.

In this section we report the results of the experiment and analyse the effect of users' preferences (Subsection 4.1), the variation thereof (Subsection 4.2), the applied discount structure (Subsection 4.3) and the demand distribution (Subsection 4.4) on the level of service and efficiency of a ride-pooling service. The complete set of KPI values is presented in Tables 2 (level of service) and 3 (efficiency). In the following four subsections we discuss the effect of each of the above-mentioned aspects in detail.

As can be expected, the acceptance rate (Fig. 2a) increases as the reluctance to share γ r decreases. The acceptance rate rises from 25.4% when γ r = €5 to nearly 100% when γ r = €1. The increase is approximately linear until the great majority of requests is accepted. Interestingly, the average vehicle occupancy (Fig. 2b) increases more than linearly when γ r decreases, as do passengers' waiting time and in-vehicle delay (Fig. 2c). It is found that hardly any rides are shared (i.e. the average vehicle occupancy is 1.14) if users are highly sensitive to sharing with other passengers, resulting in an average in-vehicle delay close to zero. In such a scenario, the operational efficiency in terms of the number of effective passenger kilometres per vehicle kilometre is as low as 1.05. This ratio is found to increase approximately linearly with an increase in the willingness to share. It can be explained by the finding that the total effective vehicle distance (due to more requests served) increases more than the total vehicle movement distance when users are more flexible (Fig. 2d), as a result of a more efficient assignment of vehicles to requests. Also, deadheading is found to be relatively uncommon when users' sharing tolerance is high, as new requests can be picked up by vehicles on their way to drop off other passengers. If γ r = €1 for example, the average effective passenger distance per total vehicle kilometre in the system (including transportation and deadheading) rises to 1.36 kilometres. When considering the effect of delay aversion β r instead of reluctance to share γ r , similar, albeit less pronounced, results are found. The acceptance rate, for example, does not exceed 90% in any of the scenarios. Evidently, the level of service and operational efficiency are more sensitive to the tested values of the willingness to share, γ r , than to those of the delay aversion, β r .

Significantly fewer requests are accepted when reluctance to share varies amongst the user population (with an unchanged mean): 66.8% versus 76.0% of all requests (Fig. 3a) . As can be seen in Fig. 3b , the acceptance rate varies considerably amongst user groups that are characterised by different degrees of reluctance to share. Heterogeneity in the delay tolerance on the other hand barely has an impact on the acceptance rate: 75.9%. An explanation for this difference could be that users with a high delay aversion can often still be satisfied (Fig. 3b ) by serving them with the shortest or a relatively short path so that their delay costs are minimised. This happens at the cost of more flexible passengers that will get served with a delayed pick-up and/or a less direct route. In other words, a level of service is offered that discriminates amongst users based on their delay aversion. With the majority of accepted requests having an above average delay tolerance (since it is easier to find a ride for those requests), the average delay experienced by users in the system, 36.4% of the direct travel time, is higher than when the delay tolerance is assumed homogeneous (29.5%, Fig. 3c ).

When considering a scenario with varying willingness to share, it is found that users with a below average willingness to share are much more likely to reject a ride-pooling service (Fig. 3b) even when offered a direct ride, leading to a lower overall acceptance rate (Fig. 3a) . The total rebalancing distance is limited since there are relatively many requests around unassigned vehicles that cannot be served even with a direct route. The effective transportation distance ratio, an indicator for distance savings (Fig. 3d) , ranges between 1.15 and 1.17 for the three scenarios in this set of experiments, and thus does not seem to be significantly dependent on whether heterogeneity in ride-pooling tolerances is considered. Based on this, we can say that the operational efficiency in terms of the vehicle-km travelled that a ride-pooling system can save is not very sensitive to the variation of sharing preferences over the population.

As expected, when users receive an additional 7.5% discount per co-rider they share the highest occupancy part of their ride with, the average vehicle occupancy increases dramatically (from 1.38 to 1.85, as can be seen in Fig. 4a), and a similar increase is found in the share of time passengers spend in a full vehicle (from 17.7% to 45.1% of the total passenger time). By utilising the available vehicle capacity more efficiently, the acceptance rate (also Fig. 4a) increases from 76.0% to 82.1%, although at the cost of a higher average delay (Fig. 4b). A higher vehicle occupancy will burden passengers with longer detours and consequently with an in-vehicle delay more than three times as high as when no additional discount is offered (25.1% vs 8.1% of the direct travel time). Also, the average waiting time for pick-up is marginally longer in the scenario with an occupancy-dependent discount, with pick-ups being complicated by the fact that many vehicles are driving around fully occupied.

The (gross) effective vehicle transportation distance ratio increases from 1.15 to 1.35 when an additional 7.5% discount is awarded per co-rider. In combination with a higher acceptance rate, relatively large distance savings (Fig. 4c) are achieved with the introduction of an additional occupancy-dependent discount. Passengers however may be reluctant to choose a service for which they know the maximum cost but not the exact cost a priori. The distance that a ride-pooling service can save (when compared to private rides) stems not only from the more efficient transportation of requests (the transportation distance drops from 5,588 to 4,829 kilometres) but also from a reduction in the deadheading distance to access new requests (from 900 to 501 kilometres), as requests are being picked up by non-empty vehicles on their way to drop off other passengers.

More directionality in demand leads to more requests being rejected by the ride-pooling service, namely 37.1% when demand is strongly directed versus 24.0% when demand is perfectly uniform, as shown by Fig. 5a. If demand is perfectly uniform, the average vehicle occupancy of vehicles in revenue mode (also Fig. 5a) is 1.38 and the average passenger delay (Fig. 5b) is 29.5% of the direct travel time. The drop in the number of accepted requests when there is a moderate level of directionality in demand leads to a drop in the vehicle occupancy (1.32) and average delay (29.1% of direct travel time). Interestingly, when the level of directionality increases further, the vehicle occupancy (1.35) and the average delay (31.9% of the direct travel time) bounce back. With a larger spatial inequality in pick-ups and drop-offs, average waiting times are relatively short in the center, where attraction exceeds production (Fig. 5c), compared to the nodes in the periphery of the network. Since only the minority of requests originates in the center in this case, the average waiting time is mainly determined by requests originating outside of the center, where production exceeds attraction. In these nodes, the average waiting time is nearly twice as high (27.1% versus 14.1%).

Deadheading serves as a means to resolve the imbalance between supply and demand. It is responsible for 27.6% of all vehicle kilometres in a scenario in which demand is strongly directed, compared to only 13.9% of the mileage when demand is uniform. The effective passenger kilometres per ride-pooling vehicle kilometre in the respective scenarios is 0.95 versus 1.15. This alarming result suggests that when directionality in demand is high, the total vehicle distance can be longer than the effective transportation distance even for ride-pooling services (Fig. 5d).
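The deadheading share for uniform demand can be cross-checked against the distances reported for the pricing experiment (5,588 km of transportation and 900 km of deadheading in the reference scenario). The sketch assumes that the total vehicle distance is the sum of transportation and deadheading distance; the function name is hypothetical, and the share for the occupancy-dependent discount scenario is derived here rather than reported in the text.

```python
def deadheading_share(transport_km, deadhead_km):
    """Fraction of all vehicle kilometres driven to access new requests.

    Assumption: total vehicle distance = transportation + deadheading.
    """
    return deadhead_km / (transport_km + deadhead_km)

# Reference scenario (uniform demand, flat 50% discount).
reference = deadheading_share(5588, 900)

# Occupancy-dependent discount scenario (derived, not reported in the text).
occupancy_discount = deadheading_share(4829, 501)
```

The reference value is 900 / 6,488, or about 13.9%, which matches the share quoted for uniform demand. Under the same assumption, the occupancy-dependent discount would lower the share to about 9.4%.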

This work is the first study to consider ride-pooling potential while accounting for the trade-off that users encounter when presented with the option of ride-pooling. Previous studies, such as Santi et al. (2014) and Alonso-Mora et al. (2017) , assumed that all users are potentially willing to ride-share as long as their waiting time and total delay do not exceed a certain non-compensatory threshold. This is not very realistic as taxi users have no reason to share their ride (and accept a delay) unless attaining a benefit in return. Therefore, in this study we formulate a compensatory user cost formulation where the disbenefits associated with sharing one's ride need to be at least compensated by the fare reduction offered for the user to substitute a private ride with ride-pooling. Hence, the assumption is that users will only switch to a ride-pooling service if such a choice yields a net positive benefit over a conventional taxi or ride-hailing service. Also, this work accounts for the fact that sharing a vehicle with strangers, which in literature is considered to be one of the main potential barriers for a successful implementation of ride-pooling services, induces a disutility.

Our compensatory formulation is embedded in the matching framework proposed in Alonso-Mora et al. (2017). A group of requests is considered feasible only if all the individuals included in the shared ride evaluate it as superior to the private ride alternative, as well as satisfying vehicle-related constraints.
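The group feasibility rule can be sketched as follows. The exact user cost function (Equation 1) is not reproduced in this section, so the form below is an assumption: the fare saving must cover in-vehicle delay valued at β r , additional waiting valued at β r + α r with α r equal to half of β r , and a fixed reluctance γ r whenever the ride is shared. Class, field and function names, the €12 fare and the tie-breaking rule (a net benefit of zero counts as acceptable) are also assumptions of this example.

```python
from dataclasses import dataclass

@dataclass
class Offer:
    private_fare: float   # EUR, fare of the private-ride alternative
    discount: float       # fraction of the private fare, e.g. 0.50
    extra_wait_h: float   # additional waiting time versus a private ride (h)
    extra_ivt_h: float    # in-vehicle delay versus the direct route (h)
    shared: bool          # True if at least one co-rider is on board

def net_benefit(offer, beta=30.0, gamma=3.0, alpha=None):
    """Assumed compensatory form: fare saving minus delay and sharing costs."""
    if alpha is None:
        alpha = 0.5 * beta  # waiting-time premium set to half of beta
    saving = offer.discount * offer.private_fare
    delay_cost = beta * offer.extra_ivt_h + (beta + alpha) * offer.extra_wait_h
    sharing_cost = gamma if offer.shared else 0.0
    return saving - delay_cost - sharing_cost

def group_is_feasible(offers, **prefs):
    """A request group is feasible only if every member is at least compensated."""
    return all(net_benefit(o, **prefs) >= 0 for o in offers)

# Hypothetical user: EUR 12 private fare, 3 minutes of in-vehicle delay.
offer = Offer(private_fare=12.0, discount=0.50,
              extra_wait_h=0.0, extra_ivt_h=0.05, shared=True)
base_benefit = net_benefit(offer)  # 6.00 saving - 1.50 delay - 3.00 sharing
```

In this example the user gains about €1.50 and accepts the shared ride under the base preferences, but the same offer is rejected when γ r = €5. Because the check must hold for every member, a single reluctant user makes the whole group infeasible, which is what distinguishes this formulation from fixed delay thresholds. Vehicle-related constraints such as capacity are outside this sketch.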

When representing the choice whether to ride-share or not as a compensatory function between travel attributes (travel time, ride fare and the presence of co-riders), we find that the distance savings from ride-pooling reported by previous studies are only attainable when users have favourable preferences to sharing rides. For example, when users have a relatively high willingness to share, or more specifically when they are willing to pay no more than 1 euro to upgrade a shared ride to an individual one, assuming no change in travel time, 32% of the transportation vehicle kilometres in the network can be removed. This compares to a 40% reduction found by Santi et al. (2014) in their study on ride-pooling in New York. However, our work shows that if users are found to be less willing to share (in other words, willing to pay a high premium for a private ride), we can expect distance savings that are significantly lower than reported in previous work. We find for example that, in the scenario where travellers are least willing to share, the total vehicle transportation distance can be reduced by just 15% when allowing for shared rides. This scenario assumes that ride-pooling users demand a compensation of 5 euro for the fact that they have to share their vehicle with other passengers. Furthermore, it should be noted that in this scenario, and in most others, ride-pooling users get rewarded with a discount rate of 50%, relatively high compared to discounts currently offered by ride-pooling services. It is thus plausible that ride-pooling distance savings in reality are even lower than those found in our analysis.

In addition, our results show that besides the expected distance savings also the level of service that a ride-pooling system offers is significantly dependent on users' tolerance to delays and willingness to share a vehicle with co-riders. When we vary both attributes, we find the acceptance rate of a ride-pooling service to range between 25% and 99% of all incoming requests, and the average delay of accepted requests to range between 15% and 61% of the direct travel time. In conclusion, our results highlight that any comparison of ride-pooling model outputs has to carefully account for the behavioural specifications.

Although heterogeneity in user preferences seems to have only a limited impact on potential distance savings, it is found to have negative consequences for the level of service, with more rejected requests and a larger average delay. Moreover, striving to serve the few users with a below-average sharing tolerance comes at the expense of many accepted requests with an above-average tolerance, which are assigned a less efficient ride. This calls for the development of discriminative pricing or service provision mechanisms, including the possibility for the service provider to choose to decline travel requests. Furthermore, this study has shown that the design of a ride-pooling service, such as its pricing structure, may significantly affect the expected societal benefits and service quality. A relatively small additional discount of 7.5% per co-rider with whom a user shares their ride at maximum occupancy, on top of the standard 50% discount assumed in all experiments, can for example more than double the total reduction in vehicle kilometres. At the same time, the percentage of rejected requests drops from 24% to 18% if such a discount policy is implemented. Instead of offering a flat subsidy per pooled ride or charging a tax on private rides, policy makers looking to minimise road traffic externalities may thus be better off subsidising ride-pooling based on vehicle occupancy.

This study also shows that the potential of a ride-pooling system can be greatly dependent on external variables, such as the spatial distribution of demand. Demand patterns are likely to be at least somewhat concentrated due to spatial clustering of activities like work, residence and shopping. This study shows for example that, when most requests are directed towards the center of the network, typical for a morning peak, the performance of a ride-pooling system is worse than when demand is uniform, both in terms of level of service and efficiency. This is a direct result of the spatial imbalance in demand and supply, which requires vehicles to deadhead from drop-off locations in low demand areas to pick-up locations in high demand areas, leading to increased vehicle mileage and longer waiting times for pick-up. To be specific, the (gross) effective vehicle transportation distance ratio can even drop below 1 when the directionality of demand is high, while the average user delay can amount to 32% of the direct travel time in such a scenario. In summary, we found that directionality in demand negatively affects both level of service and operational efficiency of ride-pooling services.

Future research can investigate the impact of additional aspects on ride-pooling performance. This includes, for example, the effect of fleet properties (capacity and fleet size), the effect of the fares of alternative (single-rider) services, more complex discounting mechanisms than the one assumed here, and external variables like the density of possible pick-up and drop-off locations in the network. Also, it might be interesting to find whether ride-pooling efficiency can be improved by rejecting requests that negatively affect ride-pooling performance on a system level, such as requests that are destined for a location far away from where new demand is expected. Moreover, the validity of ride-pooling studies can be improved by the incorporation of mode choice, whereby passengers can choose not only between a private ride and ride-pooling, but also to travel by other means, possibly adopting a probabilistic choice modelling framework. Finally, future research can address the equity aspects of a ride-pooling system, including the spatial disparity of service accessibility.

What are the determinants of the willingness to share rides in pooled on-demand services? Transportation

Value of time and reliability for urban pooled on-demand services

On-demand high-capacity ride-sharing via dynamic trip-vehicle assignment

The valuation of reliability for personal travel

Quantifying the Potential of Ride-Sharing Using Call Description Records

The Transportation Impacts of Vehicle-for-Hire in the City of Toronto

Carpooling and carpool clubs: Clarifying concepts and assessing value enhancement possibilities through a Stated Preference

Ride sharing: Psychological factors

Causes of customer dissatisfaction-studies of public transport by the critical-incident method

Simulation-based design and analysis of on-demand mobility services

The Gravity Model in Transportation Analysis: Theory and Extensions

The Impact of Ridesharing in Mobility-on-Demand Systems: Simulation Case Study in Prague

Frequency of negative critical incidents and satisfaction with public transport services

Wat kost een rit in een taxi?

What might covid-19 mean for mobility as a service (maas)?

Service quality -developing a service quality index in the provision of commercial bus contracts

Best and worst cities to drive 2017

The Reliability of the Transportation System and its Influence on the Choice Behaviour

Preferences for shared autonomous vehicles

Modeling individuals' willingness to share trips with strangers in an autonomous vehicle future

Exploring Demand Patterns of a Ride-Sourcing Service using Spatial and Temporal Clustering

A framework to integrate mode choice in the design of mobility-on-demand systems

T-share: A Large-Scale Dynamic Taxi Ridesharing Service

To Share or Not To Share: Investigating the Social Aspects of Dynamic Ridesharing. Transportation Research Record

Optimal assignment and incentive design in the taxi group ride problem

Just a better taxi? A survey-based comparison of taxis, transit, and ridesourcing services in San Francisco

Quantifying the benefits of vehicle pooling with shareability networks

NYC Taxi & Ride-hailing Stats Dashboard

Shared ride services in north america: definitions, impacts, and the future of pooling

Real-time city-scale ridesharing via linear assignment problems

Scaling Law of Urban Ride Sharing

Carpooling: Who, How and Why

Ridesourcing systems: A framework and review

A Pickup and Delivery Problem for Ridesharing Considering Congestion

Performance analysis and fleet requirements of automated demand-responsive transport systems as an urban public transport service

Crowding valuation in urban tram and bus transportation based on smart card data

Mobility Sharing as a Preference Matching Problem

This work was supported by the CriticalMaaS project [grant number 804469], which is financed by the European Research Council and the Amsterdam Institute for Advanced Metropolitan Solutions. A preliminary version of this paper was presented at the 2020 Transportation Research Board Annual Meeting in Washington D.C.

The authors declare that there are no conflicts of interest in relation to this work.

The authors confirm contribution to the paper as follows: study conception and design: de Ruijter, Cats; analysis and interpretation of results: de Ruijter, Cats; draft manuscript preparation: de Ruijter, Cats. All authors reviewed the results and approved the final version of the manuscript.