Journal of Modern Power Systems and Clean Energy

ISSN 2196-5625 CN 32-1884/TK

网刊加载中。。。

使用Chrome浏览器效果最佳,继续浏览,你可能不会看到最佳的展示效果,

确定继续浏览么?

复制成功,请在其他浏览器进行阅读

Charging Pricing for Autonomous Mobility-on-demand Fleets Based on Game Theory  PDF

  • Jiawei Wang 1
  • Graduate (Student Member, IEEE)
  • Yujie Sheng 1 (Student Member, IEEE)
  • Huaichang Ge 2
  • Xiang Bai 2
  • Jia Su 2
  • Qinglai Guo 1 (Fellow, IEEE)
  • Hongbin Sun 1 (Fellow, IEEE)
1. Department of Electrical Engineering, Tsinghua University, Beijing100084, China; 2. Shanxi Energy Internet Research Institute, Taiyuan030032, China

Updated:2024-12-18

DOI:10.35833/MPCE.2024.000139

  • Full Text
  • Figs & Tabs
  • References
  • Authors
  • About
CITE
OUTLINE

Abstract

Considering the enormous potential application of autonomous mobility-on-demand (AMoD) systems in future urban transportation, the charging behavior of AMoD fleets, as a key link connecting the power system and the transportation system, needs to be guided by a reasonable charging demand management method. This paper uses game theory to investigate charging pricing methods for the AMoD fleets. Firstly, an AMoD fleet scheduling model with appropriate scale and mathematical complexity is established to describe the spatio-temporal action patterns of the AMoD fleet. Subsequently, using Stackelberg game and Nash bargaining, two game frameworks, i.e., non-cooperative and cooperative, are designed for the charging station operator (CSO) and the AMoD fleet. Then, the interaction trends between the two entities and the mechanism of charging price formation are discussed, along with an analysis of the game implications for breaking the non-cooperative dilemma and moving towards cooperation. Finally, numerical experiments based on real-world city-scale data are provided to validate the designed game frameworks. The results show that the spatio-temporal distribution of charging prices can be captured and utilized by the AMoD fleet. The CSO can then use this action pattern to determine charging prices to optimize the profit. Based on this, negotiated bargaining improves the overall benefits for stakeholders in urban transportation.

1. Introduction

THE operational practices of companies such as Uber, Lyft, and Didi in the real world have proven that, compared with private cars and traditional taxis, shared mobility systems can more effectively meet urban transportation demands and significantly improve the utilization of vehicles, roads, and parking facilities [

1]. Building on shared mobility, technological breakthroughs in areas such as Internet of Vehicles and autonomous driving further provide powerful momentum for the transformation of transportation modes, giving rise to the concept of autonomous mobility-on-demand (AMoD) [2]. The AMoD system is considered as a highly transformative and promising solution to efficient transportation. Evaluation results from existing research show that in the same conditions of transportation demands and environment, one AMoD vehicle can replace seven private cars [3]. From the perspective of urban transportation, deploying AMoD systems based on electric vehicles (EVs) will significantly reduce the demand for public parking and decrease environmental pollution and carbon emissions. Currently, the AMoD mode has received extensive attention from various companies such as RideOS [4] and Zoox [5] in the United States, EasyMile [6] in France, and more. It can be imagined that AMoD may become a major transportation mode in future cities. Privately owned cars and traditional taxis rely on decentralized decision-making by drivers, while the AMoD fleet, with the information aggregation and centralized dispatching of the platform, can more effectively coordinate their routing and charging schedules driven by profit [1]. As a result, the randomness in the operational patterns of fleets is greatly reduced, demonstrating stronger rationality in decision-making.

The characteristics of an AMoD fleet that is centrally dispatched by a platform are well recognized by researchers in the field. Many studies have explored control methods for optimizing the operation of AMoD fleets, with a focus on coordinating passenger pickups, navigation to destinations, the movement of empty vehicles (in preparation for future actions), and their relationships [

7]. Vehicle routing problem (VRP) in the field of operations is a classic framework for planning vehicle routes, making it suitable for the vehicle-centric AMoD dispatching model [8]. In such methods, vehicle routing is typically represented by binary variables, with a value of 1 when the vehicle travels a road in the network [9]. For large-scale fleets at the city level, the scale of VRP, which is an NP-hard problem, poses significant challenges to exact optimization. Therefore, the focus in the field of operations is mainly on the development of heuristic rules and algorithms. Considering the scale of the problem, network flow models have been proposed as an approximation method. In these methods, both the fleet and customer orders are modeled as flows in the network, which are continuous variables that need to satisfy the constraints such as consistency and flow conservation [10]. The network flow model replaces “vehicles” with “flows” and continuous variables instead of discrete variables. While it may be challenging to precisely obtain the route of each vehicle directly from network flows, it significantly reduces the scale of the problem, thus providing a foundation for city-level fleet scheduling [1]. A substantial amount of related work has been developed based on the network flow model such as congestion-aware AMoD system scheduling [11], intermodal transportation systems with AMoD [12], and model predictive control for AMoD systems [13], [14]. Despite different topics, the fundamental paradigm of these studies is to establish an extended traffic network, representing various types of vehicle behaviors with “flows” in the network, and then solving the corresponding optimization or control problems. In addition to optimization and control methods, there have also been the studies that use reinforcement learning to schedule AMoD fleets [15], [16]. The above studies have made important contributions to optimizing the operation of AMoD systems, but they overlook charging decisions and give little consideration to the interaction between the AMoD fleet and the charging station operator (CSO). The rational behavior exhibited by centrally managed large-scale fleets will have a significant impact on the operation of charging stations, enabling AMoD fleet managers to enter into a game with CSOs. Such commercial interactions between large entities are almost impossible in the current scenario, where the dominant players in the transportation system are fragmented and uncoordinated EVs.

Due to the vast potential of AMoD systems in future urban transportation, the charging schedule for electric AMoD vehicles will be a crucial link connecting the power system and the transportation system [

17]. Currently, some studies related to AMoD explicitly involve the interdependency between the transportation and power sectors, highlighting the crucial role of charging behavior in connecting these two systems. For instance, building upon the framework of AMoD system dispatch, several studies are dedicated to coordinating the optimization between the AMoD system and the power system [17], [18]. Some studies further consider additional objectives such as integrating renewable energy sources and reducing carbon emissions [19]. However, the focus of such studies remains on scheduling models and algorithms, which often consider the local marginal prices at power system nodes as model parameters or assume a centralized entity to coordinate the optimization between the AMoD system and power system. Such methods overlook the significant role of CSOs as independent entities and disregard the fundamental role of charging service prices as perceivable information for the AMoD fleet. To the best of the authors’ knowledge, very few studies explicitly recognize the independent entity status of charging stations. Among them, some studies consider charging station planning for AMoD fleets [20], [21], while others deal with the competition among multiple CSOs in the face of AMoD fleets [3]. However, the focus of this paper differs from these studies, as both charging station planning and competition among charging stations fall beyond the scope of this paper.

Real-world experience illustrates that the power system does not attempt direct coordination with vehicles in the transportation system. It is the CSOs who actively engage in demand-side response and participate in bidding within the electricity market [

22]. CSOs must initially consider the mutual impact between themselves and the entities within the transportation system, which forms the foundation for aggregating resources and interacts with the power system. Therefore, the interaction between the CSO and the AMoD fleet, often overlooked in existing studies, deserves emphasis and attention. Moreover, the emphasized interaction in this paper does not solely refer to the physical interaction between charging stations and AMoD vehicles; rather, it focuses on the interaction between CSO and AMoD fleet manager as decision-makers. The charging price, a crucial variable impacting the interests of both decision-makers, inevitably becomes the core of their interaction. Thus, this paper focuses on the patterns of conflict or cooperation that emerge between these two major decision-makers surrounding the charging price. Both the AMoD fleet and the CSO are not only backgrounds or boundaries for each other, but rational entities. Starting from this perspective, this paper aims to present distinct findings and insights compared with existing studies.

1) In contrast to studies focused on enhancing AMoD system performance through advanced dispatch control [

7]-[16], this paper does not confine itself to traffic characteristics. Instead, it considers the charging behavior and energy characteristics of the AMoD fleet. Moreover, this paper thoroughly investigates the interaction between the AMoD fleet and the CSO.

2) In contrast to studies that collaboratively optimize the AMoD system and the power system [

17]-[19], this paper explicitly delineates the independent decision-making status of the CSO. The interaction with the AMoD fleet and its impact come from the charging price signals provided by the CSO, aligning more closely with realistic operational situations.

3) In contrast to existing studies on charging pricing from the perspective of CSOs [

22]-[26], this paper considers the price-based interaction between the CSO and the AMoD fleet as a novel object of investigation. On the one hand, the rational spatio-temporal action patterns of the AMoD fleet cannot be adequately described by traditional traffic assignment models. On the other hand, the rationality exhibited by the centrally managed large-scale fleet qualifies it for equal interaction (e.g., negotiation) with the CSO, thereby expanding the mechanisms behind the charging pricing.

In summary, the interaction between the CSO and the AMoD fleet, as two major commercial entities in future urban transportation, will play a pivotal role. Their interaction will form the foundation for future interactions between the power and transportation systems, accompanied by a distinctly different business model from existing charging stations. Currently, the potential and effects of the interaction framework have not been fully discussed. Game theory, as a theory analyzing optimal decisions and strategic interactions among multiple players, has been widely applied in fields involving multi-player decision-making such as charging pricing [

25]-[27]. Therefore, in discussing the interaction between the CSO and the AMoD fleet, game theory naturally becomes a powerful tool.

From the perspective of game theory, this paper designs cooperative and non-cooperative game frameworks for the CSO and the AMoD fleet and compares their implications and characteristics. The main contributions of this paper are summarized as follows.

1) An AMoD fleet scheduling model with appropriate scale and mathematical complexity is established to describe the spatio-temporal action patterns of the fleet, laying the foundation for the interaction between charging stations and the AMoD platform.

2) The non-cooperative pricing strategies of CSOs facing the AMoD fleet are analyzed using Stackelberg game framework, capturing the fundamental interaction trends between the two entities. A solution method for the sub-game perfect Nash equilibrium is given based on the characteristics of the problem.

3) The possible cooperative interaction mode between the CSO and the AMoD fleet is discussed based on Nash bargaining, taking non-cooperation as a baseline. The mechanism of charging price formation is expanded. From the inherent meaning of the game problem itself, the rationality of breaking the non-cooperative dilemma to achieve cooperation and the natural resistance of participants to betrayal are clarified.

4) The implications and properties of the two frameworks using Stackelberg game and Nash bargaining are compared, and numerical experiments based on real-world city-scale data are provided to verify these results.

The remainder of this paper is organized as follows. Section II introduces the decision modeling for the AMoD fleet and the CSO. Section III presents two game frameworks designed for charging pricing, i.e., non-cooperative and cooperative. Numerical experiments based on real data from Chengdu, China are conducted in Section IV, and Section V concludes this paper.

II. Decision Modeling for AMoD Fleet and CSO

A. AMoD Fleet Scheduling

1) Augmented Time-Space-Energy Network

Inspired by [

17], [28], this paper presents a scheduling model for the AMoD fleet based on multi-commodity flow problem in an augmented time-space-energy network. From network flow perspective, the decision of a certain number of homogeneous vehicles to move from one location to another is described as a flow behavior in the dimensions of time, space, and energy. The vehicle flow in the model is relaxed as continuous variables, referred to as “fluidic relaxation” [29]. Although the approximate nature of the network flow model may not be suitable for developing real-time controllers for AMoD systems, it is sufficient for evaluating the spatio-temporal action patterns of the AMoD fleet when interacting with charging stations [18].

The construction of an augmented network relies on the discretization of time and energy (or state of charge (SOC)). Nodes and edges in the network will contain information about time, location, and energy, enabling stronger decision modeling capabilities. Figure 1, which uses a four-node traffic network as an example, consists of three regular nodes and one charging station node. In the corresponding augmented network, the traffic network is expanded along the time and SOC dimensions, and each augmented node is a triplet of location, time, and SOC. Therefore, the edges between nodes can simultaneously represent movement in space, progression in time, and changes in SOC.

Fig. 1  Schematic diagram of an augmented time-space-energy network. (a) Augmented time-space-energy network. (b) Traffic network topology.

There are four types of edges in the augmented network.

1) Service edge: a service edge indicates a certain number of vehicles carrying passengers, completing the process of transporting passengers from the original location to the destination. Hence, it involves a change in location, a passage of time, and a decrease in SOC.

2) Charging edge: a charging edge represents a certain number of vehicles going through the process of charging at a charging station. Thus, the starting and ending locations of a charging edge are the same, and only a passage of time and an increase in SOC occur.

3) Rebalancing edge: a rebalancing edge represents a certain number of vehicles without passengers transferring from one location to another, expressing a change in location, a passage of time, and a decrease in SOC.

4) Stopping edge: a stopping edge signifies a certain number of vehicles staying in place, waiting for the next action. Thus, there are no changes in location or SOC, only the passage of time.

In addition to the flow within the network, to ensure formal flow conservation, the external injection and outflow of each node need to be defined.

1) External injection: external injection only occurs at nodes in the initial time slot of the augmented time-space-energy network. It essentially specifies the initial location and SOC of each vehicle. Since flow cannot be generated out of nowhere, the corresponding amount of flow needs to come into the network from these nodes.

2) External outflow: external outflow only occurs at nodes in the final time slot of the augmented time-space-energy network. It describes the location and SOC of each vehicle at the end of the schedule. Since nodes are not allowed to store flow, flow will go out of the network from these nodes.

2) Mathematical Model

The AMoD fleet scheduling model based on network flow is established in an augmented time-space-energy network. The symbols related to the augmented network include nloc, ntime, nsoc, di, iori,loc, iori,time, iori,soc, ides,loc, ides,time, and ides,soc. It should be noted that time and SOC are both discretely segmented. The sets used in the model include 𝒩, ser, re, cha, stop, , 𝒯, and 𝒢. The decision variables of the model include xire, xicha, xiser, xistop, xori,des,tdrop, en, and pf,tEV. And the parameters of the model include Δt, πf,tCSO (from the perspective of the AMoD fleet, it is a parameter), λdist, rin, oori,des,t, sn, Vg, c0, Cfmax, Pcha, and tmax. Among them, sn provides the initial condition of the system at t=0, as calculated by (1).

sn=Vg    nloc=g,ntime=0,nsoc=c00      otherwise    n𝒩 (1)

The basic formulation of the AMoD fleet scheduling model is as follows, where RAMoD is the net revenue of the AMoD fleet.

max RAMoD=-Δtft𝒯πf,tCSOpf,tEV-λdistiserxiserdi-λdistirexiredi+riniserxiserdi (2)

s.t.

xire,xicha,xiser,xistop,xori,des,tdrop,en0 (3)
iseriori,loc=oriides,loc=desiori,time=txiser=oori,des,t-xori,des,tdrop    ori,des𝒢,t𝒯 (4)
typeser,re,cha,stopitypeiori=nxitype-itypeides=nxitype=sn-en    n𝒩 (5)
Cfmaxpf,tEV=Pchaichaiori,loc=fiori,time=txicha    f,t𝒯 (6)
en=0    ntime<tmax or nsoc<c0,n𝒩 (7)
n𝒩en=n𝒩sn (8)

In the objective function (2), the first term represents the charging cost, the second term represents the distance cost caused by order services, the third term represents the distance cost caused by rebalancing, and the fourth term represents the income from order services.

Constraint (3) specifies that the decision variables are non-negative. Constraint (4) describes the behavior of meeting passenger orders. Passenger demand at time slot t from origin “ori” to destination “des” corresponds to service edges in the augmented network. The total service flow of the AMoD fleet on the corresponding edges is less than or equal to the passenger demand oori,des,t, and the difference between them represents the abandoned passenger demand xori,des,tdrop. Constraint (5) describes the flow conservation of the nodes in the augmented network. Constraint (6) limits the charging power within the maximum capacity. Constraint (7) specifies that the external outflow from nodes in the augmented network that have not reached the last time slot tmax and have insufficient SOC c0 is 0. Constraint (8) ensures that the sum of all external outflows is equal to the initial total number of vehicles. Combined constraints (7) and (8) require that all flows must flow out of a node with SOC of at least c0 in the last time slot tmax. This physically implies that all vehicles should maintain a battery level of at least c0 at the end of the schedule.

It should be noted that given the charging prices πf,tCSO, the objective function and all constraints of the AMoD scheduling model are linear, and the model can be formulated as a linear programming (LP). In subsequent research on charging pricing strategies based on game theory, the decisions of the CSO will impact the behavior of the AMoD fleet through πf,tCSO.

B. Charging Pricing

From a practical perspective, for the CSO, pricing decisions are often divided into several discrete levels [

22]. Based on historical experience and any available data, the CSO can provide a general range for charging prices and discretize them into multiple levels from high to low. The time scale of charging pricing is generally not overly detailed and may be different from that of AMoD fleet scheduling. In addition, the cost of electricity procurement is determined by the electricity prices of the distribution system and the charging demands, and this paper assumes that the CSO is a price taker for the electricity prices of distribution system.

The model uses sets and 𝒯 with the same meaning as the AMoD scheduling model. 𝒯price𝒯 represents the time slots in 𝒯 where pricing decisions need to be made, while the prices in other time slots remain unchanged and consistent with them.

The decision variables of the model include πf,tCSO and θf,t. The parameters of the model include Δt, αf,t, pf,tEV (from the perspective of the CSO, it is a parameter), Δπf,t, and tprice.

The basic description of the pricing model for the CSO is as follows, where RCSO is the net revenue of the CSO.

max RCSO=t𝒯Δtfπf,tCSO-αf,tpf,tEV (9)

s.t.

πf,tCSO=αf,t+θf,tΔπf,t    f,t𝒯 (10)
θf,t=θf,t+τ    τ=1,2,,tprice-1,f,t𝒯price (11)
θf,t0,1,2    f,t𝒯 (12)

The objective function (9) represents the revenue of the CSO after subtracting the electricity fees paid to the power system.

Constraint (10) calculates the value of the charging price, which represents the pricing decision made by the CSO based on the electricity price from the distribution system. The adjustable part is the additional fees set by the CSO such as service charges. Constraint (11) specifies the time scale for pricing, indicating that the price cannot change for a continuous period of tprice multiplied by Δt. Constraint (12) discretizes the charging pricing into three levels: high, medium, and low, for each time slot.

It should be noted that pf,tEV is determined by the behavior of the AMoD fleet, and according to the AMoD fleet scheduling model, pf,tEV is influenced by πf,tCSO.

III. Game Frameworks

A. Non-cooperative Game Framework

Regardless of the type of object it is facing, pricing-related studies on charging stations inevitably involve the process of modeling the service objects of the charging stations. The decision-making process for pricing requires taking into account the behavior of the service objects. Therefore, Stackelberg game naturally becomes an effective tool to adapt to this characteristic [

25], [26], [30]-[33]. Considering that price signals can be quickly and widely disseminated through information platforms, it is reasonable to establish a sequential order of charging pricing and AMoD fleet scheduling, making Stackelberg game framework applicable.

As shown in Fig. 2, the basic elements of the game are described as follows.

Fig. 2  Illustration of Stackelberg game between CSO and AMoD fleet.

1) Players: the AMoD fleet is the follower, and the CSO is the leader.

2) Payoffs: the payoff functions for players are (2) and (9), respectively.

3) Strategy sets: the strategy sets for players are the feasible dispatch region of the AMoD fleet determined by (3)-(8) and the feasible pricing region of the CSO determined by (10)-(12). It should be noted that the two strategy sets are not coupled. In other words, regardless of the opponent’s decision, their respective feasible regions do not change.

The key to solving the sub-game perfect Nash equilibrium lies in representing the optimal response function of the AMoD fleet under various charging prices. Let xire,xicha,xiser,xistop,xori,des,tdrop,en,pf,tEV be the decision variable vector x. The scheduling problem of the AMoD fleet and its dual problem can be restated in the following compact form.

maxcTxs.t.  A1x=b1:v      A2xb2:w      x0 Dual min b1Tv+b2Tws.t.  A1Tv+A2Twc       w0 (13)

where vector ccontains the influence of charging pricing πf,tCSO; A1,A2,b1,b2 are all constant matrices; and v and w are dual variables.

It has been noticed that the optimality condition for optimization problems can be determined using the Karush-Kuhn-Tucker (KKT) condition. For the problems with good properties, the KKT condition can be both a necessary and sufficient condition for the optimal solution [

22]. However, for problem (13), both the inequality constraint A2xb2 and the non-negativity constraint x0 require the corresponding complementary constraints to be introduced. This is especially important considering the high dimensionality of x in the AMoD fleet scheduling problem, as too many complementary constraints will burden the computation.

For LP, it can be proven that the primal-dual condition is also both sufficient and necessary for optimality. The primal-dual condition replaces complementary slackness with strong duality, thus overcoming the challenges of dealing with too many complementary slackness conditions.

A1x=b1A2xb2x0A1Tv+A2Twcw0cTx=b1Tv+b2Tw (14)

Backtracking to the charging pricing stage, the equivalent optimization problem is described as:

max RCSO=t𝒯Δtfπf,tCSO-αf,tpf,tEVs.t.  Charging pricing constraints (10)-(12)       AMoD optimality condition (14) (15)

In the objective function (9) and the strong duality condition cTx=b1Tv+b2Tw, there are non-linear terms resulting from the multiplication of charging price πf,tCSO with other decision variables. Since constraint (10) discretizes the charging price, it actually introduces a product of integer variables and continuous variables, which can be linearized using Big-M method. Therefore, the optimization problem (15) is essentially mixed-integer linear programming (MILP) and can be solved by mature commercial solvers.

B. Cooperative Game Framework

The AMoD mode will bring about centralized rationality in large-scale fleets, implying that EVs may no longer be only charging price takers but rather qualify for equal interaction (e.g., cooperation and negotiation) with CSOs. The rational decision-making exhibited by a centralized dispatching fleet makes it possible for the CSO and the AMoD manager to potentially determine charging prices through friendly negotiations, which is almost impossible in a scenario where the transportation system is still dominated by decentralized and disordered private cars. The cooperation between the two parties would bring greater potential profits to the transportation system.

The key to breaking the non-cooperative dilemma is that when all charging stations, as available resources, cooperate with vehicles, the order acceptance and route arrangement of the AMoD fleet could be further optimized, reducing inefficient movements and facilitating the completion of more passenger orders, thus increasing the total revenue. The non-cooperative equilibrium is an inefficient equilibrium. From the perspective of future urban transportation, negotiations and cooperation between these two types of stakeholders may be more likely to occur.

As shown in Fig. 3, this paper uses Nash bargaining as a negotiation tool, which provides a cooperative game solution that satisfies the properties of Pareto optimality, symmetry, independence of irrelevant alternatives, and affine transformation invariance. The breaking point of the bargaining R0AMoD,R0CSO is the equilibrium solution of Stackelberg game described in Section III-A. In other words, if the net income given to either the AMoD fleet or the CSO in the bargaining is less than the net income in Stackelberg game, the player will not accept the cooperative result, and cooperation cannot be achieved. It is precisely because both parties have the potential to increase their profits that cooperation and negotiation have a basis.

Fig. 3  Illustration of Nash bargaining between CSO and AMoD fleet.

According to Nash bargaining, the objective is to maximize Nash product RAMoD-R0AMoDRCSO-R0CSO. It should be noted that even ignoring the issue of integer variables, the objective function is a non-convex function with bilinear terms, which will bring difficulty to the computation. By using the monotonicity property of the logarithmic function, it is possible to take the logarithm of the objective function, which converts the product into a sum of logarithms.

The final formulated Nash bargaining problem is given as:

max lnRAMoD-R0AMoD+lnRCSO-R0CSOs.t.  RAMoDR0AMoD      RCSOR0CSO      AMoD scheduling constraints (3)-(8)      Charging pricing constraints (10)-(12) (16)

Problem (16) is a mixed-integer convex optimization problem with linear constraints, or can be transformed into linear constraints using Big-M method.

In the solution of problem (16), the net benefits of both the CSO and the AMoD fleet will increase compared with the non-cooperative game framework, which is a prerequisite for cooperation between both parties. On this basis, there are still natural obstacles to betraying the Nash bargaining solution for both parties based on the principles of game theory, which is the key to overcoming the non-cooperative dilemma.

1) The pricing problem inherently involves a decision order, which creates resistance for the leader (i.e., CSO) to deviate from the Nash bargaining solution. This is because any betrayal actions by the leader will be observed by the follower (i.e., AMoD fleet), and the AMoD fleet will make the optimal response to the deviated prices. Therefore, the CSO can at most revert to the equilibrium of Stackelberg game, as this is the maximum benefit it can get in the case of non-cooperation. However, this benefit is not as high as the one obtained from the Nash bargaining solution.

2) The follower (i.e., AMoD fleet) can indeed betray and achieve higher payoffs than the Nash bargaining solution after the leader’s decision is made. However, the resistance to betraying cooperation comes from the deterrence created by the repeated nature of the game. Pricing and scheduling take place every day, and a single act of betrayal in one day will trigger punishments from the leader in subsequent rounds (e.g., tit for tat or cruel strategy) [

34]. Considering long-term benefits with a discount factor 0<δ<1, betrayal would lead to a loss in long-term benefits.

IV. Numerical Experiments

A. Data Overview and Parameter Settings

The passenger travel demand data are based on the ride-hailing order dataset of Didi in Chengdu [

35], China. After preprocessing more than 0.22 million raw data entries, some orders with inappropriate timestamps and locations too far from the city center have been removed, resulting in approximately 0.17 million retained travel orders. It is believed that this data range can describe the distribution of passenger travel demand in the city. Figure 4 shows the heat map distribution of the origin locations for passenger travel demands. From practical calculation standpoint, the area is divided into grids, each measuring approximately 5 km in length. The specified area in this paper covers a total of 5×5 grids. The CSO operates eight charging stations within specific grids, i.e., charging stations #6, #9, #10, #11, #12, #13, #19, and #22, according to their corresponding grid numbers. Each charging station within a grid can be considered an aggregate of all charging facilities within that grid.

Fig. 4  Heat map distribution of origin locations for passenger travel demands.

In addition to spatial distribution characteristics, travel orders also have temporal distribution characteristics. As shown in Fig. 5, there is less demand for nighttime travel, and the number of passenger orders increases from around 08:00 in the morning until 23:00 in the evening.

Fig. 5  Distribution of time for passenger orders.

Assuming that within the 5×5 grids, the power load of each grid has different electricity prices, reflecting the differential characteristics of time and space. The electricity price data shown in Fig. 6 is sourced from the electricity market in [

36]. Each grid corresponds to a randomly selected node from the data.

Fig. 6  Electricity price data in each grid.

The configuration parameters of the AMoD fleet and charging stations are shown in Table I.

TABLE I  Configuration Parameters of AMoD Fleet and Charging Stations
ParameterValue
Fleet size 5000 vehicles [13]
Vehicle range 300 km [37]
Battery capacity 76 kWh [38]
Limits for SOC 10%-90% [39]
Initial SOC of vehicles 50% [3]
Average speed 50 km/h [38]
Unit distance order revenue 0.22 $/km
Unit distance cost 0.0165 $/km [38]
Time discretization for AMoD 30 min [17]
SOC discretization for AMoD 10%
Charging power 30 kW [37]
Charging efficiency 95% [40]
Charging station capacity 15 MW
Difference between price levels 0.056 $/kWh
Time scale for charging pricing 1 hour [3]

A fleet size of 5000 vehicles represents approximately 3% of the total number of daily orders, which is a normal proportion for the AMoD system [

13], [18], [19]. The dispatch time range is from 0 to 24 hours within a day. The initial distribution of the AMoD fleet is randomly and uniformly generated.

All numerical experiments are conducted in a Python 3.9.13 environment. The optimization problems are solved using Gurobi 11.0.0.

B. Non-cooperative Scenario

The Stackelberg game problem is solved to obtain the optimal pricing strategy for the CSO, resulting in a total net revenue of $69380.11.

As shown in Fig. 7, the results demonstrate two trends. On the temporal scale, there is a higher charging load during periods of lower pricing and a lower charging load during periods of higher pricing. On the spatial scale, charging stations located in the city center and hotspots receive more visits. For example, charging stations #6 and #12 in the central area have high charging power, and charging station #22 at the northern end also receives a number of visits since many passenger orders are involved. However, unpopular and remote charging stations such as charging stations #9 and #19 have very few users. These results reflect the profit-driven behavior of the AMoD fleet, namely fully utilizing the temporal and spatial differences in charging prices.

Fig. 7  Pricing results and charging power of each charging station (Stackelberg game).

In the equilibrium solution, the total net revenue of the AMoD fleet is $84429.66. Out of the 172926 passenger orders completed in a day, 116369 are served while 56557 are abandoned, resulting in a total abandoned order value of $95517.78.

The optimal scheduling result of the AMoD fleet reflects its response to the charging pricing. Figure 8 shows the distribution of vehicle locations at different time. Throughout the entire operation, the result shows the influence of passenger travel demand. The central area of the city has abundant charging stations and high travel demand of passengers, and most passengers still have destinations in the central area, leading to a concentration of vehicle activity in the city center.

Fig. 8  Distribution of vehicle locations at different time (Stackelberg game). (a) 00:00. (b) 03:00. (c) 06:00. (d) 09:00. (e) 12:00. (f) 15:00. (g) 18:00. (h) 21:00. (i) 24:00.

Figure 9 presents the variations in the number of vehicles in different states at each time slot. Initially, due to random initial distributions and vehicles with only 50% SOC, some vehicles start charging and rebalancing to prepare for the morning peak of travel. During the operation, the trend in the number of vehicles in service closely follows the time distribution characteristics of passenger demand as shown in Fig. 5.

Fig. 9  Number of vehicles in different states at each time slot (Stackelberg game).

C. Cooperative Scenario

According to Nash bargaining, the pricing results implemented by the CSO are shown in Fig. 10. The total net revenue for the CSO is $75142.76.

Fig. 10  Pricing results and charging power of each charging station (Nash bargaining).

It can be seen that overall, the charging prices are relatively low, mostly at the middle or low level. This reflects the compromises and concessions made by the CSO to attract vehicles for charging. In response to this, the number of vehicles going to charging stations has indeed increased. Therefore, even with the trend of price reduction, the net revenue of CSO has increased.

At the same time, it is also important to note that the AMoD fleet also makes compromises. Figure 10 shows that if the AMoD fleet decides to betray, it can take full advantage of the low-price period to significantly reduce charging costs, but it refuses to do so to avoid harming its partner.

Instead, the AMoD fleet focuses its main efforts on completing more passenger orders (126459 out of a total of 172926) and achieving an increase in net profits ($91048.91).

D. Comparison and Discussion

Table II summarizes the calculation results of certain indicators for different cases, considering various possible situations for comparison.

TABLE II  Calculation Results of Certain Indicators for Different Cases
EntityIndicatorCase 1Case 2Case 3Case 4
AMoD fleet Net revenue $91623.53 $84429.66 $91048.91 $102605.02
Total service distance 916745.93 km 844173.61 km 916724.11 km 837770.23 km
Total rebalancing distance 74416.69 km 64919.63 km 74497.38 km 112375.93 km
Completed passenger order 126470 116369 126459 115889
Abandoned passenger order 46456 56557 46467 57037
CSO Net revenue $74574.96 $69380.11 $75142.76 $48920.00
Charging station income $96712.89 $89057.02 $97281.67 $68774.13
Fees paid to power system $22137.93 $19676.91 $22138.90 $19854.13
Total charging energy 1435.66 MWh 1272.80 MWh 1435.62 MWh 1352.40 MWh
Average charging pricing 0.050 $/kWh 0.083 $/kWh 0.057 $/kWh 0.057 $/kWh
Average charging settlement price 0.067 $/kWh 0.070 $/kWh 0.068 $/kWh 0.051 $/kWh

1) Case 1: complete central optimization. There exists a single entity that has full control over both the CSO and the AMoD fleet, making decisions with the goal of optimizing overall benefits.

2) Case 2: non-cooperative game. The CSO and the AMoD fleet consider each other’s actions as adversaries, aiming to maximize their own interests and reach Stackelberg equilibrium.

3) Case 3: cooperative bargaining. Building upon the non-cooperative baseline, the CSO and the AMoD fleet engage in joint negotiations to reach a solution that increases the benefits for both parties while maintaining relative fairness (from the perspective of Nash bargaining theory).

4) Case 4: cooperative bargaining, but the AMoD fleet betrays the negotiation agreement.

It is worth noting that in cooperative bargaining (Case 3), the AMoD fleet is the second mover, and its betrayal corresponds to Case 4. As the first mover, the CSO’s betrayal would be observed by the second mover, leading to a return to Stackelberg game (Case 2).

Firstly, complete central optimization (Case 1) determines the upper limit of the sum of net incomes for the CSO and the AMoD fleet to be $166198.49 ($91623.53+$74574.96).

In the non-cooperative solution (Case 2), even if the average charging price set by the CSO is high, the AMoD fleet is not very proactive in charging and tries to avoid high-priced periods as much as possible. As a result, the CSO does not receive a significant net profit after settlement. Due to the concerns about charging costs, the AMoD fleet must carefully plan routes, potentially abandoning a substantial number of passenger orders. With a decrease in the fulfillment rate of passenger orders, the net revenue of the AMoD fleet also experiences a decline. In summary, compared with Case 1, Case 2 demonstrates an overall efficiency loss resulting from the adversarial nature and competition, amounting to approximately 7.45%.

In the bargaining solution (Case 3), the CSO adjusts the charging prices according to the cooperation requirements. This allows the AMoD fleet to be less concerned about charging costs and further optimize their routes to serve more passengers and generate more revenue. With the increase of distance traveled, the charging demand of the AMoD fleet also increases, resulting in higher total charging energy and increased net income for the CSO. From Table II, it can be observed that even though the average charging price set by the CSO is lower, the actual settlement price is higher. This phenomenon highlights the progress of Case 3 compared with Case 2, where the charging station assists the AMoD fleet in earning additional passenger revenue, and the fleet transfers a portion of their earnings to the CSO by increasing their charging behaviors.

Meanwhile, when comparing Case 3 with Case 1, there is hardly loss in the total net revenue. There is just a slight adjustment in revenue distribution, reflecting the nature of Pareto efficiency in Nash bargaining.

Additionally, it can be observed that the negotiation cooperation between the two parties enhances the overall benefits for various stakeholders in urban transportation. In addition to increasing the net profits of the two major players, the travel demands of passengers are also better met, resulting in a win-win situation for the CSO, the AMoD fleet, and the passengers. Furthermore, increasing the charging energy during the daytime with high photovoltaic generation in the power system is beneficial for accommodating renewable energy sources.

Regarding the possibility of betrayal in the negotiation, clear conclusions can be drawn from Table II. The comparison between Cases 2 and 3 indicates that the CSO, as the first mover, has no incentive to actively betray and return to the Stackelberg equilibrium. The comparison between Cases 3 and 4 shows that the AMoD fleet, as the second mover, would indeed significantly increase net income by choosing betrayal. However, this increase in income does not come from passenger orders, but from the unjust pursuit of low-priced periods, which will seriously damage the revenue of the CSO. At the same time, from the perspective of the entire transportation system, this scenario reduces the number of completed passenger orders and considerably increases the distance of rebalancing movements. Compared with Case 1, this results in a serious efficiency loss (approximately 8.83%), even more severe than the normal non-cooperative game (Case 2).

Fortunately, the betrayal of the AMoD fleet is not rational in terms of long-term interests. Considering the repeated game played daily, a sudden betrayal on one day will trigger the cruel strategy of the CSO in the following days. Even if the AMoD fleet can earn an extra $11556.11 (from $91048.91 to $102605.02) in one day, assuming that the CSO adopts an unforgiving strategy in the subsequent days, the AMoD fleet would essentially earn $6619.25 less every day (from $91048.91 to $84429.66). As long as the AMoD fleet considers long-term interests with a not-too-small discount factor δ (0<δ<1), it will realize that betrayal is not profitable. It should be noted that these discussions are only at the theoretical level of game theory and do not consider the complexities of the real-world business environment or government regulations.

The above analysis indicates that the game frameworks designed in this paper capture the decision-making characteristics of both the CSO and the AMoD fleet. Game equilibrium helps guide and explain the operation of the future urban transportation system.

V. Conclusion

The AMoD system provides a revolutionary and promising solution for future urban transportation. The charging behavior of AMoD fleets, as a critical link connecting future power and transportation systems, requires a well-guided charging demand management method. This paper investigates the charging pricing strategy for the AMoD fleet based on game theory. Firstly, an optimization scheduling model is established to describe the spatio-temporal action patterns of the AMoD fleet. Subsequently, charging pricing strategies are studied using Stackelberg game and Nash bargaining. Then, interaction trends between the two entities and the mechanism of charging price formation are discussed, along with an analysis of the game implications for breaking out of the non-cooperative dilemma and moving towards cooperation. Finally, numerical experiments based on real-world city-scale data are provided to validate the rationality of the proposed frameworks. The results show that the AMoD fleet has profit-driven action patterns and can capture and utilize the spatio-temporal distribution of charging prices. By leveraging this behavior, the CSO can formulate optimal profit-oriented charging prices based on Stackelberg game. Furthermore, if the negotiation between the AMoD fleet and the CSO can be conducted, the net incomes of both parties can increase. The fulfillment of passenger’s travel demands and the comprehensive benefits of stakeholders in urban transportation can be enhanced.

This paper only involves a single CSO and a single AMoD fleet. In general, there may be multiple CSOs and multiple AMoD platforms in a city. In such cases, multiple CSOs may compete for the charging demands of vehicles to seize market share and earn more profits, while multiple AMoD fleets may also compete with each other due to charging capacity limitations, traffic congestion effects, etc. It may be necessary to introduce complex game architectures (such as Nash-Stackelberg-Nash game) to describe the interactions between various entities. Moreover, considering the uncertainties involved in traffic networks and passenger orders, dynamic pricing for charging stations and real-time online scheduling for AMoD fleets will be more challenging research topics.

Nomenclature

Symbol —— Definition
A. —— Parameters
αf,t —— Electricity price at which charging station f connects to distribution system in time slot t
λdist —— Cost per unit distance of vehicles
Δt —— Length of a time slot
Δπf,t —— Difference between adjacent two price levels for charging station f and time slot t
c0 —— Initial state of charge (SOC) of each vehicle
Cfmax —— The maximum charging capacity of charging station f
di —— Distance of edge i in augmented network
iori,loc,ides,loc —— Corresponding locations of original node and destination node of edge i in augmented network
iori,time,ides,time —— Corresponding time of original node and destination node of edge i in augmented network
iori,soc,ides,soc —— Corresponding SOCs of original node and destination node of edge i in augmented network
nloc,ntime,nsoc —— Location, time, and SOC of node n in augmented network
oori,des,t —— Number of passenger orders requesting movement from origin to destination during time slot t
Pcha —— Charging power for each vehicle or each charging pile
rin —— Income per unit distance of serving passenger orders
sn —— Flow injected from outside of network to node n
tmax —— Last time slot within scheduling range
tprice —— Number of time slots to keep charging price unchanged
Vg —— Number of vehicles initially parked at location g
B. —— Sets
—— Set of charging stations
𝒢 —— Set of spatial locations in traffic network
𝒩 —— Set of nodes in augmented network
𝒯 —— Set of time slots
𝒯price —— Set of time slots in 𝒯 where pricing decisions need to be made
ser,re,cha,stop —— Sets of service edges, rebalancing edges, charging edges, and stopping edges in augmented network
C. —— Variables
θf,t —— Integer variable representing level of charging price at charging station f during time slot t
πf,tCSO —— Numerical value of charging price at charging station f during time slot t
en —— External outflow from node n to outside of network
pf,tEV —— Charging power at charging station f during time slot t
xori,des,tdrop —— Abandoned flow in passenger demand from origin location to destination during time slot t
xicha —— Flow on charging edge i
xire —— Flow on rebalancing edge i
xiser —— Flow on service edge i
xistop —— Flow on stopping edge i

References

1

W. Tan, Y. Sun, Z. Ding et al., “Fleet management and charging scheduling for shared mobility-on-demand system: a systematic review,” IEEE Open Access Journal of Power and Energy, vol. 9, pp. 425-436, Oct. 2022. [Baidu Scholar] 

2

G. Zardini, N. Lanzetti, M. Pavone et al., “Analysis and control of autonomous mobility-on-demand systems,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 5, pp. 633-658, May 2022. [Baidu Scholar] 

3

Y. Lu, Y. Liang, Z. Ding et al., “Deep reinforcement learning-based charging pricing for autonomous mobility-on-demand system,” IEEE Transactions on Smart Grid, vol. 13, no. 2, pp. 1412-1426, Mar. 2022. [Baidu Scholar] 

4

Gopuff Blog. (2021, Jun.). Gopuff enters into an agreement to acquire rideOS to accelerate innovation. [Online]. Available: https://www.gopuff.com/blog/news/gopuff-acquires-rideos [Baidu Scholar] 

5

Zoox. (2023, Apr.). The future is for riders. [Online]. Available: https://www.zoox.com/ [Baidu Scholar] 

6

EasyMile. (2023, Apr.). Autonomous vehicle technology and solutions. [Online]. Available: https://easymile.com [Baidu Scholar] 

7

R. Zhang, F. Rossi, and M. Pavone, “Model predictive control of autonomous mobility-on-demand systems,” in Proceedings of 2016 IEEE International Conference on Robotics and Automation (ICRA), Stockholm, Sweden, May 2016, pp. 1382-1389. [Baidu Scholar] 

8

F. Bullo, E. Frazzoli, M. Pavone et al., “Dynamic vehicle routing for robotic systems,” Proceedings of the IEEE, vol. 99, no. 9, pp. 1482-1504, Sept. 2011. [Baidu Scholar] 

9

A. Simonetto, J. Monteil, and C. Gambella, “Real-time city-scale ridesharing via linear assignment problems,” Transportation Research Part C: Emerging Technologies, vol. 101, pp. 208-232, Apr. 2019. [Baidu Scholar] 

10

F. Rossi, R. Zhang, Y. Hindy et al., “Routing autonomous vehicles in congested transportation networks: structural properties and coordination algorithms,” Autonomous Robots, vol. 42, pp. 1427-1442, May 2018. [Baidu Scholar] 

11

S. Wollenstein-Betech, A. Houshmand, M. Salazar et al., “Congestion-aware routing and rebalancing of autonomous mobility-on-demand systems in mixed traffic,” in Proceedings of 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece, Sept. 2020, pp. 1-7. [Baidu Scholar] 

12

S. Wollenstein-Betech, M. Salazar, A. Houshmand et al., “Routing and rebalancing intermodal autonomous mobility-on-demand systems in mixed traffic,” IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 8, pp. 12263-12275, Aug. 2022. [Baidu Scholar] 

13

R. Iglesias, F. Rossi, K. Wang et al., “Data-driven model predictive control of autonomous mobility-on-demand systems,” in Proceedings of 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, May 2018, pp. 6019-6025. [Baidu Scholar] 

14

M. Tsao, D. Milojevic, C. Ruch et al., “Model predictive control of ride-sharing autonomous mobility-on-demand systems,” in Proceedings of 2019 International Conference on Robotics and Automation, Montreal, Canada, May 2019, pp. 6665-6671. [Baidu Scholar] 

15

M. Guériau and I. Dusparic, “SAMoD: shared autonomous mobility-on-demand using decentralized reinforcement learning,” in Proceedings of 2018 21st International Conference on Intelligent Transportation Systems, Maui, USA, Nov. 2018, pp. 1558-1563. [Baidu Scholar] 

16

J. Wen, J. Zhao, and P. Jaillet, “Rebalancing shared mobility-on-demand systems: a reinforcement learning approach,” in Proceedings of 2017 IEEE 20th International Conference on Intelligent Transportation Systems, Yokohama, Japan, Oct. 2017, pp. 220-225. [Baidu Scholar] 

17

F. Rossi, R. Iglesias, M. Alizadeh et al., “On the interaction between autonomous mobility-on-demand systems and the power network: Models and coordination algorithms,” IEEE Transactions on Control of Network Systems, vol. 7, no. 1, pp. 384-397, Mar. 2020. [Baidu Scholar] 

18

A. Estandia, M. Schiffer, F. Rossi et al., “On the interaction between autonomous mobility on demand systems and power distribution networks – an optimal power flow approach,” IEEE Transactions on Control of Network Systems, vol. 8, no. 3, pp. 1163-1176, Sept. 2021. [Baidu Scholar] 

19

Y. Sheng, Y. Lin, H. Zeng et al., “Emission-concerned coordinated dispatching of electrified autonomous mobility-on-demand system and power system incorporating heterogeneous spatiotemporal scales,” Sustainable Cities and Society, vol. 98, p. 104755, Nov. 2023. [Baidu Scholar] 

20

J. Gao and S. Li, “Charging autonomous electric vehicle fleet for mobility-on-demand services: plug in or swap out?” Transportation Research Part C: Emerging Technologies, vol. 158, p. 104457, Jan. 2024. [Baidu Scholar] 

21

J. Yang, M. W. Levin, L. Hu et al., “Fleet sizing and charging infrastructure design for electric autonomous mobility-on-demand systems with endogenous congestion and limited link space,” Transportation Research Part C: Emerging Technologies, vol. 152, p. 104172, Jul. 2023. [Baidu Scholar] 

22

S. Chen, S. Feng, Z. Guo et al., “Trilevel optimization model for competitive pricing of electric vehicle charging station considering distribution locational marginal price,” IEEE Transactions on Smart Grid, vol. 13, no. 6, pp. 4716-4729, Nov. 2022. [Baidu Scholar] 

23

Z. Bao, Z. Hu, D. M. Kammen et al., “Data-driven approach for analyzing spatiotemporal price elasticities of EV public charging demands based on conditional random fields,” IEEE Transactions on Smart Grid, vol. 12, no. 5, pp. 4363-4376, Sept. 2021. [Baidu Scholar] 

24

Y. Cui, Z. Hu, and X. Duan, “Optimal pricing of public electric vehicle charging stations considering operations of coupled transportation and power systems,” IEEE Transactions on Smart Grid, vol. 12, no. 4, pp. 3278-3288, Jul. 2021. [Baidu Scholar] 

25

W. Yuan, J. Huang, and Y. Zhang, “Competitive charging station pricing for plug-in electric vehicles,” IEEE Transactions on Smart Grid, vol. 8, no. 2, pp. 627-639, Mar. 2017. [Baidu Scholar] 

26

W. Lee, R. Schober, and V. W. S. Wong, “An analysis of price competition in heterogeneous electric vehicle charging stations,” IEEE Transactions on Smart Grid, vol. 10, no. 4, pp. 3990-4002, Jul. 2019. [Baidu Scholar] 

27

H. Zeng, Y. Sheng, H. Sun et al., “A conic relaxation approach for solving Stackelberg pricing game of electric vehicle charging station considering traffic equilibrium,” IEEE Transactions on Smart Grid, vol. 15, no. 3, pp. 3080-3097, May 2024. [Baidu Scholar] 

28

Z. Ding, W. Tan, W.-J. Lee et al., “Integrated operation model for autonomous mobility-on-demand fleet and battery swapping station,” IEEE Transactions on Industry Applications, vol. 57, no. 6, pp. 5593-5602, Nov.-Dec. 2021. [Baidu Scholar] 

29

M. Pavone, S. L. Smith, E. Frazzoli et al., “Robotic load balancing for mobility-on-demand systems,” The International Journal of Robotics Research, vol. 31, no. 7, pp. 839-854, Jun. 2012. [Baidu Scholar] 

30

W. Tushar, W. Saad, H. V. Poor et al., “Economics of electric vehicle charging: a game theoretic approach,” IEEE Transactions on Smart Grid, vol. 3, no. 4, pp. 1767-1778, Dec. 2012. [Baidu Scholar] 

31

K. Ma, X. Hu, J. Yang et al., “Electric taxi charging strategy based on Stackelberg game considering hotspot information,” IEEE Transactions on Vehicular Technology, vol. 71, no. 3, pp. 2427-2436, Mar. 2022. [Baidu Scholar] 

32

S.-G. Yoon, Y.-J. Choi, J.-K. Park et al., “Stackelberg-game-based demand response for at-home electric vehicle charging,” IEEE Transactions on Vehicular Technology, vol. 65, no. 6, pp. 4172-4184, Jun. 2016. [Baidu Scholar] 

33

Y. Sheng, H. Zeng, Q. Guo et al., “Impact of customer portrait information superiority on competitive pricing of EV fast-charging stations,” Applied Energy, vol. 348, p. 121412, Oct. 2023. [Baidu Scholar] 

34

D. Abreu, “On the theory of infinitely repeated games with discounting,” Econometrica, vol. 56, no. 2, pp. 383-396, Mar. 1988. [Baidu Scholar] 

35

Neardws. (2022, Oct.). Vehicular trajectories processing for Didi open data. [Online]. Available: https://github.com/neardws/Vehicular-Trajectories-Processing-for-Didi-Open-Data [Baidu Scholar] 

36

PJM. (2020, May). Data Miner 2. [Online]. Available: https://dataminer2.pjm.com/list [Baidu Scholar] 

37

Y. Liang, Z. Ding, T. Ding et al., “Mobility-aware charging scheduling for shared on-demand electric vehicle fleet using deep reinforcement learning,” IEEE Transactions on Smart Grid, vol. 12, no. 2, pp. 1380-1393, Mar. 2021. [Baidu Scholar] 

38

J. Deng, H. Hu, S. Gong et al., “Impacts of charging pricing schemes on cost-optimal logistics electric vehicle fleet operation,” Transportation Research Part D: Transport and Environment, vol. 109, p. 103333, Aug. 2022. [Baidu Scholar] 

39

J. Wang, Q. Guo, H. Sun et al., “Collaborative optimization of logistics and electricity for the mobile charging service system,” Applied Energy, vol. 336, p. 120845, Apr. 2023. [Baidu Scholar] 

40

B. Li, Y. Chen, W. Wei et al., “Routing and scheduling of electric buses for resilient restoration of distribution system,” IEEE Transactions on Transportation Electrification, vol. 7, no. 4, pp. 2414-2428, Dec. 2021. [Baidu Scholar]