当前位置:网站首页>Didi - dispatching
Didi - dispatching
2022-07-25 07:55:00 【Anywhere --- little Yuning】
Here's the catalog title
- 1 2017 - 《A Taxi Order Dispatch Model based On Combinatorial Optimization》
- 2 2018 - 《Large‑Scale Order Dispatch in On‑Demand Ride‑Hailing Platforms: A Learning and Planning Approach》
- 3 2019 - 《A Deep Value-networkBased Approach for Multi-Driver Order Dispatching》
- 4 2019 - 《Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management》
1 2017 - 《A Taxi Order Dispatch Model based On Combinatorial Optimization》
A summary of the paper :
Dispathing system Distribution system : That is, many to many drivers and passengers match , Make drivers make more money , Higher passenger satisfaction .
- The traditional way is : For each order , Maximize the driver's order taking rate . shortcoming : The overall success rate may not be high .
- The new method : The order splitting model based on combinatorial optimization , Maximize the overall success rate of receiving orders , Maximize overall efficiency , To enhance the user experience .
The details of the algorithm are as follows :
- Optimize the overall transaction rate when splitting orders
The mathematical form of the model is :
among ,max(E) For the optimization goal of the whole model , That is, the transaction rate ;g(a)≤0 For the constraints that the model must meet , Here may be some business rules , For example, a driver can only assign one order at a time ;a Is the solution of the model , That is, how to allocate the overall order and the overall driver .
- Logistics Regression The model calculates the driver acceptance probability
According to the value of the order 、 Pick up distance 、 The angle between directions 、 Characteristics such as driving direction , Use logistics regression Model to calculate the probability of the driver accepting the order .![[ picture ]](/img/34/f0f58033214d38d45e93ef820c69ce.png)
Because an order will be sent to m A driver , So the first i The probability of closing an order is :![[ picture ]](/img/e0/66056ff3699fabf99420b55b3b4cff.png)
In this way, the whole combinatorial optimization model is :
among N Is the current total number of orders ,M It is the number of drivers notified in one order . What is optimized is the sum of the success rates of all orders at the current moment .![[ picture ]](/img/de/4aa8838ed22581acb07113c3ea451b.png)
2 2018 - 《Large‑Scale Order Dispatch in On‑Demand Ride‑Hailing Platforms: A Learning and Planning Approach》
- traditional method : Focus on immediate passenger satisfaction .
- The new method : Pay more attention to resource utilization and user experience in the overall and long-term perspective .
The overall architecture :
- Offline training value function , Reinforcement learning is used here ,MAP Method of state space transition
- Online reasoning is based on current benefits , And the comprehensive judgment of income in the future

3 2019 - 《A Deep Value-networkBased Approach for Multi-Driver Order Dispatching》
- This article is based on the previous article , Deep reinforcement learning is used + Semi Markov decision process ( With time ductility ).
- Add situational features to the state space , In the last paper , State space is only related to time and place . The current revision is : state ( Time 、 place 、 Situational features - Hot spot , Cold zone, etc ).
- Add delay decreasing information in the update of value function
![[ picture ]](/img/eb/4c8427061564b49061f513c311ca2c.png)
- And different cities use transfer learning , Cities with less data are more friendly , Don't go from 0 Start .
4 2019 - 《Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management》
Multi agent , Reinforcement learning
It's Didi's new play , Added the concept of team , This team is a team , Between teams , It's competition . On the one hand, drivers won't be bored , Between team members , There will be a sense of belonging . On the other hand , The competitive relationship between teams , It will stimulate the driver's enthusiasm for work . It also uses the method of reinforcement learning . Didn't take a close look at , Little correlation .
边栏推荐
- 查看电脑重启次数、原因
- P1049 [NOIP2001 普及组 T4] 装箱问题
- File details
- A queue of two stacks
- Recommend 7 open source projects of yyds this week
- Nano data, football data, football match scores, sports data API, Qatar world cup
- Problems in deep learning training and testing: error: the following arguments are required: --dataroot, solution: the configuration method of training files and test files
- While (~scanf ("%d", & n)) is equivalent to while (scanf ("%d", & n)! =eof)
- Room database migration
- eval与assert一句话木马分析
猜你喜欢
P1086 [noip2004 popularization group question 2] peanut picking

查看电脑重启次数、原因
![[unity introduction program] basic concept - preform prefab](/img/c6/aac7bffdf99073978f9b2f8ff15fbb.png)
[unity introduction program] basic concept - preform prefab

eval与assert一句话木马分析

Science: listening to music can really relieve pain. Chinese scientists reveal the neural mechanism behind it

Recommend 7 open source projects of yyds this week

深度学习之快速实现数据集增强的方法

Gather the wisdom of developers and consolidate the foundation of the database industry
P1086 [NOIP2004 普及组第二题] 花生采摘

Native form submission data
随机推荐
Talk about programmers learning English again
[QNX hypervisor 2.2 user manual]9.3 CPU
Analysis of difficulties in diagramscene project
大佬秋招面经
使用CycleGAN训练自己制作的数据集,通俗教程,快速上手
The 17th revitalization cup computer programmer (cloud computing platform operation and development) finals
【FFmpeg】mp4转yuv
深度学习训练和测试时出现问题:error: the following arguments are required: --dataroot,解决:训练文件的配置方法和测试文件的配置方法
【论文笔记】Progressive Layered Extraction (PLE): A Novel Multi-Task Learning (MTL) Model for Personalized
UNIPRO multi terminal deployment to meet customers' diversified needs
The value of integer a after bitwise negation (~) is - (a+1)
Problems in deep learning training and testing: error: the following arguments are required: --dataroot, solution: the configuration method of training files and test files
【Unity入门计划】基本概念-2D碰撞体Collider 2D
MathWorks has been in China for 15 years. What are the secrets of continuous innovation?
P1047 [noip2005 popularization group t2] tree outside the school gate
7.24 simulation summary
P1048 [noip2005 popularization group t3] drug collection
[paper notes] effective CNN architecture design guided by visualization
轮询、中断、DMA和通道
J1 常用的DOS命令(P25)