Invention Grant
- Patent Title: Reinforcement learning method for driver incentives: generative adversarial network for driver-system interactions
-
Application No.: US17618864Application Date: 2019-06-14
-
Publication No.: US11861643B2Publication Date: 2024-01-02
- Inventor: Wenjie Shang , Qingyang Li , Zhiwei Qin , Yiping Meng , Yang Yu , Jieping Ye
- Applicant: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
- Applicant Address: CN Beijing
- Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
- Current Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
- Current Assignee Address: CN Beijing
- Agency: METIS IP LLC
- International Application: PCT/CN2019/091255 2019.06.14
- International Announcement: WO2020/248223A 2020.12.17
- Date entered country: 2021-12-13
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06Q50/30 ; G06Q30/0211 ; G06Q30/0208 ; G06Q30/0207

Abstract:
A system and method of determining a policy to prevent fading drivers is described. The system and method creates virtual trajectories of incentives such as coupons offered to drivers in a transportation hailing system and corresponding states of drivers in response to the incentives. A joint policy simulator is created from an incentive policy, a confounding incentive policy, and an incentive object policy to generate the simulated actions of drivers in response to different incentives. The rewards of the simulated actions of the drivers is determined by a discriminator. The incentive policy for preventing fading drivers is optimized by reinforcement learning based on the virtual trajectories generated by the joint policy simulator and discriminator.
Public/Granted literature
Information query