Patent search ap:("X Development LLC") AND inv:"David ANDRE" Page 1

1.

Invention publication
PERIODICALLY COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING Under examination - published

Publication (announcement) number: US20240152774A1

Publication (announcement) date: 2024-05-09

Application number: US17979964

Application date: 2022-11-03

Applicant: X Development LLC

Inventor: Lam Thanh NGUYEN, Grace Taixi BRENTANO, David ANDRE, Salil Vijaykumar PRADHAN, Gearoid MURPHY

IPC: G06N5/02, G06N5/04

CPC classification number: G06N5/022, G06N5/043

Abstract: Disclosed herein are methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for modeling agents in multi-agent systems as reinforcement learning (RL) agents and training control policies that cause the agents to cooperate towards a common goal. A method can include generating, for each of a group of simulated local agents in an agent network in which the simulated local agents share resources, information, or both, experience tuples having a state for the simulated local agent, an action taken by the simulated local agent, and a local result for the action taken, updating each local policy of each simulated local agent according to the respective local result, providing, to each of the simulated local agents, information representing a global state of the agent network, and updating each local policy of each simulated local agent according to the global state of the agent network.
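
Illustrative sketch (not part of the patent record): the four steps named in the abstract are generating experience tuples, updating each local policy from its local result, providing a global state, and updating each local policy from that global state. The Python sketch below is one way those steps might be arranged in a toy shared-resource setting. Everything in it is an assumption made for illustration and is not taken from the patent: the names LocalAgent, train, sync_every and capacity, the tabular preference update, the epsilon-greedy action choice, the reward values, and the idea that the global update happens every fixed number of steps.

```python
import random
from dataclasses import dataclass, field


@dataclass
class LocalAgent:
    """Hypothetical tabular agent; prefs maps (state, action) -> estimated value."""
    n_actions: int
    lr: float = 0.1
    prefs: dict = field(default_factory=dict)

    def act(self, state, rng, epsilon=0.1):
        # Epsilon-greedy choice over the stored preferences (assumed, not from the source).
        if rng.random() < epsilon:
            return rng.randrange(self.n_actions)
        return max(range(self.n_actions),
                   key=lambda a: self.prefs.get((state, a), 0.0))

    def update(self, state, action, signal):
        # Move the stored value a fraction lr of the way toward the signal.
        key = (state, action)
        old = self.prefs.get(key, 0.0)
        self.prefs[key] = old + self.lr * (signal - old)


def train(n_agents=3, capacity=2, steps=200, sync_every=5, seed=0):
    """Toy loop: agents share a resource with a fixed capacity (assumed setting)."""
    rng = random.Random(seed)
    agents = [LocalAgent(n_actions=2) for _ in range(n_agents)]
    history = []
    for t in range(1, steps + 1):
        state = 0  # single placeholder local state
        tuples = []
        for agent in agents:
            # Step 1: experience tuple (state, action, local result).
            action = agent.act(state, rng)
            local_result = float(action)  # taking a unit is locally rewarding
            # Step 2: update the local policy from the local result.
            agent.update(state, action, local_result)
            tuples.append((state, action, local_result))
        if t % sync_every == 0:
            # Step 3: provide information representing the global state.
            total = sum(a for _, a, _ in tuples)
            global_state = {"total_demand": total, "overloaded": total > capacity}
            # Step 4: update each local policy according to the global state.
            shared_signal = -1.0 if global_state["overloaded"] else 1.0
            for agent, (s, a, _) in zip(agents, tuples):
                agent.update(s, a, shared_signal)
        history.append(tuples)
    return agents, history


if __name__ == "__main__":
    agents, history = train()
    for i, agent in enumerate(agents):
        print(i, agent.prefs)
```

In this sketch the local updates alone would push every agent toward always taking a unit; the periodic shared signal is what penalizes joint overload. How the patent actually combines the local and global updates is not stated in the abstract.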
