Device control using policy training based on task embeddings
Abstract:
A control system for a robotic device comprising a task embedding network to receive one or more demonstrations of a task and to generate a task embedding. The task embedding comprises a representation of the task, and each demonstration comprises one or more observations of a performance of the task. The control system includes a control network to receive the task embedding from the task embedding network and to apply a policy to map a plurality of successive observations of the robotic device to respective control instructions for the robotic device. The policy applied by the control network is modulated across the plurality of successive observations of the robotic device using the task embedding from the task embedding network.
Public/Granted literature
Information query
Patent Agency Ranking
0/0