-
公开(公告)号:US11790042B1
公开(公告)日:2023-10-17
申请号:US17896519
申请日:2022-08-26
Applicant: X Development LLC
Inventor: Yunfei Bai
CPC classification number: G06F18/2148 , B25J9/1671 , G06N3/045 , G06N3/08 , G06N20/00
Abstract: Mitigating the reality gap through training and utilization of at least one difference model. The difference model can be utilized to generate, for each of a plurality of instances of simulated state data generated by a robotic simulator, a corresponding instance of modified simulated state data. The difference model is trained so that a generated modified instance of simulated state data is closer to “real world data” than is a corresponding initial instance of simulated state data. Accordingly, the difference model can be utilized to mitigate the reality gap through modification of initially generated simulated state data, to make it more accurately reflect what would occur in a real environment. Moreover, the difference representation from the difference model can be used as input to the control policy to adapt the control learned from simulator to the real environment.
-
公开(公告)号:US10773382B2
公开(公告)日:2020-09-15
申请号:US15913212
申请日:2018-03-06
Applicant: X Development LLC
Inventor: Yunfei Bai , Kuan Fang , Stefan Hinterstoisser , Mrinal Kalakrishnan
IPC: B25J9/16
Abstract: Implementations are directed to training a machine learning model that, once trained, is used in performance of robotic grasping and/or other manipulation task(s) by a robot. The model can be trained using simulated training examples that are based on simulated data that is based on simulated robot(s) attempting simulated manipulations of various simulated objects. At least portions of the model can also be trained based on real training examples that are based on data from real-world physical robots attempting manipulations of various objects. The simulated training examples can be utilized to train the model to predict an output that can be utilized in a particular task—and the real training examples used to adapt at least a portion of the model to the real-world domain can be tailored to a distinct task. In some implementations, domain-adversarial similarity losses are determined during training, and utilized to regularize at least portion(s) of the model.
-
公开(公告)号:US10967505B1
公开(公告)日:2021-04-06
申请号:US16703101
申请日:2019-12-04
Applicant: X Development LLC
Inventor: Yunfei Bai
IPC: B25J9/16
Abstract: Methods and systems for modifying the inertial parameters used in a virtual robot model that simulates the interactions of a real-world robot with an environment to better reflect the actual inertial properties of the real-world robot. In one aspect, a method includes obtaining joint physical parameter measurements for the joints of a real-world robot, determining simulated joint physical parameter values for each of the joint physical parameter measurements, and adjusting an estimate of inertial properties of the real-world robot used by the virtual robot dynamic model to reduce a difference between the simulated joint physical parameter values and the corresponding joint physical parameter measurements.
-
4.
公开(公告)号:US20200376675A1
公开(公告)日:2020-12-03
申请号:US16424363
申请日:2019-05-28
Applicant: X Development LLC
Inventor: Yunfei Bai , Yuanzheng Gong
Abstract: Generating edge-depth values for an object, utilizing the edge-depth values in generating a 3D point cloud for the object, and utilizing the generated 3D point cloud for generating a 3D bounding shape (e.g., 3D bounding box) for the object. Edge-depth values for an object are depth values that are determined from frame(s) of vision data (e.g., left/right images) that captures the object, and that are determined to correspond to an edge of the object (an edge from the perspective of frame(s) of vision data). Techniques that utilize edge-depth values for an object (exclusively, or in combination with other depth values for the object) in generating 3D bounding shapes can enable accurate 3D bounding shapes to be generated for partially or fully transparent objects. Such increased accuracy 3D bounding shapes directly improve performance of a robot that utilizes the 3D bounding shapes in performing various tasks.
-
5.
公开(公告)号:US11707840B1
公开(公告)日:2023-07-25
申请号:US17535373
申请日:2021-11-24
Applicant: X Development LLC
Inventor: Yunfei Bai , Elmar Mair , Yuchen Wu , Ian Wilkes , Max Moroz , Weidan Wu
CPC classification number: B25J9/163 , B25J9/1653 , G05B13/0265 , G05B13/042
Abstract: Mitigating the reality gap through optimization of one or more simulated hardware parameters for simulated hardware components of a simulated robot. Implementations generate and store real navigation data instances that are each based on a corresponding episode of locomotion of a real robot. A real navigation data instance can include a sequence of velocity control instances generated to control a real robot during a real episode of locomotion of the real robot, and one or more ground truth values, where each of the ground truth values is a measured value of a corresponding property of the real robot (e.g., pose). The velocity control instances can be applied to a simulated robot, and one or more losses can be generated based on comparing the ground truth value(s) to corresponding simulated value(s) generated from applying the velocity control instances to the simulated robot. The simulated hardware parameters and environmental parameters can be optimized based on the loss(es).
-
6.
公开(公告)号:US11494632B1
公开(公告)日:2022-11-08
申请号:US15835357
申请日:2017-12-07
Applicant: X Development LLC
Inventor: Yunfei Bai
Abstract: Implementations are directed to generating simulated training examples for training of a machine learning model, training the machine learning model based at least in part on the simulated training examples, and/or using the trained machine learning model in control of at least one real-world physical robot. Implementations are additionally or alternatively directed to performing one or more iterations of quantifying a “reality gap” for a robotic simulator and adapting parameter(s) for the robotic simulator based on the determined reality gap. The robotic simulator with the adapted parameter(s) can further be utilized to generate simulated training examples when the reality gap of one or more iterations satisfies one or more criteria.
-
公开(公告)号:US20220297303A1
公开(公告)日:2022-09-22
申请号:US17203296
申请日:2021-03-16
Applicant: X Development LLC
Inventor: Seyed Mohammad Khansari Zadeh , Eric Jang , Daniel Lam , Daniel Kappler , Matthew Bennice , Brent Austin , Yunfei Bai , Sergey Levine , Alexander Irpan , Nicolas Sievers , Chelsea Finn
Abstract: Implementations described herein relate to training and refining robotic control policies using imitation learning techniques. A robotic control policy can be initially trained based on human demonstrations of various robotic tasks. Further, the robotic control policy can be refined based on human interventions while a robot is performing a robotic task. In some implementations, the robotic control policy may determine whether the robot will fail in performance of the robotic task, and prompt a human to intervene in performance of the robotic task. In additional or alternative implementations, a representation of the sequence of actions can be visually rendered for presentation to the human can proactively intervene in performance of the robotic task.
-
8.
公开(公告)号:US20190084151A1
公开(公告)日:2019-03-21
申请号:US15913212
申请日:2018-03-06
Applicant: X Development LLC
Inventor: Yunfei Bai , Kuan Fang , Stefan Hinterstoisser , Mrinal Kalakrishnan
IPC: B25J9/16
Abstract: Implementations are directed to training a machine learning model that, once trained, is used in performance of robotic grasping and/or other manipulation task(s) by a robot. The model can be trained using simulated training examples that are based on simulated data that is based on simulated robot(s) attempting simulated manipulations of various simulated objects. At least portions of the model can also be trained based on real training examples that are based on data from real-world physical robots attempting manipulations of various objects. The simulated training examples can be utilized to train the model to predict an output that can be utilized in a particular task—and the real training examples used to adapt at least a portion of the model to the real-world domain can be tailored to a distinct task. In some implementations, domain-adversarial similarity losses are determined during training, and utilized to regularize at least portion(s) of the model.
-
公开(公告)号:US11657527B2
公开(公告)日:2023-05-23
申请号:US16424363
申请日:2019-05-28
Applicant: X Development LLC
Inventor: Yunfei Bai , Yuanzheng Gong
IPC: H04N5/268 , H04N5/247 , H04N5/232 , H04N13/111 , H04N13/161 , H04N13/282 , B25J9/16 , G06T7/12 , G06T7/521 , G06T7/13 , G06K9/62 , G06T5/00
CPC classification number: B25J9/1697 , B25J9/1661 , G06K9/6267 , G06T5/008 , G06T7/12 , G06T7/13 , G06T7/521 , H04N5/247 , G06T2207/10024 , G06T2207/10028 , G06T2207/10048
Abstract: Generating edge-depth values for an object, utilizing the edge-depth values in generating a 3D point cloud for the object, and utilizing the generated 3D point cloud for generating a 3D bounding shape (e.g., 3D bounding box) for the object. Edge-depth values for an object are depth values that are determined from frame(s) of vision data (e.g., left/right images) that captures the object, and that are determined to correspond to an edge of the object (an edge from the perspective of frame(s) of vision data). Techniques that utilize edge-depth values for an object (exclusively, or in combination with other depth values for the object) in generating 3D bounding shapes can enable accurate 3D bounding shapes to be generated for partially or fully transparent objects. Such increased accuracy 3D bounding shapes directly improve performance of a robot that utilizes the 3D bounding shapes in performing various tasks.
-
10.
公开(公告)号:US11461589B1
公开(公告)日:2022-10-04
申请号:US16267077
申请日:2019-02-04
Applicant: X Development LLC
Inventor: Yunfei Bai
Abstract: Mitigating the reality gap through training and utilization of at least one difference model. The difference model can be utilized to generate, for each of a plurality of instances of simulated state data generated by a robotic simulator, a corresponding instance of modified simulated state data. The difference model is trained so that a generated modified instance of simulated state data is closer to “real world data” than is a corresponding initial instance of simulated state data. Accordingly, the difference model can be utilized to mitigate the reality gap through modification of initially generated simulated state data, to make it more accurately reflect what would occur in a real environment. Moreover, the difference representation from the difference model can be used as input to the control policy to adapt the control learned from simulator to the real environment.
-
-
-
-
-
-
-
-
-