Publication No.: US20210142005A1
Publication Date: 2021-05-13
Application No.: US17094278
Application Date: 2020-11-10
Applicant: SRI International
Inventor: Natarajan Shankar, Stephane Graham-Lengrand, Daniel Elenius, Chih-hung Yeh
IPC: G06F40/211, G06F40/253, G06F40/279, G06N7/00, G06N20/20
Abstract: In general, the disclosure describes techniques for machine learning for translation to structured computer readable representation. An example method to generate a training set for a natural language translation model includes receiving, by a computing system, a grammar comprising rules, one or more of the rules being associated with random biases; generating, by the computing system, at least one of random trees or random graphs based on the random biases in the grammar; for each of the random trees or random graphs, by the computing system, generating a natural language sample; and generating, by the computing system, the training set with the random trees or random graphs and the corresponding natural language samples.
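The training-set generation summarized in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the patent: the toy grammar, the template-based rendering, the depth limit, and every name in it (RULES, assign_random_biases, random_tree, render, make_training_set) are assumptions made for illustration only. The sketch attaches a random bias to each grammar rule, samples random trees according to those biases, renders each tree both as a structured expression and as an English sentence, and collects the pairs as a training set.

```python
import random

# Hypothetical toy grammar (an assumption, not taken from the patent).
# Each nonterminal maps to rules of the form
# (child_symbols, formal_template, english_template).
RULES = {
    "Formula": [
        (["Atom"], "{0}", "{0}"),
        (["Formula", "Formula"], "(and {0} {1})", "{0} and {1}"),
        (["Formula"], "(not {0})", "it is not the case that {0}"),
    ],
    "Atom": [
        ([], "door_open", "the door is open"),
        ([], "alarm_on", "the alarm is on"),
    ],
}


def assign_random_biases(rules, rng):
    """Attach a random positive bias (sampling weight) to every rule."""
    return {
        symbol: [(rng.uniform(0.1, 1.0), rule) for rule in alternatives]
        for symbol, alternatives in rules.items()
    }


def random_tree(grammar, symbol, rng, depth=0, max_depth=4):
    """Sample a derivation tree, choosing rules in proportion to their bias."""
    options = grammar[symbol]
    if depth >= max_depth:
        # Assumed safeguard: past the depth limit, keep only the rules with
        # the fewest children so that the recursion is guaranteed to end.
        fewest = min(len(rule[0]) for _, rule in options)
        options = [(bias, rule) for bias, rule in options if len(rule[0]) == fewest]
    weights = [bias for bias, _ in options]
    _, rule = rng.choices(options, weights=weights, k=1)[0]
    children = [
        random_tree(grammar, child, rng, depth + 1, max_depth) for child in rule[0]
    ]
    return rule, children


def render(tree, slot):
    """Render a tree with template slot 1 (formal) or slot 2 (English)."""
    rule, children = tree
    parts = [render(child, slot) for child in children]
    return rule[slot].format(*parts)


def make_training_set(n, seed=0):
    """Return n (natural language sample, structured representation) pairs."""
    rng = random.Random(seed)
    grammar = assign_random_biases(RULES, rng)
    pairs = []
    for _ in range(n):
        tree = random_tree(grammar, "Formula", rng)
        pairs.append((render(tree, 2), render(tree, 1)))
    return pairs


if __name__ == "__main__":
    for english, formal in make_training_set(5, seed=42):
        print(f"{english!r} -> {formal!r}")
```

Because the biases are redrawn for each seed, different seeds favor different rule mixes (for example, deeper conjunctions versus short atoms), which is one plausible reason to randomize the biases: it varies the shape of the generated trees across the training set.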