Automated building of expanded datasets for training of autonomous agents

Invention Grant

US11455494B2 Automated building of expanded datasets for training of autonomous agents 有权

Please log in to see more content

Patent Title: Automated building of expanded datasets for training of autonomous agents
Application No.: US16426878

Application Date: 2019-05-30
Publication No.: US11455494B2

Publication Date: 2022-09-27
Inventor: Boris Galitsky
Applicant: Oracle International Corporation
Applicant Address: US CA Redwood Shores
Assignee: Oracle International Corporation
Current Assignee: Oracle International Corporation
Current Assignee Address: US CA Redwood Shores
Agency: Kilpatrick Townsend & Stockton LLP
Main IPC: G06K9/62
IPC: G06K9/62 ; G06F40/35 ; G06N20/00

Automated building of expanded datasets for training of autonomous agents

Abstract:

Improved systems and methods for generating training data for classification models are disclosed. In an example, a training application accesses two fragments of text. The application represents each fragment of text as a parse thicket. The parse thickets jointly represent syntactic and discourse information. From the parse thickets, the application generalizes the text by identifying common entities or common rhetorical relations between parse thickets. The generalized text is added to a training data set, thereby increasing the coverage of the training set.

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )
G06K9/62	.应用电子设备进行识别的方法或装置