Invention Grant
- Patent Title: Automated building of expanded datasets for training of autonomous agents
- Application No.: US16426878
- Application Date: 2019-05-30
- Publication No.: US11455494B2
- Publication Date: 2022-09-27
- Inventor: Boris Galitsky
- Applicant: Oracle International Corporation
- Applicant Address: US CA Redwood Shores
- Assignee: Oracle International Corporation
- Current Assignee: Oracle International Corporation
- Current Assignee Address: US CA Redwood Shores
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06F40/35 ; G06N20/00

Abstract:
Improved systems and methods for generating training data for classification models are disclosed. In an example, a training application accesses two fragments of text. The application represents each fragment of text as a parse thicket. The parse thickets jointly represent syntactic and discourse information. From the parse thickets, the application generalizes the text by identifying common entities or common rhetorical relations between parse thickets. The generalized text is added to a training data set, thereby increasing the coverage of the training set.
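
The generalization step described in the abstract can be illustrated with a deliberately simplified sketch. It is not the patented method: a real parse thicket is a graph combining syntactic parse trees with discourse relations, whereas the hypothetical `ToyThicket` class below reduces each text fragment to a flat set of entities and a set of rhetorical-relation triples, and approximates generalization as plain set intersection. All names (`ToyThicket`, `generalize`, `expand_training_set`) and the sample data are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyThicket:
    """Hypothetical, flattened stand-in for a parse thicket.

    entities:  entity strings found in the text fragment
    relations: (relation_name, nucleus_entity, satellite_entity) triples
    """
    entities: frozenset
    relations: frozenset


def generalize(a: ToyThicket, b: ToyThicket) -> ToyThicket:
    # Assumption: generalization is approximated by keeping only the
    # entities and rhetorical relations that both fragments share.
    return ToyThicket(a.entities & b.entities, a.relations & b.relations)


def expand_training_set(training_set, a, b, label):
    # Add the generalized fragment only if something is actually shared.
    common = generalize(a, b)
    if common.entities or common.relations:
        training_set.append((common, label))
    return training_set


# Made-up sample fragments, already reduced to the toy representation.
first = ToyThicket(
    frozenset({"account", "fee", "bank"}),
    frozenset({("Elaboration", "account", "fee")}),
)
second = ToyThicket(
    frozenset({"account", "fee", "card"}),
    frozenset({("Elaboration", "account", "fee"), ("Contrast", "card", "fee")}),
)

training = expand_training_set([], first, second, label="billing")
```

Here the generalized entry keeps the shared entities `account` and `fee` and the shared `Elaboration` relation, and drops whatever is specific to only one fragment. Set intersection was chosen only because it is the simplest operation that conveys the idea of retaining what two fragments have in common.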