Machine learning for training NLP agent

Invention Grant

US12014142B2 Machine learning for training NLP agent 有权

Please log in to see more content

Patent Title: Machine learning for training NLP agent
Application No.: US17354825

Application Date: 2021-06-22
Publication No.: US12014142B2

Publication Date: 2024-06-18
Inventor: Gary Francis Diamanti , Shikhar Kwatra , Ryan Anderson , Rodrigo Goulart Silva
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agency: CUENOT, FORSYTHE & KIM, LLC
Main IPC: G06F40/232
IPC: G06F40/232 ; G06F18/22 ; G06F40/284 ; G06N3/08 ; G06N20/00

Abstract:

A computer-implemented process for training a natural language processing (NLP) agent having a reinforced learning model includes the following operations. A type of document from a document corpus is identified using metadata particularly associated with the document. The NLP agent tokenizes the document to generate a plurality of tokens. Using a schema identified from the type of the document, one of the plurality of tokens is compared to a system of record (SOR) field from the schema. A similarity score between the one of the plurality of tokens with a correct value and a reward based upon the similarity score are generated. A determination is made that an optimum minimum average similarity rate has not been obtained. Based upon the determination, the reinforced learning model is trained using a loss function that includes the reward.

Public/Granted literature

US20220405473A1 MACHINE LEARNING FOR TRAINING NLP AGENT Public/Granted day:2022-12-22

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F40/00	处理自然语言数据（语音分析或综合，语音识别G10L）
G06F40/20	.自然语言分析（自然语言的语义分析入G06F40/30）
G06F40/232	..拼写校正，例如拼写差错程序或加元音符