Invention Grant
- Patent Title: Training sequence natural language processing engines
- Application No.: US15356494
- Application Date: 2016-11-18
- Publication No.: US10402752B2
- Publication Date: 2019-09-03
- Inventor: Marc Aurelio Ranzato, Sumit Chopra, Michael Auli, Wojciech Zaremba
- Applicant: Facebook, Inc.
- Applicant Address: Menlo Park, CA, US
- Assignee: Facebook, Inc.
- Current Assignee: Facebook, Inc.
- Current Assignee Address: Menlo Park, CA, US
- Agency: Baker Botts, L.L.P.
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06N20/00

Abstract:
A system for training a model to predict a sequence (e.g., a sequence of words) given a context is disclosed. The model is trained using a combination of two signals: individual predictions compared to the ground-truth tokens, and sequences of predictions, each conditioned on the model's previous predictions, where the resulting sequence is compared to the ground-truth sequence. The model is initially trained on the individual predictions alone. It is then further trained over the training data in multiple iterations, where each iteration applies two processes to each training element. In the first process, an initial part of the sequence is predicted token by token, and the model parameters are updated after each prediction. In the second process, the entire remainder of the sequence is predicted from the model's own output and compared to the corresponding training sequence, adjusting the model parameters to encourage or discourage each prediction.
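The two-process schedule in the abstract can be sketched as a split of each training sequence into a teacher-forced prefix (per-token updates against the ground truth) and a free-running suffix (generated from the model's own predictions and scored as a whole). A minimal illustration of that split, where the function name and the linear annealing schedule are assumptions for illustration rather than details taken from the patent claims:

```python
def split_for_mixed_training(target, iteration, step=1):
    """Split a target sequence into a teacher-forced prefix and a
    free-running suffix.

    Prefix (first process): the model predicts one token at a time and
    its parameters are updated against the ground-truth token.
    Suffix (second process): the model generates the entire remainder
    from its own predictions, and the generated sequence is compared to
    the ground truth to encourage or discourage each prediction.

    The prefix shrinks by `step` tokens per iteration, so at iteration 0
    the whole sequence is trained on individual predictions, matching
    the initial training phase described in the abstract.
    """
    prefix_len = max(0, len(target) - step * iteration)
    return target[:prefix_len], target[prefix_len:]


# Example: over successive iterations, more of the sequence is handed
# to free-running prediction.
target = ["the", "cat", "sat", "down"]
for it in range(3):
    prefix, suffix = split_for_mixed_training(target, it)
    print(it, prefix, suffix)
```

At iteration 0 the suffix is empty (pure per-token training); by later iterations the model must generate progressively longer tails on its own, which is what lets the sequence-level comparison shape its predictions.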
Public/Granted literature
- US20180144264A1, TRAINING SEQUENCE NATURAL LANGUAGE PROCESSING ENGINES, Publication Date: 2018-05-24