Training of model for processing sequence data

Invention Grant

US12136411B2 Training of model for processing sequence data 有权

Please log in to see more content

Patent Title: Training of model for processing sequence data
Application No.: US16839976

Application Date: 2020-04-03
Publication No.: US12136411B2

Publication Date: 2024-11-05
Inventor: Gakuto Kurata , Kartik Audhkhasi
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agency: Tutunjian & Bitetto, P.C.
Agent Robert Richard Aragona
Main IPC: G06N3/084
IPC: G06N3/084 ; G06F17/18 ; G06F18/24 ; G06N3/047 ; G10L15/06

Training of model for processing sequence data

Abstract:

A technique for training a model is disclosed. A training sample including an input sequence of observations and a target sequence of symbols having length different from the input sequence of observations is obtained. The input sequence of observations is fed into the model to obtain a sequence of predictions. The sequence of predictions is shifted by an amount with respect to the input sequence of observations. The model is updated based on a loss using a shifted sequence of predictions and the target sequence of the symbols.

Public/Granted literature

US20210312294A1 TRAINING OF MODEL FOR PROCESSING SEQUENCE DATA Public/Granted day:2021-10-07

Information query

Espacenet