Invention Grant
- Patent Title: Training of model for processing sequence data
- Application No.: US16839976
- Application Date: 2020-04-03
- Publication No.: US12136411B2
- Publication Date: 2024-11-05
- Inventor: Gakuto Kurata, Kartik Audhkhasi
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Tutunjian & Bitetto, P.C.
- Agent: Robert Richard Aragona
- Main IPC: G06N3/084
- IPC: G06N3/084; G06F17/18; G06F18/24; G06N3/047; G10L15/06

Abstract:
A technique for training a model is disclosed. A training sample is obtained that includes an input sequence of observations and a target sequence of symbols whose length differs from that of the input sequence of observations. The input sequence of observations is fed into the model to obtain a sequence of predictions. The sequence of predictions is shifted by an amount with respect to the input sequence of observations. The model is updated based on a loss computed using the shifted sequence of predictions and the target sequence of symbols.
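The shift-then-score steps in the abstract can be illustrated with a minimal pure-Python sketch. Everything named here is an assumption made for illustration and is not taken from the patent text: the helper names `shift_predictions` and `ctc_negative_log_likelihood`, the convention that a positive shift moves predictions later in time, the use of a padding frame to keep the sequence length fixed, and the choice of a CTC-style loss (the abstract only says "a loss"). The model update itself, which would normally be a gradient step in an autodiff framework, is omitted.

```python
import math


def shift_predictions(predictions, shift, pad):
    """Shift per-frame predictions along the time axis, keeping the length.

    Assumed convention: shift > 0 moves predictions `shift` frames later and
    fills the first frames with `pad`; shift < 0 moves them earlier and fills
    the last frames with `pad`.
    """
    n = len(predictions)
    if shift == 0:
        return list(predictions)
    k = min(abs(shift), n)
    if shift > 0:
        return [pad] * k + list(predictions[: n - k])
    return list(predictions[k:]) + [pad] * k


def ctc_negative_log_likelihood(predictions, target, blank=0):
    """CTC-style loss (assumed loss choice) via the forward algorithm.

    predictions: list of T frames, each a list of symbol probabilities.
    target: list of symbol indices; may be shorter than T.
    """
    # Interleave blanks around the target symbols: b, s1, b, s2, b, ...
    ext = [blank]
    for symbol in target:
        ext += [symbol, blank]
    size = len(ext)

    # Initialise: a path may start in the leading blank or the first symbol.
    alpha = [0.0] * size
    alpha[0] = predictions[0][blank]
    if size > 1:
        alpha[1] = predictions[0][ext[1]]

    for frame in predictions[1:]:
        new_alpha = [0.0] * size
        for s in range(size):
            total = alpha[s]
            if s >= 1:
                total += alpha[s - 1]
            # Skipping a blank is allowed only between two different symbols.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new_alpha[s] = total * frame[ext[s]]
        alpha = new_alpha

    likelihood = alpha[-1] + (alpha[-2] if size > 1 else 0.0)
    return math.inf if likelihood <= 0.0 else -math.log(likelihood)


# Three frames over two symbols (index 0 = blank, index 1 = "a"); target "a".
frames = [[0.1, 0.9], [0.9, 0.1], [0.9, 0.1]]
blank_frame = [1.0, 0.0]  # hypothetical padding: blank with probability 1
shifted = shift_predictions(frames, 1, blank_frame)
loss = ctc_negative_log_likelihood(shifted, [1])
```

In this sketch the loss is computed on `shifted` rather than on `frames`, so the training signal is defined relative to the shifted alignment between predictions and input frames. The shift amount, its direction, and the padding value are free choices here.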
Public/Granted literature
- US20210312294A1 TRAINING OF MODEL FOR PROCESSING SEQUENCE DATA Public/Granted day: 2021-10-07