Method and server for training a neural network to generate a textual output sequence

Invention Grant

US11984113B2 Method and server for training a neural network to generate a textual output sequence 有权

Please log in to see more content

Patent Title: Method and server for training a neural network to generate a textual output sequence
Application No.: US17490620

Application Date: 2021-09-30
Publication No.: US11984113B2

Publication Date: 2024-05-14
Inventor: Aleksei Sergeevich Petrov , Sergey Dmitrievich Gubanov , Sergey Aleksandrovich Gaydaenko
Applicant: YANDEX EUROPE AG
Applicant Address: CH Lucerne
Assignee: Direct Cursus Technology L.L.C
Current Assignee: Direct Cursus Technology L.L.C
Current Assignee Address: AE Dubai
Agency: BCF LLP
Priority: RU 20132862 2020.10.06
Main IPC: G10L15/00
IPC: G10L15/00 ; G06F40/166 ; G06F40/40 ; G06N3/045 ; G06N3/08 ; G10L13/02 ; G10L15/06 ; G10L15/16 ; G10L15/22 ; G10L15/30

Method and server for training a neural network to generate a textual output sequence

Abstract:

Method and server for training an Attention-based Neural Network (ANN) to generate a textual output sequence being a content summary to be used as a response to a query. The ANN has an encoder and a decoder sub-network. The method includes, (i) inputting a query and input sequence into the encoder sub-network where the input sequence is a sequence of input groups associated with corresponding content snippets, (ii) generating an encoded representation of the input sequence, which includes generating attention-type outputs for respective words from the input sequence by applying an attention-limiting mask configured to attend only to words from the given input group, (iii) generating a decoded representation being a predicted textual output sequence, (iv) generating a penalty and (iii) adjusting the ANN based on the penalty score.

Public/Granted literature

US20220108685A1 METHOD AND SERVER FOR TRAINING A NEURAL NETWORK TO GENERATE A TEXTUAL OUTPUT SEQUENCE Public/Granted day:2022-04-07

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）