Invention Grant
- Patent Title: Method and server for training a neural network to generate a textual output sequence
-
Application No.: US17490620Application Date: 2021-09-30
-
Publication No.: US11984113B2Publication Date: 2024-05-14
- Inventor: Aleksei Sergeevich Petrov , Sergey Dmitrievich Gubanov , Sergey Aleksandrovich Gaydaenko
- Applicant: YANDEX EUROPE AG
- Applicant Address: CH Lucerne
- Assignee: Direct Cursus Technology L.L.C
- Current Assignee: Direct Cursus Technology L.L.C
- Current Assignee Address: AE Dubai
- Agency: BCF LLP
- Priority: RU 20132862 2020.10.06
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G06F40/166 ; G06F40/40 ; G06N3/045 ; G06N3/08 ; G10L13/02 ; G10L15/06 ; G10L15/16 ; G10L15/22 ; G10L15/30

Abstract:
Method and server for training an Attention-based Neural Network (ANN) to generate a textual output sequence being a content summary to be used as a response to a query. The ANN has an encoder and a decoder sub-network. The method includes, (i) inputting a query and input sequence into the encoder sub-network where the input sequence is a sequence of input groups associated with corresponding content snippets, (ii) generating an encoded representation of the input sequence, which includes generating attention-type outputs for respective words from the input sequence by applying an attention-limiting mask configured to attend only to words from the given input group, (iii) generating a decoded representation being a predicted textual output sequence, (iv) generating a penalty and (iii) adjusting the ANN based on the penalty score.
Public/Granted literature
- US20220108685A1 METHOD AND SERVER FOR TRAINING A NEURAL NETWORK TO GENERATE A TEXTUAL OUTPUT SEQUENCE Public/Granted day:2022-04-07
Information query