Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering

Invention Grant

US11113323B2 Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering 有权

Please log in to see more content

Patent Title: Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering
Application No.: US16420764

Application Date: 2019-05-23
Publication No.: US11113323B2

Publication Date: 2021-09-07
Inventor: Seung-hyun Yoon , Franck Dernoncourt , Trung Huu Bui , Doo Soon Kim , Carl Iwan Dockhorn , Yu Gong
Applicant: ADOBE INC.
Applicant Address: US CA San Jose
Assignee: ADOBE INC.
Current Assignee: ADOBE INC.
Current Assignee Address: US CA San Jose
Agency: Shook, Hardy & Bacon L.L.P.
Main IPC: G06F7/00
IPC: G06F7/00 ; G06F16/332 ; G06N20/00 ; G06F16/33

Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering

Abstract:

Embodiments of the present invention provide systems, methods, and computer storage media for techniques for identifying textual similarity and performing answer selection. A textual-similarity computing model can use a pre-trained language model to generate vector representations of a question and a candidate answer from a target corpus. The target corpus can be clustered into latent topics (or other latent groupings), and probabilities of a question or candidate answer being in each of the latent topics can be calculated and condensed (e.g., downsampled) to improve performance and focus on the most relevant topics. The condensed probabilities can be aggregated and combined with a downstream vector representation of the question (or answer) so the model can use focused topical and other categorical information as auxiliary information in a similarity computation. In training, transfer learning may be applied from a large-scale corpus, and the conventional list-wise approach can be replaced with point-wise learning.

Public/Granted literature

US20200372025A1 ANSWER SELECTION USING A COMPARE-AGGREGATE MODEL WITH LANGUAGE MODEL AND CONDENSED SIMILARITY INFORMATION FROM LATENT CLUSTERING Public/Granted day:2020-11-26

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F7/00	通过待处理的数据的指令或内容进行运算的数据处理的方法或装置（逻辑电路入H03K19/00）