Techniques for out-of-domain (OOD) detection

Invention Grant

US12014146B2 Techniques for out-of-domain (OOD) detection 有权

Please log in to see more content

Patent Title: Techniques for out-of-domain (OOD) detection
Application No.: US18364298

Application Date: 2023-08-02
Publication No.: US12014146B2

Publication Date: 2024-06-18
Inventor: Thanh Long Duong , Mark Edward Johnson , Vishal Vishnoi , Crystal C. Pan , Vladislav Blinov , Cong Duy Vu Hoang , Elias Luqman Jalaluddin , Duy Vu , Balakota Srinivas Vinnakota
Applicant: Oracle International Corporation
Applicant Address: US CA Redwood Shores
Assignee: Oracle International Corporation
Current Assignee: Oracle International Corporation
Current Assignee Address: US CA Redwood Shores
Agency: Kilpatrick Townsend & Stockton LLP
Main IPC: G06F40/30
IPC: G06F40/30 ; G06F40/205 ; G06F40/289 ; G06N20/00 ; H04L51/02

Techniques for out-of-domain (OOD) detection

Abstract:

The present disclosure relates to techniques for identifying out-of-domain utterances. One particular technique includes receiving an utterance and a target domain of a chatbot, generating a sentence embedding for the utterance, obtaining an embedding representation for each cluster of in-domain utterances associated with the target domain, predicting, using a metric learning model, a first probability that the utterance belongs to the target domain based on a similarity or difference between the sentence embedding and each embedding representation for each cluster, predicting, using an outlier detection model, a second probability that the utterance belongs to the target domain based on a determined distance or density deviation between the sentence embedding and embedding representations for neighboring clusters, evaluating the first probability and the second probability to determine a final probability, and classifying the utterance as in-domain or out-of-domain for the chatbot based on the final probability.

Public/Granted literature

US20230376696A1 TECHNIQUES FOR OUT-OF-DOMAIN (OOD) DETECTION Public/Granted day:2023-11-23

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F40/00	处理自然语言数据（语音分析或综合，语音识别G10L）
G06F40/30	.语义分析