System and methods for training task-oriented dialogue (TOD) language models

Invention Grant

US11749264B2 System and methods for training task-oriented dialogue (TOD) language models 有权

Please log in to see more content

Patent Title: System and methods for training task-oriented dialogue (TOD) language models
Application No.: US17088206

Application Date: 2020-11-03
Publication No.: US11749264B2

Publication Date: 2023-09-05
Inventor: Chien-Sheng Wu , Chu Hong Hoi , Richard Socher , Caiming Xiong
Applicant: salesforce.com, inc.
Applicant Address: US CA San Francisco
Assignee: Salesforce, Inc.
Current Assignee: Salesforce, Inc.
Current Assignee Address: US CA San Francisco
Agency: Haynes and Boone, LLP
Main IPC: G10L15/18
IPC: G10L15/18 ; G10L15/06

System and methods for training task-oriented dialogue (TOD) language models

Abstract:

Embodiments described herein provide methods and systems for training task-oriented dialogue (TOD) language models. In some embodiments, a TOD language model may receive a TOD dataset including a plurality of dialogues and a model input sequence may be generated from the dialogues using a first token prefixed to each user utterance and a second token prefixed to each system response of the dialogues. In some embodiments, the first token or the second token may be randomly replaced with a mask token to generate a masked training sequence and a masked language modeling (MLM) loss may be computed using the masked training sequence. In some embodiments, the TOD language model may be updated based on the MLM loss.

Public/Granted literature

US20220139384A1 SYSTEM AND METHODS FOR TRAINING TASK-ORIENTED DIALOGUE (TOD) LANGUAGE MODELS Public/Granted day:2022-05-05

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/18	..利用自然语言模型