Invention Grant
- Patent Title: System and methods for training task-oriented dialogue (TOD) language models
-
Application No.: US17088206Application Date: 2020-11-03
-
Publication No.: US11749264B2Publication Date: 2023-09-05
- Inventor: Chien-Sheng Wu , Chu Hong Hoi , Richard Socher , Caiming Xiong
- Applicant: salesforce.com, inc.
- Applicant Address: US CA San Francisco
- Assignee: Salesforce, Inc.
- Current Assignee: Salesforce, Inc.
- Current Assignee Address: US CA San Francisco
- Agency: Haynes and Boone, LLP
- Main IPC: G10L15/18
- IPC: G10L15/18 ; G10L15/06

Abstract:
Embodiments described herein provide methods and systems for training task-oriented dialogue (TOD) language models. In some embodiments, a TOD language model may receive a TOD dataset including a plurality of dialogues and a model input sequence may be generated from the dialogues using a first token prefixed to each user utterance and a second token prefixed to each system response of the dialogues. In some embodiments, the first token or the second token may be randomly replaced with a mask token to generate a masked training sequence and a masked language modeling (MLM) loss may be computed using the masked training sequence. In some embodiments, the TOD language model may be updated based on the MLM loss.
Public/Granted literature
- US20220139384A1 SYSTEM AND METHODS FOR TRAINING TASK-ORIENTED DIALOGUE (TOD) LANGUAGE MODELS Public/Granted day:2022-05-05
Information query