Invention Grant
- Patent Title: Cold fusing sequence-to-sequence models with language models
-
Application No.: US17061455Application Date: 2020-10-01
-
Publication No.: US11620986B2Publication Date: 2023-04-04
- Inventor: Anuroop Sriram , Heewoo Jun , Sanjeev Satheesh , Adam Coates
- Applicant: Baidu USA, LLC
- Applicant Address: US CA Sunnyvale
- Assignee: Baidu USA, LLC
- Current Assignee: Baidu USA, LLC
- Current Assignee Address: US CA Sunnyvale
- Agency: North Weber & Baugh LLP
- Main IPC: G10L15/06
- IPC: G10L15/06 ; G06N3/08 ; G10L15/183 ; G06N3/04 ; G06N3/088 ; G10L15/16

Abstract:
Described herein are systems and methods for generating natural language sentences with Sequence-to-sequence (Seq2Seq) models with attention. The Seq2Seq models may be implemented in applications, such as machine translation, image captioning, and speech recognition. Performance has further been improved by leveraging unlabeled data, often in the form of a language models. Disclosed herein are “Cold Fusion” architecture embodiments that leverage a pre-trained language model during training. The Seq2Seq models with Cold Fusion embodiments are able to better utilize language information enjoying faster convergence, better generalization, and almost complete transfer to a new domain while using less labeled training data.
Public/Granted literature
- US20210027767A1 COLD FUSING SEQUENCE-TO-SEQUENCE MODELS WITH LANGUAGE MODELS Public/Granted day:2021-01-28
Information query