Invention Grant
- Patent Title: Cross-lingual zero-shot transfer via semantic and synthetic representation learning
-
Application No.: US17464005Application Date: 2021-09-01
-
Publication No.: US12050870B2Publication Date: 2024-07-30
- Inventor: Xuchao Zhang , Yanchi Liu , Bo Zong , Wei Cheng , Haifeng Chen , Junxiang Wang
- Applicant: NEC Laboratories America, Inc.
- Applicant Address: US NJ Princeton
- Assignee: NEC Corporation
- Current Assignee: NEC Corporation
- Current Assignee Address: JP Tokyo
- Agent Joseph Kolodka
- Main IPC: G06F40/284
- IPC: G06F40/284 ; G06F40/205 ; G06F40/295 ; G06N3/04

Abstract:
A computer-implemented method is provided for cross-lingual transfer. The method includes randomly masking a source corpus and a target corpus to obtain a masked source corpus and a masked target corpus. The method further includes tokenizing, by pretrained Natural Language Processing (NLP) models, the masked source corpus and the masked target corpus to obtain source tokens and target tokens. The method also includes transforming the source tokens and the target tokens into a source dependency parsing tree and a target dependency parsing tree. The method additionally includes inputting the source dependency parsing tree and the target dependency parsing tree into a graph encoder pretrained on a translation language modeling task to extract common language information for transfer. The method further includes fine-tuning the graph encoder and a down-stream network for a specific NLP down-stream task.
Public/Granted literature
- US20220075945A1 CROSS-LINGUAL ZERO-SHOT TRANSFER VIA SEMANTIC AND SYNTHETIC REPRESENTATION LEARNING Public/Granted day:2022-03-10
Information query