-
1.
Publication No.: WO2022240558A1
Publication Date: 2022-11-17
Application No.: PCT/US2022/025487
Filing Date: 2022-04-20
Applicant: NEC LABORATORIES AMERICA, INC.
Inventor: CHENG, Wei; CHEN, Haifeng; ZHANG, Xuchao; LUO, Dongsheng
IPC: G06F16/34, G06F40/279, G06F40/169, G06N3/08, G06N3/04, G06F16/345, G06F40/284, G06F40/289
Abstract: A computer-implemented method is provided for keyphrase generation. The method includes pretraining (1210), by a processor device, a policy neural network on training documents using a sequence-to-sequence model. The training documents are each associated with a list of keyphrases included therein. The method further includes training (1220), by the processor device, the policy neural network using reinforcement learning with a summarization reward on present annotated keyphrases in an input training document and absent annotated keyphrases from the input training document that semantically describe a concept of the input training document. The method also includes predicting (1230), by the processor device, new keyphrases using the trained policy neural network.
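The abstract distinguishes present keyphrases (which occur in the document text) from absent ones (which describe the document but do not occur in it). As a rough illustration only, the sketch below partitions annotated keyphrases this way and scores a prediction with a set-level F1 on each group. The function names, the substring test for "present", and the F1-based score are all assumptions made for this example; the abstract does not define its summarization reward, and this stand-in is not that reward.

```python
def split_keyphrases(document, keyphrases):
    """Partition keyphrases into those present in the text and those absent.

    Assumption: "present" means the lowercased, whitespace-normalized phrase
    is a substring of the normalized document (no stemming, no word boundaries).
    """
    text = " ".join(document.lower().split())
    present, absent = [], []
    for phrase in keyphrases:
        norm = " ".join(phrase.lower().split())
        (present if norm in text else absent).append(norm)
    return present, absent


def f1(predicted, gold):
    """Set-level F1 between predicted and gold phrases (0.0 if no overlap)."""
    pred, ref = set(predicted), set(gold)
    hits = len(pred & ref)
    if hits == 0:
        return 0.0
    precision = hits / len(pred)
    recall = hits / len(ref)
    return 2 * precision * recall / (precision + recall)


def toy_reward(document, gold_keyphrases, predicted):
    """Hypothetical placeholder reward: mean of present-F1 and absent-F1."""
    gold_present, gold_absent = split_keyphrases(document, gold_keyphrases)
    pred_present, pred_absent = split_keyphrases(document, predicted)
    return 0.5 * (f1(pred_present, gold_present) + f1(pred_absent, gold_absent))
```

In a reinforcement-learning setup, a scalar of this kind would be what scales the policy gradient for a sampled keyphrase sequence; how the claimed method actually computes and applies its reward is not stated in the abstract.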
-
Publication No.: WO2022109134A1
Publication Date: 2022-05-27
Application No.: PCT/US2021/059888
Filing Date: 2021-11-18
Applicant: NEC LABORATORIES AMERICA, INC.
Inventor: CHENG, Wei; CHEN, Haifeng; NI, Jingchao; LUO, Dongsheng
Abstract: Systems and methods for augmenting data sets are provided. The systems and methods include feeding an original document (120) into a data augmentation generator (210) to produce one or more augmented documents (220); calculating a contrastive loss (230) between the original document (120) and the one or more augmented documents (220); and using the original document (120) and the one or more augmented documents (220) to train a neural network (1030).
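The abstract does not say which contrastive loss is used. One common choice is an InfoNCE-style loss, sketched below on plain embedding vectors: the original document's embedding is the anchor, the augmented documents are positives, and embeddings of unrelated documents serve as negatives. The function names, the cosine similarity, the temperature value, and the use of negatives are assumptions for illustration, not details taken from the application.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positives, negatives, temperature=0.5):
    """Mean InfoNCE loss of an anchor embedding against its augmented views.

    anchor:    embedding of the original document
    positives: embeddings of the augmented documents
    negatives: embeddings of unrelated documents (an assumption of this sketch)
    """
    neg_sum = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    losses = []
    for p in positives:
        pos = math.exp(cosine(anchor, p) / temperature)
        # The loss shrinks as the positive grows more similar than the negatives.
        losses.append(-math.log(pos / (pos + neg_sum)))
    return sum(losses) / len(losses)
```

Under this sketch, an augmented document whose embedding stays close to the original yields a lower loss than one that drifts away, which is the signal a training loop would minimize alongside the downstream task loss.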
-