Invention Grant
- Patent Title: Representation learning for input classification via topic sparse autoencoder and entity embedding
-
Application No.: US16691554Application Date: 2019-11-21
-
Publication No.: US11615311B2Publication Date: 2023-03-28
- Inventor: Dingcheng Li , Jingyuan Zhang , Ping Li
- Applicant: Baidu USA, LLC
- Applicant Address: US CA Sunnyvale
- Assignee: Baidu USA, LLC
- Current Assignee: Baidu USA, LLC
- Current Assignee Address: US CA Sunnyvale
- Agency: North Weber & Baugh LLP
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04 ; G06K9/62 ; G06F17/18 ; G06N3/084

Abstract:
Described herein are embodiments of a unified neural network framework to integrate Topic modeling, Word embedding and Entity Embedding (TWEE) for representation learning of inputs. In one or more embodiments, a novel topic sparse autoencoder is introduced to incorporate discriminative topics into the representation learning of the input. Topic distributions of inputs are generated from a global viewpoint and are utilized to enable autoencoder to learn topical representations. A sparsity constraint may be added to ensure that the most discriminative representations are related to topics. In addition, both words and entity related information may be embedded into the network to help learn a more comprehensive input representation. Extensive empirical experiments show that embodiments of the TWEE framework outperform the state-of-the-art methods on different datasets.
Public/Granted literature
- US20200184339A1 REPRESENTATION LEARNING FOR INPUT CLASSIFICATION VIA TOPIC SPARSE AUTOENCODER AND ENTITY EMBEDDING Public/Granted day:2020-06-11
Information query