Invention Grant
- Patent Title: Graph-based labeling rule augmentation for weakly supervised training of machine-learning-based named entity recognition
- Application No.: US17185664
- Application Date: 2021-02-25
- Publication No.: US11669740B2
- Publication Date: 2023-06-06
- Inventor: Xinyan Zhao, Haibo Ding, Zhe Feng
- Applicant: Robert Bosch GmbH
- Applicant Address: DE Stuttgart
- Assignee: Robert Bosch GmbH
- Current Assignee: Robert Bosch GmbH
- Current Assignee Address: DE Stuttgart
- Agency: Michael Best & Friedrich, LLP
- Main IPC: G06N3/08
- IPC: G06N3/08; G06F40/30; G06F40/295; G06N3/042

Abstract:
Systems and methods for training a machine-learning model for named-entity recognition. A rule graph is constructed including a plurality of nodes each corresponding to a different labeling rule of a set of labeling rules (including a set of seeding rules of known labeling accuracy and a plurality of candidate rules of unknown labeling accuracy). The nodes are coupled to other nodes based on which rules exhibit the highest semantic similarity. A labeling accuracy metric is estimated for each candidate rule by propagating a labeling confidence metric through the rule graph from the seeding rules to each candidate rule. A subset of labeling rules is then identified by ranking the rules by their labeling confidence metric. The identified subset of labeling rules is applied to unlabeled data to generate a set of weakly labeled named entities and the machine-learning model is trained based on the set of weakly labeled named entities.
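
The graph construction, confidence propagation, and ranking steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how semantic similarity is computed or how confidence is propagated, so the sketch assumes that each rule has an embedding vector, that similarity is cosine similarity, that each node is linked to its k most similar rules, and that propagation is a repeated similarity-weighted average with the seeding rules held fixed. All function names, rule names, and vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def build_rule_graph(embeddings, k=1):
    """Link every rule to its k most similar rules (undirected, weighted).

    Assumption: `embeddings` maps a rule name to a vector representation.
    """
    names = list(embeddings)
    graph = {name: {} for name in names}
    for name in names:
        ranked = sorted(
            ((cosine(embeddings[name], embeddings[other]), other)
             for other in names if other != name),
            reverse=True,
        )
        for sim, other in ranked[:k]:
            graph[name][other] = sim
            graph[other][name] = sim
    return graph


def propagate_confidence(graph, seed_confidence, iterations=20):
    """Spread confidence from seeding rules to candidate rules.

    Assumption: a simple label-propagation scheme. Seeding rules keep their
    known value; each candidate takes the similarity-weighted mean of its
    neighbours' current values.
    """
    conf = {name: seed_confidence.get(name, 0.0) for name in graph}
    for _ in range(iterations):
        updated = {}
        for name, neighbours in graph.items():
            if name in seed_confidence:
                updated[name] = seed_confidence[name]  # seeds stay clamped
                continue
            weights = {n: max(w, 0.0) for n, w in neighbours.items()}
            total = sum(weights.values())
            updated[name] = (
                sum(w * conf[n] for n, w in weights.items()) / total
                if total else 0.0
            )
        conf = updated
    return conf


def select_rules(conf, top_n):
    """Rank rules by propagated confidence and keep the top_n."""
    return sorted(conf, key=conf.get, reverse=True)[:top_n]


# Hypothetical toy data: two seeding rules and two candidate rules.
embeddings = {
    "seed_good": [1.0, 0.0],
    "seed_bad": [0.0, 1.0],
    "cand_a": [0.9, 0.1],
    "cand_b": [0.1, 0.9],
}
seeds = {"seed_good": 1.0, "seed_bad": 0.0}

graph = build_rule_graph(embeddings, k=1)
confidence = propagate_confidence(graph, seeds)
selected = select_rules(confidence, top_n=2)
```

In this toy setup the candidate that sits closest to the reliable seeding rule inherits the higher confidence and is selected. In a full pipeline the selected rules would then be applied to unlabeled text to produce the weakly labeled entities used for training.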