Graph-based labeling rule augmentation for weakly supervised training of machine-learning-based named entity recognition

Invention Grant

US11669740B2 Graph-based labeling rule augmentation for weakly supervised training of machine-learning-based named entity recognition 有权

Please log in to see more content

Patent Title: Graph-based labeling rule augmentation for weakly supervised training of machine-learning-based named entity recognition
Application No.: US17185664

Application Date: 2021-02-25
Publication No.: US11669740B2

Publication Date: 2023-06-06
Inventor: Xinyan Zhao , Haibo Ding , Zhe Feng
Applicant: Robert Bosch GmbH
Applicant Address: DE Stuttgart
Assignee: Robert Bosch GmbH
Current Assignee: Robert Bosch GmbH
Current Assignee Address: DE Stuttgart
Agency: Michael Best & Friedrich, LLP
Main IPC: G06N3/08
IPC: G06N3/08 ; G06F40/30 ; G06F40/295 ; G06N3/042

Graph-based labeling rule augmentation for weakly supervised training of machine-learning-based named entity recognition

Abstract:

Systems and methods for training a machine-learning model for named-entity recognition. A rule graph is constructed including a plurality of nodes each corresponding to a different labeling rule of a set of labeling rules (including a set of seeding rules of known labeling accuracy and a plurality of candidate rules of unknown labeling accuracy). The nodes are coupled to other nodes based on which rules exhibit the highest sematic similarity. A labeling accuracy metric is estimated for each candidate rule by propagating a labeling confidence metric through the rule graph from the seeding rules to each candidate rule. A subset of labeling rules is then identified by ranking the rules by their labeling confidence metric. The identified subset of labeling rules is applied to unlabeled data to generate a set of weakly labeled named entities and the machine-learning model is trained based on the set of weakly labeled named entities.

Public/Granted literature

US20220269939A1 GRAPH-BASED LABELING RULE AUGMENTATION FOR WEAKLY SUPERVISED TRAINING OF MACHINE-LEARNING-BASED NAMED ENTITY RECOGNITION Public/Granted day:2022-08-25

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法