Scalable multilingual named-entity recognition

Invention Grant

US10699077B2 Scalable multilingual named-entity recognition 有权

Please log in to see more content

Patent Title: Scalable multilingual named-entity recognition
Application No.: US15406586

Application Date: 2017-01-13
Publication No.: US10699077B2

Publication Date: 2020-06-30
Inventor: Yashar Mehdad , Aasish Pappu , Amanda Stent
Applicant: Oath Inc.
Applicant Address: US NY New York
Assignee: Oath Inc.
Current Assignee: Oath Inc.
Current Assignee Address: US NY New York
Agency: Cooper Legal Group, LLC
Main IPC: G06F40/295
IPC: G06F40/295 ; G06F16/9535 ; G06F40/30

Scalable multilingual named-entity recognition

Abstract:

Software on a website serves a user of an online content aggregation service a first article that the user views. The software extracts named entities from the first article using a named-entity recognizer. The named-entity recognizer uses a sequence of word embeddings as inputs to a conditional random field (CRF) tool to assign labels to each of the word embeddings. Each of the word embeddings is associated with a word in the first article and is trained using an entire topical article from a corpus of topical articles as a context for the word. The software then creates rankings for articles ingested by the content aggregation service based at least in part on the named entities and serves the user a second article using the rankings.

Public/Granted literature

US20180203843A1 Scalable Multilingual Named-Entity Recognition Public/Granted day:2018-07-19

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F40/00	处理自然语言数据（语音分析或综合，语音识别G10L）
G06F40/20	.自然语言分析（自然语言的语义分析入G06F40/30）
G06F40/279	..文字实体的识别
G06F40/289	...短语分析，例如有限状态技术或分块
G06F40/295	....命名实体识别