Unified embeddings for translation

Invention Grant

US10796107B2 Unified embeddings for translation (in force)

Patent Title: Unified embeddings for translation
Application No.: US16232984

Application Date: 2018-12-26
Publication No.: US10796107B2

Publication Date: 2020-10-06
Inventor: Terry Kong
Applicant: SoundHound, Inc.
Applicant Address: US CA Santa Clara
Assignee: SoundHound, Inc.
Current Assignee: SoundHound, Inc.
Current Assignee Address: US CA Santa Clara
Agency: Haynes Beffel & Wolfeld LLP
Agent: Andrew L. Dunlap
Main IPC: G06F40/216
IPC: G06F40/216 ; G06F40/58 ; G06K9/62 ; G06F40/295

Abstract:

A method of training word embeddings is provided. The method includes determining anchors, each comprising a first word in a first domain and a second word in a second domain; training word embeddings for the first and second domains; and training a transform for transforming word embedding vectors in the first domain to word embedding vectors in the second domain. The training minimizes a loss function that includes an anchor loss for each anchor. For each anchor, the anchor loss is based on a distance between the embedding vector of the anchor's second word and the transform of the embedding vector of the anchor's first word, and is zero when that distance is less than a specific tolerance.
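
The anchor loss described in the abstract can be illustrated with a minimal sketch. The abstract states only that the loss is based on a distance and is zero inside a tolerance; the linear transform (a matrix), the Euclidean distance, the hinge form max(0, distance - tolerance), and the names anchor_loss and total_anchor_loss are all assumptions made for illustration and are not taken from the patent. The sketch covers the anchor term only, not the embedding-training terms of the full loss function.

```python
import numpy as np


def anchor_loss(first_vec, second_vec, transform, tolerance):
    """Anchor loss for one anchor (hypothetical hinge form).

    Assumes the transform is a matrix applied to the first word's
    embedding and that distance is Euclidean. Returns 0.0 when the
    distance is within the tolerance.
    """
    distance = np.linalg.norm(transform @ first_vec - second_vec)
    return max(0.0, float(distance) - tolerance)


def total_anchor_loss(first_vecs, second_vecs, transform, tolerance):
    """Sum of the per-anchor losses over rows of two aligned arrays.

    first_vecs[i] and second_vecs[i] are the embeddings of the first
    and second word of anchor i.
    """
    diffs = first_vecs @ transform.T - second_vecs
    distances = np.linalg.norm(diffs, axis=1)
    return float(np.sum(np.maximum(0.0, distances - tolerance)))
```

With this form, anchors whose transformed first-word vector already lies within the tolerance of the second-word vector contribute nothing to the loss, so only anchors outside the tolerance would influence the transform during training.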

Published/Granted documents

US20200210529A1 UNIFIED EMBEDDINGS FOR TRANSLATION Publication/grant date: 2020-07-02

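IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F40/00	Handling natural language data (speech analysis or synthesis, speech recognition: see G10L)
G06F40/20	.Natural language analysis (semantic analysis of natural language: see G06F40/30)
G06F40/205	..Parsing
G06F40/216	...Using statistical methods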