Invention Grant
- Patent Title: Word vector processing for foreign languages
-
Application No.: US15874725Application Date: 2018-01-18
-
Publication No.: US10430518B2Publication Date: 2019-10-01
- Inventor: Shaosheng Cao , Xiaolong Li
- Applicant: Alibaba Group Holding Limited
- Applicant Address: KY George Town, Grand Cayman
- Assignee: Alibaba Group Holding Limited
- Current Assignee: Alibaba Group Holding Limited
- Current Assignee Address: KY George Town, Grand Cayman
- Agency: Fish & Richardson P.C.
- Priority: CN201710045459 20170122
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06N20/00 ; G06F17/28 ; G06N3/08

Abstract:
A word vector processing method is provided. Word segmentation is performed on a corpus to obtain words, and n-gram strokes corresponding to the words are determined. Each n-gram stroke represents n successive strokes of a corresponding word. Word vectors of the words and stroke vectors of the n-gram strokes are initialized corresponding to the words. After performing the word segmentation, the n-gram strokes are determined, and the word vectors and stroke vectors are determined, training the word vectors and the stroke vectors.
Public/Granted literature
- US20180210876A1 WORD VECTOR PROCESSING FOR FOREIGN LANGUAGES Public/Granted day:2018-07-26
Information query