- Patent Title: System, method, and recording medium for natural language learning
-
Application No.: US15087050Application Date: 2016-03-31
-
Publication No.: US10282411B2Publication Date: 2019-05-07
- Inventor: Octavian Popescu , Vadim Sheinin
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: McGinn IP Law Group, PLLC
- Agent Rahan Uddin, Esq.
- Main IPC: G06F17/27
- IPC: G06F17/27

Abstract:
A natural language learning method, system, and non-transitory computer readable medium include analyzing a corpus of sentences stored in a database to identify an internal structure of words in the corpus of sentences, creating a plurality of new words that are a combination of the internal structure of a word of the words in the corpus of sentences and the word, clustering the plurality of new words created by the creating that match into a plurality of cluster groups, filtering the plurality of cluster groups to create a partial set of each of the plurality of cluster groups, and performing word embedding processing on the partial set of each of the plurality of cluster groups to obtain vectors for new words.
Public/Granted literature
- US20170286403A1 SYSTEM, METHOD, AND RECORDING MEDIUM FOR NATURAL LANGUAGE LEARNING Public/Granted day:2017-10-05
Information query