Invention Grant
- Patent Title: Clustering classes in language modeling
- Patent Title (Chinese, translated): Clustering in language modeling
- Application No.: US14656027
- Application Date: 2015-03-12
- Publication No.: US09529898B2
- Publication Date: 2016-12-27
- Inventor: Mark Edward Epstein, Vladislav Schogol
- Applicant: Google Inc.
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F17/27

Abstract:
This document describes, among other things, a computer-implemented method. The method can include obtaining a plurality of text samples that each include one or more terms belonging to a first class of terms. The plurality of text samples can be classified into a plurality of groups of text samples. Each group of text samples can correspond to a different sub-class of terms. For each of the groups of text samples, a sub-class context model can be generated based on the text samples in the respective group of text samples. Particular ones of the sub-class context models that are determined to be similar can be merged to generate a hierarchical set of context models. Further, the method can include selecting particular ones of the context models and generating a class-based language model based on the selected context models.
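
The pipeline in the abstract (group samples by sub-class, build one context model per group, merge similar models into a hierarchy, select models) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption for illustration and is not taken from the patent: the `<term>` placeholder standing in for the class term, the pre-labelled `groups` dictionary, bag-of-words counts as the "context model", cosine similarity with a fixed threshold as the similarity test, and the choice of the top-level merged models as the "selected" ones.

```python
import math
from collections import Counter
from itertools import combinations

PLACEHOLDER = "<term>"  # hypothetical marker standing in for the class term

def context_model(samples):
    """Count the words surrounding the class term across one group of samples."""
    counts = Counter()
    for text in samples:
        counts.update(w for w in text.split() if w != PLACEHOLDER)
    return counts

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def merge_hierarchy(models, threshold=0.5):
    """Greedily merge the most similar pair of models until none is similar enough.

    Returns (nodes, roots): every model created along the way, and the
    top-level models that remain. Keys are tuples of sub-class names.
    """
    roots = {(name,): model for name, model in models.items()}
    nodes = dict(roots)
    while len(roots) > 1:
        (ka, kb), sim = max(
            ((pair, cosine(roots[pair[0]], roots[pair[1]]))
             for pair in combinations(roots, 2)),
            key=lambda item: item[1],
        )
        if sim < threshold:
            break
        merged = roots.pop(ka) + roots.pop(kb)  # Counter addition sums the counts
        key = tuple(sorted(ka + kb))
        roots[key] = merged
        nodes[key] = merged
    return nodes, roots

# Toy text samples, already grouped by an assumed sub-class label.
groups = {
    "street": ["directions to <term>", "navigate to <term>"],
    "city": ["directions to <term>", "weather in <term>"],
    "restaurant": ["book a table at <term>", "menu for <term>"],
}

models = {name: context_model(samples) for name, samples in groups.items()}
nodes, selected = merge_hierarchy(models)

for key, model in selected.items():
    print(key, dict(model))
```

In this toy data the "street" and "city" groups share the contexts "directions" and "to", so they are merged into one node, while "restaurant" stays separate. A class-based language model would then be built from the selected models; that step is not sketched here.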
Public/Granted literature
- US20160062985A1 Clustering Classes in Language Modeling (Public/Granted day: 2016-03-03)