Invention Grant
- Patent Title: Correcting N-gram probabilities by page view information
- Application No.: US13965492
- Application Date: 2013-08-13
- Publication No.: US09251135B2
- Publication Date: 2016-02-02
- Inventor: Nathan M. Bodenstab, Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura, Paul J. Vozila
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Tutunjian & Bitetto, P.C.
- Agent: Vazken Alexanian
- Main IPC: G06F17/27
- IPC: G06F17/27; G06F17/20; G06F17/21

Abstract:
Methods and a system for calculating N-gram probabilities in a language model. A method includes counting N-grams in each page of a plurality of pages or in each document of a plurality of documents to obtain respective N-gram counts therefor. The method further includes applying weights to the respective N-gram counts based on at least one of view counts and rankings to obtain weighted respective N-gram counts. The view counts and the rankings are determined with respect to the plurality of pages or the plurality of documents. The method also includes merging the weighted respective N-gram counts to obtain merged weighted respective N-gram counts for the plurality of pages or the plurality of documents. The method additionally includes calculating a respective probability for each of the N-grams based on the merged weighted respective N-gram counts.
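
The four steps in the abstract (count, weight, merge, calculate) can be illustrated with a minimal Python sketch. The abstract does not specify the weighting function or the probability estimator, so two assumptions are made here purely for illustration: each page's N-gram counts are multiplied by its raw view count, and probabilities are plain relative frequencies with no smoothing. The function names `count_ngrams` and `weighted_ngram_probabilities` and the sample pages are hypothetical and do not come from the patent.

```python
from collections import Counter


def count_ngrams(tokens, n):
    """Count the N-grams of order n in one tokenized page."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def weighted_ngram_probabilities(pages, n=2):
    """Estimate P(last word | preceding n-1 words) from view-weighted counts.

    pages: iterable of (tokens, view_count) pairs.
    Assumption: the weight of a page is its raw view count.
    Assumption: relative-frequency estimate, no smoothing or back-off.
    """
    # Steps 1-3: count per page, apply the page weight, merge across pages.
    merged = Counter()
    for tokens, views in pages:
        for ngram, count in count_ngrams(tokens, n).items():
            merged[ngram] += views * count

    # Step 4: normalize each N-gram by the total weight of its context.
    context_totals = Counter()
    for ngram, weight in merged.items():
        context_totals[ngram[:-1]] += weight

    return {
        ngram: weight / context_totals[ngram[:-1]]
        for ngram, weight in merged.items()
    }


# Hypothetical data: two pages with very different view counts.
pages = [
    ("the cat sat".split(), 90),
    ("the dog sat".split(), 10),
]
probs = weighted_ngram_probabilities(pages, n=2)
```

With unweighted counts, "cat" and "dog" would be equally likely after "the"; under the view-count weighting assumed above, the bigram from the more frequently viewed page dominates.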
Public/Granted literature
- US20150051899A1: CORRECTING N-GRAM PROBABILITIES BY PAGE VIEW INFORMATION. Public/Granted day: 2015-02-19