Invention Grant
- Patent Title: Analyzing the ability to find textual content
- Patent Title (中): 分析查找文字内容的能力
-
Application No.: US11461464Application Date: 2006-08-01
-
Publication No.: US07792830B2Publication Date: 2010-09-07
- Inventor: David Carmel , Adam Darlow , Shai Fine , Dan Pelleg , Elad Yom-Tov
- Applicant: David Carmel , Adam Darlow , Shai Fine , Dan Pelleg , Elad Yom-Tov
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F15/18 ; G06F17/00 ; G06N5/02

Abstract:
A method and system for analyzing a document set (202, 420) are provided. The method includes determining a set of terms (312) from the terms of the document set that minimizes a distance measurement (405) from the given set of documents (420). The method includes using a greedy algorithm to build the set of terms incrementally, at each stage finding a single word that is closest to the document set (202, 420). The set of terms is evaluated to assess the ability to find the document set (202, 420). The set of terms are compared with expected terms to evaluate the ability to find the document set (202, 420). A measure of the ability to find a document set (202, 420) is provided by computing a distance measure (403) between a document set and an entire collection.
Public/Granted literature
- US20080033971A1 Analyzing the Ability to Find Textual Content Public/Granted day:2008-02-07
Information query