Invention Grant
- Patent Title: Document retrieval system and document retrieval method
- Patent Title (中): 文件检索系统和文件检索方法
-
Application No.: US12029694Application Date: 2008-02-12
-
Publication No.: US08046368B2Publication Date: 2011-10-25
- Inventor: Hiroko Ohi , Yoshiki Niwa , Kiyohiro Obara
- Applicant: Hiroko Ohi , Yoshiki Niwa , Kiyohiro Obara
- Applicant Address: JP Tokyo
- Assignee: Hitachi, Ltd.
- Current Assignee: Hitachi, Ltd.
- Current Assignee Address: JP Tokyo
- Agency: Antonelli, Terry, Stout & Kraus, LLP.
- Priority: JP2007-119872 20070427
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
A document retrieval is performed with similarities between documents in numeric data taken into consideration. To this end, generated is a set E of intervals in which each element of a set D of numeric values representing a feature A is included in any one of the intervals. Each numeric value in each document is indexed by assigning, with 1, an interval including an element x of the set D, and with 0, an interval without the element x. Each document data including numeric values is indexed by indexing its text part with term frequencies, and by indexing its numeric-value part with the above-described numeric value indexing scheme. By use of indices thus created for each of the document data, similarities between the document data are calculated using a vector space model or a probability model, and the document data are presented in order of similarity.
Public/Granted literature
- US20080270386A1 DOCUMENT RETRIEVAL SYSTEM AND DOCUMENT RETRIEVAL METHOD Public/Granted day:2008-10-30
Information query