SYSTEM AND METHOD FOR INFORMATION RETRIEVAL

    Publication No.: JPH0844759A

    Publication date: 1996-02-16

    Application No.: JP17639694

    Application date: 1994-07-28

    Applicant: IBM JAPAN

    Abstract: PURPOSE: To guide a user who is unfamiliar with database retrieval to the desired retrieval result by dynamically applying prepared views that display itemized counts. CONSTITUTION: A retrieval execution part searches a book information database 2004 in response to a request to retrieve book information. A book information retrieval mechanism 2018 consists of a keyword retrieval mechanism 2020, which in response to a keyword retrieval request searches not only a keyword list 2008 but also a synonym dictionary 2006 and a keyword-document correspondence table 2010, and a full-text retrieval mechanism 2022, which searches indexes 2012 for full-text retrieval in response to a full-text retrieval request. Plural views are prepared, each of which classifies the stored information from a fixed viewpoint and displays the number of constituent elements per category. During retrieval, the views change dynamically in association with one another according to the number of corresponding retrieval results and the user's view selections.
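The linked views described in this abstract can be illustrated with a minimal sketch. It is not the patented mechanism: the record fields (`subject`, `decade`) and the function names `view_counts` and `select` are assumptions made for illustration. Each view counts the current results per category, and selecting a category in one view narrows the result set so that every other view's counts change with it.

```python
from collections import Counter

# Hypothetical book records; the field names are illustrative assumptions.
BOOKS = [
    {"title": "A", "subject": "databases", "decade": "1980s"},
    {"title": "B", "subject": "databases", "decade": "1990s"},
    {"title": "C", "subject": "linguistics", "decade": "1990s"},
]

def view_counts(records, viewpoints):
    """For each viewpoint, count how many records fall in each category."""
    return {v: Counter(r[v] for r in records) for v in viewpoints}

def select(records, viewpoint, category):
    """Narrow the result set to one category of one view."""
    return [r for r in records if r[viewpoint] == category]

# Counts over the full result set.
views = view_counts(BOOKS, ["subject", "decade"])

# Selecting "databases" in the subject view updates the decade view as well.
narrowed = select(BOOKS, "subject", "databases")
narrowed_views = view_counts(narrowed, ["subject", "decade"])
```

Recomputing every view from the narrowed result set is the simplest way to keep the views consistent with one another; a real system would presumably compute the counts inside the retrieval engine rather than in application code.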

    METHOD AND DEVICE FOR EXTRACTING KNOWLEDGE FROM ENORMOUS DOCUMENT DATA AND MEDIUM

    Publication No.: JP2001084250A

    Publication date: 2001-03-30

    Application No.: JP23967499

    Application date: 1999-08-26

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To automatically extract documents satisfying a pattern from an enormous number of documents, to extract useful knowledge, and to reduce the time required for a response, by generating a field-dependent dictionary from document data, generating a syntax tree that takes modification into account by means of a language analysis device, and extracting and outputting frequently appearing patterns by means of a pattern extraction device. SOLUTION: A language feature analysis device generates an analysis-dependent dictionary. A language analysis device requires a field-dependent dictionary, because it needs attributes adjusted to the data to be analyzed, and words having the specified attributes must be generated for each field. The language feature analysis device finds such words in the actual data and registers them in the field-dependent dictionary. A pattern extraction device obtains frequently appearing patterns from the document data whose structure has been analyzed by the language analysis device, and retrieves the original documents whose syntax matches each pattern. A frequently-appearing pattern device displays the documents containing the detected frequently appearing pattern together with the syntax trees that match it.
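The pattern extraction step can be sketched roughly as follows. This is a simplified illustration and not the method claimed in the application: it assumes the documents have already been parsed into (modifier, head) pairs, treats a single pair as a "pattern", and uses a hypothetical `min_support` threshold. The function `frequent_patterns` and the sample data are invented for the example.

```python
from collections import defaultdict

def frequent_patterns(parsed_docs, min_support=2):
    """Return {pattern: [doc ids]} for patterns seen in >= min_support documents.

    parsed_docs maps a document id to the set of (modifier, head) pairs
    assumed to come from an upstream syntax analysis.
    """
    docs_by_pattern = defaultdict(set)
    for doc_id, pairs in parsed_docs.items():
        for pair in pairs:
            docs_by_pattern[pair].add(doc_id)
    return {
        pattern: sorted(ids)
        for pattern, ids in docs_by_pattern.items()
        if len(ids) >= min_support
    }

# Hypothetical parsed documents (for example, customer reports).
parsed = {
    "doc1": {("screen", "flickers"), ("battery", "drains")},
    "doc2": {("screen", "flickers")},
    "doc3": {("keyboard", "sticks")},
}

patterns = frequent_patterns(parsed, min_support=2)
```

Keeping the document ids next to each pattern mirrors the abstract's requirement that the original documents matching a frequent pattern can be retrieved and displayed.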

    JAPANESE SENTENCE DIVIDER
    Invention patent

    Publication No.: JPH01234975A

    Publication date: 1989-09-20

    Application No.: JP5650788

    Application date: 1988-03-11

    Applicant: IBM

    Abstract: PURPOSE: To facilitate system maintenance by applying an unregistered-word estimation rule to divisions based on a word dictionary, so as to cope with unregistered words appearing in a document. CONSTITUTION: The device is made up of an input section 1, first to fifth processing sections 2-6, a changeover section 7, an output section 8, and first to seventh storage sections 9-15 that store dictionaries, tables, rules and the like. Basically, a sentence is divided by using a word dictionary. When an unregistered word appears, the character string including the unregistered word is tentatively divided in various ways, each partial character string is matched against the words in the word dictionary, and the most likely division is decided based on the number of characters in the matched partial character strings. Thus, processing that requires many man-hours and much cost, such as managing and updating dictionaries, is omitted.
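The division strategy in this abstract can be sketched in a few lines. The abstract only says that the most likely division is chosen from the number of characters in the matched partial strings, so the scoring here is an assumption: prefer the division that covers the most characters with dictionary words, and break ties by using fewer pieces. The function `best_division`, the `max_len` limit, and the sample dictionary are all hypothetical.

```python
from functools import lru_cache

def best_division(text, dictionary, max_len=4):
    """Split text so that as many characters as possible lie in dictionary words.

    Pieces not found in the dictionary are kept as candidate unregistered words.
    """
    @lru_cache(maxsize=None)
    def solve(i):
        # Best (matched_chars, -piece_count, pieces) for the suffix text[i:].
        if i == len(text):
            return (0, 0, ())
        best = None
        for j in range(i + 1, min(len(text), i + max_len) + 1):
            piece = text[i:j]
            matched, neg_count, rest = solve(j)
            gain = len(piece) if piece in dictionary else 0
            candidate = (matched + gain, neg_count - 1, (piece,) + rest)
            if best is None or candidate[:2] > best[:2]:
                best = candidate
        return best

    return list(solve(0)[2])

# "田中" is not in the dictionary, so it plays the unregistered word.
words = {"私", "は", "学生", "です"}
division = best_division("私は田中です", words)
print(division)  # ['私', 'は', '田中', 'です']
```

Memoizing on the start position evaluates every tentative division without enumerating them one by one, so the cost is proportional to the text length times `max_len`.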
