Abstract:
PURPOSE: To automatically display what character a set of documents narrowed down at present has by obtaining the number of documents to which keywords are given with respect to a set of keywords given to the set of narrowed-down documents and displaying them in the frequency order. CONSTITUTION: The keyword inputted by a user input and display module 210 is transferred to a keyword retrieval engine 208 to retrieve an ID index 204 to the keyword, and a set of IDs of pertinent documents is returned. The keyword retrieval result (the number of retrieved documents, titles, or the like) is displayed in individual windows by the module 210. Obtained keywords are sorted in the descending order based on counted values by a keyword collection and sort module 212 and are displayed by the module 210. A list of keywords sorted in the descending order in this manner is watched to find what character the set of narrowed-down documents has.
Abstract:
PROBLEM TO BE SOLVED: To enable a user to set whether or not to disclose preference information of the user depending on preference information of other users or a user group. SOLUTION: By applying a privacy policy, which describes and manages a policy for disclosure of preference information, to the preference information, it is determined whether or not to disclose some or all of the information to a third party based on classification as a result of matching by a matching system. The method includes: disclosing, to a user who has made a query, only preference information that matches preferences of the user; and disclosing only preference information common to the user group. The description of a privacy policy makes it possible to achieve a meaningful communication by disclosing preference information to others within a detailed range while keeping the preference information undisclosed if the user does not wish disclosure thereof, such as disclosing only preference information including preferences matching those of the other user or disclosing only preference information including preferences shared in a certain user group. COPYRIGHT: (C)2009,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To precisely retrieve text information which has strong association with numerical information such as time series data, and to display the text information by associating it with the numerical information. SOLUTION: This information processor for associating text information with the numerical information of time series is provided with a generation part for generating first words and phrases showing the change of a first numerical information in a certain period; a first retrieval part for retrieving text information matched with a retrieval key including the first words and phrases from among a plurality of pieces of text information as the object of retrieval; and an output part for outputting the retrieved text information by associating it with the first numerical information in the period. In this information processor, the first retrieval part may retrieve text information describing the fact that the first numerical information has been changed as shown by the first words and phrases from among a plurality of pieces of text information prepared or announced in the period. COPYRIGHT: (C)2007,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To present valuable information to a user at the time of periodically observing a plurality of dynamically changing information sources. SOLUTION: This information processing system for processing information to be acquired from a plurality of sites connected through the Internet 10 is provided with a crawler 13 patrolling in the sites on the Internet 10 registered in a registered site DB11, a meta data DB12 for storing meta data to be acquired according to the patrol by extracting elements carrying information from contents to be referred to by a URL, a significant information element extracting mechanism 30 for reading the information stored in the meta data DB12, and for extracting significant information elements based on the level of coincidence of the information elements, a significant information element DB40 for storing the extracted significant information, and a result display mechanism 41 for operating visualization processing to the stored significant information elements.
Abstract:
PROBLEM TO BE SOLVED: To effectively obtain information and provide an effective means for presenting the obtained information without a definite indication of an intention for obtaining the information, such as pushing down of a retrieval button. SOLUTION: A kana (Japanese syllabary)/kanji (Chinese character) converting routine is started (step 50), and a character string is inputted (step 51). Next, a conversion key for converting the inputted character string to kanji is pushed down (step 52). In this timing for pushing down the conversion key, a homonym candidate selection routine is started (step 53), and a conversion candidate is presented. In the timing for pushing down the conversion key or conversion operation of the conversion candidate in the homonym candidate selection routine, namely candidate pre-selection operation (step 54), an information access routine is started (step 55). Next, information access is executed (step 56), and the retrieval result (the information obtained by access) is obtained (step 57). Then, the retrieval result is presented (step 58).
Abstract:
PURPOSE: To enable the execution of machine translation with which the merit of CBMT is provided and processing efficiency is improved by generalizing knowledge provided from translated examples and using such generalized knowledge for translation processing. CONSTITUTION: In order to find the high-order conception of a word, first of all, it is investigated whether any translation pattern and thesaurus having the same number as its translation pattern description are existent or not and when they are existent, that translation pattern thesaurus is retrieved before the thesaurus. Thus, by searching the generalized translation knowledge, shortest distance generalized translation knowledge is provided. In the case of retrieving the shortest distance generalized knowledge to be applied to inputting, it is not necessary to calculate the degree of similarity.
Abstract:
PURPOSE: To construct knowledge for canceling the structural equivocality of natural language by extracting pairs of modifying word and modified words concerning respective plural possible dependence relations concerning a sentence, for which structural equivocality is discriminated, and deciding the dependence relation of the highest likelihood based on the distance found for each pair. CONSTITUTION: The structural equivocality to be the maximum bottle neck for analyzing the natural language sentence is generated because plural modification (dependence) relations between vocabularies can be considered. Then, first of all, the knowledge is expressed in three structure showing synonymous relation, hierarchical relation and dependence relation. Concerning the equivocal modification, the dependence relation between vocaburaries defined in the background knowledge is searched by a dependence relation analyzer while using the synonymous relation or hierarchical relation, the optimum modification is selected while using limitations provided from the sentence and limitations provided from contexts, and equivocality is canceled. The fixed dependence structure is registered in a knowledge base as context dependence relation data. Thus, the knowledge for canceling structural equivocality in natural language can be constructed.
Abstract:
The present invention provides, in a database, a technique for automatically indicating what property a current set of documents narrowed by a search criteria possesses, and an approach for retrieving a set of similar documents by specifying a document rather than a keyword. First, in a set of keywords attached to the set of documents narrowed in response to the user retrieval processing, a system according to preferred embodiments of the present invention counts the number of documents to which each of the keywords is attached and displays the keywords in order of decreasing frequency. Next, a user specifies his/her document(s) of interest among the set of documents. In response to this, out of the keywords attached to the documents specified by the user, a keyword able to retrieve any documents other than are specified is displayed. From the set of keywords attached to the current set of document narrowed, this is determined by omitting the keywords meeting the following conditions: (1) the keyword is not attached to any documents other than the specified one; and (2) the keyword is attached to all the documents in the current set of documents. An automatic retrieval with this keyword will enable narrowing by document rather by the keyword.
Abstract:
PROBLEM TO BE SOLVED: To retrieve a document data while appropriately reflecting the content of a retrieving sentence, and to appropriately detect the occurrence of a problem out of document data sequentially added. SOLUTION: This retrieval system retrieves the document data including the content of the retrieving sentence from the plurality of document data, and includes a document database for storing the plurality of document data, a concept database for storing a plurality of concepts by hierarchical structure, a document data concept extraction part for extracting document concept corresponding to document data, based on a keyword included in each document data, a retrieving sentence concept extraction part for extracting a retrieving sentence concept, based on the keyword included in the retrieving sentence, a concept retrieval part for retrieving the document data with the retrieving sentence concept serving as an upper hierarchy or lower hierarchy of the document concept, out of the plurality of document data, and a retrieval result output part for outputting the document data retrieved by the concept retrieval part, as the document data including the content assigned by the retrieving sentence. COPYRIGHT: (C)2010,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To present information regarding a file that has been secondly and derivatively created from a leaked file, by retrieving this derivatively created file from the firstly leaked file, and to present personal information recorded in a computer accessible via a broadband network. SOLUTION: The disclosed apparatus comprises: a recording section for recording original data of a leaked file; an extraction section for extracting a representation corresponding to information, to prevent leakage, contained in the original data and files present in a recording region to be investigated; an investigation section for investigating a degree of relation between the files present in the recording region to be investigated and the original data; and a presentation section for presenting information, based on the degree of relation, regarding a file created based on the leaked file. COPYRIGHT: (C)2007,JPO&INPIT