Abstract:
Improved privacy preservation techniques are disclosed for use in accordance with data mining. By way of example, a technique for preserving privacy of data records for use in a data mining application comprises the following steps/operations. Different privacy levels are assigned to the data records. Condensed groups are constructed from the data records based on the privacy levels, wherein summary statistics are maintained for each condensed group. Pseudo-data is generated from the summary statistics, wherein the pseudo-data is available for use in the data mining application. Principles of the invention are capable of handling both static and dynamic data sets.
Abstract:
A system and method for determining patterns in a data sequence (Fig. 1) constructs a compatibility matrix (100) which provides a probability between an actual occurrence of an item and an observed occurrence of that or another item between each item in the data sequence. Candidate patterns are generated (201). The candidate patterns include items in the data sequence. The candidate patterns are checked against the data sequence to determine a match value (203) based on the compatibility matrix, and significant matches are determined based on candidate patterns having the match value above a threshold (204).
Abstract:
PROBLEM TO BE SOLVED: To provide a method and system for providing a consumer aggregation service in a network service provider. SOLUTION: At first, when a user registers himself or herself in a consumer collecting service 110, the consumer collecting service 110 replaces the identification information of the registered user 134 with its own identification information when the registered user 134 reads a world wide web(WWW) site. Moreover, the consumer collecting server 110 intercepts an electronic merchandise order made by the registered user 134 with a retailer 124 though a network 100, and charges the registered user 134 with the order, and carries out the order with the retailer 124 on behalf of the registered user 134, and makes the retail agent 124 charge the consumer aggregation service itself with the order. Moreover, the consumer collecting server 110 collects coupons or bonuses from the retailers 124 based on shopping performed by the registered users 134, and stores them in a data base, and distributes the collected coupons or bonuses to the registered users 134 in a prescribed method.
Abstract:
PROBLEM TO BE SOLVED: To provide a method and device for preparing an effective and valid decision tree for executing valid classification. SOLUTION: In this method and device for generating a decision tree by using linear judgment analysis, and for executing the decision tree for data classification (categorization), it is desired that the data are constituted in the format of multi-dimensional objects, for example, the format of data records including characteristic valuables and class variables in a decision tree generation mode and data records including only the characteristics variables in a decision tree scanning mode. For example, a more effective classification system with monitor is prepared by this method. Generally, the recursive division of the decision tree is included in this method so that the maximum amounts of separation can be achieved among the class values of training data. This is executed by finding the effective combination of the variables in order to prepare the decision tree by recursively dividing the training data. This decision tree is used for classifying input test data afterwards.
Abstract:
Systems and methods are provided wherein a base device facilitates a determination of a location associated with an occurrence of an event. A location of the base device is determined, and the base device receives information from an event device fig. 3, element 602. For example, a device in a police automobile may receive information from a police handgun via a wireless communication. Information is then stored to enable a determination of a location associated with an occurrence of an event.
Abstract:
PROBLEM TO BE SOLVED: To improve cluster quality when data substantially proceeds with a lapse of time. SOLUTION: In regard to a technique performing data clustering of a data stream, first, online statistics are generated from a data stream. Thereafter, offline processing of the online statistics is performed when the offline processing is needed or desired. The online statistics can be generated through the reception of data points from the data stream, and the formation and the update of a data group. The offline processing can be performed by re-clustering the data point group in the periphery of the sampled data points, and a newly formed cluster is reported. COPYRIGHT: (C)2005,JPO&NCIPI
Abstract:
PROBLEM TO BE SOLVED: To provide a method for distinguishing and finding a significant cyclic pattern in event subsequence even when the pattern includes disturbance having a length up to a threshold value of prior definition. SOLUTION: A method for distinguishing a partial cyclic pattern in event sequence that includes a list of events from the event sequence in the pattern is provided. At least one pattern which is at least one subsequence in the event sequence and at least one pattern in at least one subsequence distinguishes at least one subsequence in the event sequence and at least one pattern in at least one subsequence which exceeds the minimum repetitive number in at least one subsequence and in which a distance between continuous repetitions of at least one pattern in at least one subsequence does not exceed a distance threshold value of prior definition. At least one of at least one pattern and at least one subsequence is stored.
Abstract:
PROBLEM TO BE SOLVED: To provide a method and system for cache server balancing. SOLUTION: In a system including a collection of cooperating cache servers such as proxy cache servers, a request can be forwarded to a cooperating cache server if the requested object cannot be found locally. An overload condition is detected if for example, due to reference skew, some objects are in high demand by all the clients and the cache servers that contain those hot objects become overloaded due to forwarded requests. In response, the load is balanced by shifting some or all of the forwarded requests from an overloaded cache server to a less loaded one. Both centralized and distributed load balancing environments are described. COPYRIGHT: (C)2007,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide a system and method for charging to one or plural persons concerned in response to a client access to an internet. SOLUTION: At least one of one or plural persons concerned is identified as a person responsible for payment of claimed charge (a step 203). Then, charge is shared by each responsible person concerned based on a prescribed function. Then, the total sum of the charge to each responsible person concerned is calculated based on the function of share and the band width using amounts by the client (a step 204).
Abstract:
In a system including a collection of cooperating cache servers, such as proxy cache servers, a request can be forwarded to a cooperating cache server if the requested object cannot be found locally. An overload condition is detected if for example, due to reference skew, some objects are in high demand by all the clients and the cache servers that contain those hot objects become overloaded due to forwarded requests. In response, the load is balanced by shifting some or all of the forwarded requests from an overloaded cache server to a less loaded one. Both centralized and distributed load balancing environments are described.