DATABASE MANAGEMENT SYSTEMS FOR MANAGING DATA WITH DATA CONFIDENCE

    公开(公告)号:US20200327106A1

    公开(公告)日:2020-10-15

    申请号:US16916851

    申请日:2020-06-30

    Applicant: Ocient Inc.

    Abstract: A method for managing data storage and retrieval and operated within a database management system includes determining whether a data confidence value of a data record that is stored or is to be stored in memory of the database management system is less than a data confidence threshold, where the data confidence value includes one or more of an estimate of an accuracy of data within the data record, an estimate of the accuracy of the data record, and an estimate of a reliability level of the data. When the data confidence value is less than the data confidence threshold, the method continues by generating a confidence data record based on the data record and the data confidence value. The method continues by storing the confidence data record in memory of the database management system.

    DATABASE MANAGEMENT SYSTEM CLUSTER NODE SUBTASKING DATA QUERY

    公开(公告)号:US20180285414A1

    公开(公告)日:2018-10-04

    申请号:US15942976

    申请日:2018-04-02

    Applicant: OCIENT, INC

    Abstract: A cluster node within a cluster of a highly parallel database system includes at least one processing unit that runs a set of first tier threads and a set of second tier threads, a storage disk drive, and a networking interface. When a first tier thread receives a task, it divides the task into a set of subtasks. The first tier thread also assigns the set of subtasks between a subset of the set of second tier threads for execution. Each second tier thread within the subset processes the one or more subtasks it is assigned to. When the task is a work, the subtasks are work units. When the task is a work unit, the subtasks are subwork units.

    DIVIDING A DATA PARTITION BASED ON A DATA STORAGE CODING SCHEME

    公开(公告)号:US20240362226A1

    公开(公告)日:2024-10-31

    申请号:US18766879

    申请日:2024-07-09

    Applicant: Ocient Inc.

    CPC classification number: G06F16/24554 G06F16/221 G06F16/24542

    Abstract: A method for execution by at least one computing entity of a database system, the method includes obtaining a plurality of data partitions of a data set for storage in the database system, where the data set is organized in rows and columns, and the rows correspond to data records and the columns correspond to fields of the data records. The method further includes dividing a first partition of a plurality of data partitions to produce a first number of first raw data segments for storage in the database system, where the first number is based on a first data storage coding scheme. The method further includes dividing a second partition of the plurality of data partitions to produce a second number of second raw data segments for storage in the database system, where the second number is based on a second data storage coding scheme.

    Determining a coding scheme for a partition of a data set

    公开(公告)号:US20240104100A1

    公开(公告)日:2024-03-28

    申请号:US18509455

    申请日:2023-11-15

    Applicant: Ocient Inc.

    CPC classification number: G06F16/24554 G06F16/221 G06F16/24542

    Abstract: A method includes obtaining a plurality of data partitions of a data set for storage in a database system. The method further includes determining a first data storage coding scheme for a first partition of the plurality of data partitions, where the first data storage coding scheme includes first encoding parameters regarding encoding the first partition into first data segments and first parity segments. The method further includes determining a second data storage coding scheme for a second partition of the plurality of data partitions. The method further includes dividing the first partition to produce a first number of first raw data segments, where the first number is based on the first data storage coding scheme. The method further includes dividing the second partition to produce a second number of second raw data segments, where the second number is based on the second data storage coding scheme.

Patent Agency Ranking