Method for Two-Stage Query Optimization in Massively Parallel Processing Database Clusters
    11.
    Invention application (granted)

    Publication No.: US20140188841A1

    Publication date: 2014-07-03

    Application No.: US13730872

    Filing date: 2012-12-29

    CPC classification number: G06F17/30445 G06F17/30483

    Abstract: Queries may be processed more efficiently in a massively parallel processing (MPP) database by locally optimizing the global execution plan. The global execution plan and a semantic tree may be provided to MPP data nodes by an MPP coordinator. The MPP data nodes may then use the global execution plan and the semantic tree to generate a local execution plan. Thereafter, the MPP data nodes may select either the global execution plan or the local execution plan in accordance with a cost evaluation.
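The selection step described in this abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: the `Plan` class, the toy cost model in `estimate_cost`, the `allowed_scans` entry of the semantic tree, and the per-node statistics `local_rows` and `selectivity`. The sketch only shows a data node deriving a local plan from the global plan and semantic tree, then keeping whichever plan is cheaper under its own cost estimate.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Plan:
    origin: str  # "global" (sent by the coordinator) or "local" (rebuilt on the node)
    scan: str    # "seq_scan" or "index_scan"


def estimate_cost(plan: Plan, local_rows: int, selectivity: float) -> float:
    """Toy cost model (an assumption): a sequential scan reads every local row,
    an index scan pays a higher per-row price but only for matching rows."""
    if plan.scan == "seq_scan":
        return float(local_rows)
    return 4.0 * local_rows * selectivity


def local_optimize(global_plan: Plan, semantic_tree: dict,
                   local_rows: int, selectivity: float) -> Plan:
    """Derive candidate plans from the global plan and the semantic tree,
    then pick the cheapest one under node-local statistics."""
    candidates = [replace(global_plan, origin="local", scan=scan)
                  for scan in semantic_tree["allowed_scans"]]
    return min(candidates, key=lambda p: estimate_cost(p, local_rows, selectivity))


def choose_plan(global_plan: Plan, semantic_tree: dict,
                local_rows: int, selectivity: float) -> Plan:
    """Keep the global plan unless the local plan is strictly cheaper."""
    local_plan = local_optimize(global_plan, semantic_tree, local_rows, selectivity)
    local_cost = estimate_cost(local_plan, local_rows, selectivity)
    global_cost = estimate_cost(global_plan, local_rows, selectivity)
    return local_plan if local_cost < global_cost else global_plan


if __name__ == "__main__":
    coordinator_plan = Plan("global", "seq_scan")
    tree = {"allowed_scans": ["seq_scan", "index_scan"]}
    print(choose_plan(coordinator_plan, tree, local_rows=1000, selectivity=0.01))
```

In this sketch a node with a highly selective predicate switches to its own plan, while a node where the predicate matches most rows keeps the coordinator's plan.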

    Apparatus and method for managing storage of a primary database and a replica database

    Publication No.: US12189657B2

    Publication date: 2025-01-07

    Application No.: US17836105

    Filing date: 2022-06-09

    Abstract: System and method embodiments are provided for using different storage formats for a primary database and its replicas in a database managed replication (DMR) system. As such, the advantages of both formats can be combined with suitable design complexity and implementation. In an embodiment, data is arranged in a sequence of rows and stored in a first storage format at the primary database. The data arranged in the sequence of rows is also stored in a second storage format at the replica database. The sequence of rows is determined according to the first storage format or the second storage format. The first storage format is a row store (RS) and the second storage format is a column store (CS), or vice versa. In an embodiment, the sequence of rows is determined to improve compression efficiency at the CS.
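As a rough illustration of why the row sequence matters for the column store, the Python sketch below keeps one shared row order for both formats and chooses that order by sorting on a single column. The choice of run-length encoding as the compression scheme, the sort-based ordering, and the names `build_stores` and `run_length_encode` are assumptions made for the example, not details from the patent.

```python
from itertools import groupby


def run_length_encode(values):
    """Compress a column into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def build_stores(rows, columns, sort_column):
    """Order the rows once, then store the same sequence in both formats.

    The row store keeps whole tuples; the column store keeps one list per
    column. Sorting on `sort_column` groups equal values together, which
    shortens the run-length encoding of that column.
    """
    position = columns.index(sort_column)
    ordered = sorted(rows, key=lambda row: row[position])
    row_store = list(ordered)
    column_store = {name: [row[i] for row in ordered]
                    for i, name in enumerate(columns)}
    return row_store, column_store


if __name__ == "__main__":
    columns = ("id", "country")
    rows = [(1, "DE"), (2, "US"), (3, "DE"), (4, "US"), (5, "DE")]
    unordered_runs = run_length_encode([row[1] for row in rows])
    _, column_store = build_stores(rows, columns, "country")
    ordered_runs = run_length_encode(column_store["country"])
    print(len(unordered_runs), "runs before ordering,", len(ordered_runs), "after")
```

Because both stores share the ordered sequence, a row at a given position in the row store corresponds to the values at the same position in every column list.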

    System and Method for Flexible Distributed Massively Parallel Processing (MPP) Database
    13.
    Invention application (granted)

    Publication No.: US20140122484A1

    Publication date: 2014-05-01

    Application No.: US13663237

    Filing date: 2012-10-29

    CPC classification number: G06F17/30339

    Abstract: An embodiment method for massively parallel processing includes assigning a primary key to a first table in a database and a foreign key to a second table in the database, the foreign key of the second table identical to the primary key of the first table, determining a number of partition groups desired for the database, partitioning the first table into first partitions based on the primary key assigned and the number of partition groups desired, partitioning the second table into second partitions based on the foreign key assigned and the number of partition groups desired, and distributing the first partitions and the second partitions to the partition groups as partitioned. An embodiment system for implementing the embodiment methods is also disclosed.
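The partitioning scheme in this abstract can be sketched in Python as follows. The abstract only says that both tables are partitioned on the shared key and the desired number of partition groups; the use of a CRC32 hash modulo the group count, and the table and column names, are assumptions chosen for illustration.

```python
import zlib


def partition_group(key, num_groups):
    """Map a key to a partition group with a deterministic hash (an assumption)."""
    return zlib.crc32(str(key).encode("utf-8")) % num_groups


def partition_table(rows, key_column, num_groups):
    """Split a table (a list of dicts) into partitions, one per partition group."""
    partitions = {group: [] for group in range(num_groups)}
    for row in rows:
        partitions[partition_group(row[key_column], num_groups)].append(row)
    return partitions


if __name__ == "__main__":
    # Hypothetical tables: customers.customer_id is the primary key,
    # orders.customer_id is the matching foreign key.
    customers = [{"customer_id": i, "name": f"c{i}"} for i in range(1, 7)]
    orders = [{"order_id": 100 + i, "customer_id": (i % 6) + 1} for i in range(12)]

    num_groups = 3
    customer_parts = partition_table(customers, "customer_id", num_groups)
    order_parts = partition_table(orders, "customer_id", num_groups)
    for group in range(num_groups):
        print(group, len(customer_parts[group]), len(order_parts[group]))
```

Because both tables are partitioned with the same function on the same key values, every order lands in the same partition group as its customer, so a join on that key can run inside each group without moving rows between groups.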

    Methods and Systems for Dynamically Allocating Resources and Tasks Among Database Work Agents in an SMP Environment
    16.
    Invention application (under examination, published)

    Publication No.: US20150227586A1

    Publication date: 2015-08-13

    Application No.: US14175489

    Filing date: 2014-02-07

    Abstract: Dynamically re-allocating tasks and/or memory quotas amongst work agents in symmetric multiprocessing (SMP) systems can significantly mitigate delays and inefficiencies associated with data skew. For example, unfinished tasks can be reallocated from a busy work agent to an idle work agent upon determining that the idle work agent has finished processing its originally assigned set of tasks. Alternatively, a portion of a memory quota assigned to an idle work agent can be reallocated to a busy work agent for use in processing the remaining tasks. Memory quotas can be re-assigned by releasing the memory quota back into a memory pool once the idle work agent has finished processing its originally assigned tasks, and then reallocating some or all of the memory quota to the busy work agent.
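Both reallocation strategies from this abstract can be sketched in a few lines of Python. The `WorkAgent` class, the rule of moving half of the unfinished tasks, the `fraction` parameter, and the dictionary used as a memory pool are all hypothetical choices for this example; the abstract does not specify them.

```python
from collections import deque


class WorkAgent:
    def __init__(self, name, tasks, memory_quota):
        self.name = name
        self.tasks = deque(tasks)         # unfinished tasks
        self.memory_quota = memory_quota  # arbitrary units

    @property
    def idle(self):
        return not self.tasks


def reallocate_tasks(idle_agent, busy_agent):
    """Move half of the busy agent's unfinished tasks to an agent that has
    finished its own set. Returns the number of tasks moved."""
    if not idle_agent.idle:
        return 0
    moved = len(busy_agent.tasks) // 2
    for _ in range(moved):
        idle_agent.tasks.append(busy_agent.tasks.pop())
    return moved


def reallocate_memory(idle_agent, busy_agent, pool, fraction=1.0):
    """Release the idle agent's quota into the pool, then grant some or all
    of it to the busy agent. Returns the amount granted."""
    if not idle_agent.idle:
        return 0
    released = idle_agent.memory_quota
    idle_agent.memory_quota = 0
    pool["free"] += released
    grant = int(released * fraction)
    pool["free"] -= grant
    busy_agent.memory_quota += grant
    return grant


if __name__ == "__main__":
    finished = WorkAgent("agent-1", [], memory_quota=100)
    skewed = WorkAgent("agent-2", ["t1", "t2", "t3", "t4", "t5"], memory_quota=100)
    print("tasks moved:", reallocate_tasks(finished, skewed))
```

The two functions are alternatives: the first shifts work toward the free agent, the second shifts memory toward the agent that still has work.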

    System and Method for Flexible Distributed Massively Parallel Processing (MPP)
    17.
    Invention application (granted)

    Publication No.: US20140108633A1

    Publication date: 2014-04-17

    Application No.: US13653308

    Filing date: 2012-10-16

    Abstract: An embodiment method for massively parallel processing includes initiating a management instance on an initial machine, the management instance generating an initial partition corresponding to the initial machine, determining a total number of partitions desired for processing a database, the total number of partitions including the initial partition, determining a number of additional machines available to process the database, grouping the initial machine and the additional machines together in a pod, and launching the management instance on the additional machines in the pod to generate the total number of partitions desired for the database. Additional embodiment methods and an embodiment system operable to perform such methods are also disclosed.
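One way to picture the pod setup in this abstract is the small Python sketch below. It assumes, purely for illustration, that partitions are numbered from 0, that the initial machine holds partition 0, and that the remaining partitions are spread over the pod round-robin; the abstract does not state how partitions are placed, and `plan_pod` is a hypothetical name.

```python
def plan_pod(initial_machine, additional_machines, total_partitions):
    """Group the machines into a pod and decide which machine's management
    instance generates which partition.

    Partition 0 is the initial partition on the initial machine; the rest
    are assigned round-robin across the pod (an assumption of this sketch).
    """
    if total_partitions < 1:
        raise ValueError("total_partitions must include the initial partition")
    pod = [initial_machine] + list(additional_machines)
    assignment = {machine: [] for machine in pod}
    assignment[initial_machine].append(0)
    for partition in range(1, total_partitions):
        assignment[pod[partition % len(pod)]].append(partition)
    return assignment


if __name__ == "__main__":
    for machine, partitions in plan_pod("m0", ["m1", "m2"], 5).items():
        print(machine, partitions)
```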
