System and method for distributed SQL join processing in shared-nothing relational database clusters using self directed data streams
    2.
    发明授权
    System and method for distributed SQL join processing in shared-nothing relational database clusters using self directed data streams 有权
    使用自定义数据流的无共享关系数据库集群中的分布式SQL连接处理的系统和方法

    公开(公告)号:US09576026B2

    公开(公告)日:2017-02-21

    申请号:US13802023

    申请日:2013-03-13

    CPC classification number: G06F17/30445

    Abstract: System and method embodiments are provided for improving the performance of query processing in a massively parallel processing (MPP) database system by pushing down join query processing to data nodes recursively. An embodiment method includes receiving, at a coordinator process, a join query associated with a plurality of tables of the MPP database system, generating, at the coordinator process, an execution plan tree for the join query, and processing, at each of a plurality of data nodes communicating with the coordinator process, the execution plan tree to obtain join query results. The method further includes, upon detecting a next join operator below a top join operator in the execution plan tree at each of the data nodes, forwarding to the other data nodes a sub-tree for the next join operator, and receiving, at each of the data nodes from the other data nodes, sub-tree processing results.

    Abstract translation: 提供了系统和方法实施例,用于通过将连接查询处理递归地推送到数据节点来提高大规模并行处理(MPP)数据库系统中查询处理的性能。 一种实施方式包括在协调器进程处接收与MPP数据库系统的多个表相关联的连接查询,在协调器进程处生成用于连接查询的执行计划树,并在多个表中的每一个处处理 的数据节点与协调器进程通信,执行计划树获取连接查询结果。 该方法还包括:在每个数据节点处检测执行计划树中的顶部连接运算符下面的下一个连接运算符,向下一个连接运算符转发另一个数据节点的子树,并在每个 来自其他数据节点的数据节点,子树处理结果。

    Method for two-stage query optimization in massively parallel processing database clusters
    3.
    发明授权
    Method for two-stage query optimization in massively parallel processing database clusters 有权
    大规模并行处理数据库集群中的两阶段查询优化方法

    公开(公告)号:US09311354B2

    公开(公告)日:2016-04-12

    申请号:US13730872

    申请日:2012-12-29

    CPC classification number: G06F17/30445 G06F17/30483

    Abstract: Queries may be processed more efficiently in an massively parallel processing (MPP) database by locally optimizing the global execution plan. The global execution plan and a semantic tree may be provided to MPP data nodes by an MPP coordinator. The MPP data nodes may then use the global execution plan and the semantic tree to generate a local execution plan. Thereafter, the MPP data nodes may select either the global execution plan or the local execution plan is accordance with a cost evaluation.

    Abstract translation: 在大规模并行处理(MPP)数据库中,可以通过局部优化全局执行计划,更有效地处理查询。 全局执行计划和语义树可以由MPP协调器提供给MPP数据节点。 然后,MPP数据节点可以使用全局执行计划和语义树来生成本地执行计划。 此后,MPP数据节点可以选择全局执行计划,或者本地执行计划根据成本评估。

    System and Method for Out of Order Multiple Query Execution within Stored Procedure
    4.
    发明申请
    System and Method for Out of Order Multiple Query Execution within Stored Procedure 有权
    在存储过程中执行多个查询执行的系统和方法

    公开(公告)号:US20150310065A1

    公开(公告)日:2015-10-29

    申请号:US14264851

    申请日:2014-04-29

    Inventor: Qingqing Zhou

    CPC classification number: G06F17/30463 G06F17/30339

    Abstract: Embodiments are provided herein for efficient out-of-order (OOO) multiple query execution within a stored procedure in a database processing system. An embodiment method includes compiling a procedure comprising a plurality of statements. During the compiling, any dependencies between the statements are detected and maintained in a dependency table. The method further includes executing the procedure. During the execution, the method includes upon detecting a change in a dependency between the statements, updating the dependency in the dependency table. The statements are scheduled for OOO execution according to the dependency table with the updated dependencies.

    Abstract translation: 本文提供了用于在数据库处理系统中的存储过程中的有效的无序(OOO)多个查询执行的实施例。 实施例方法包括编译包括多个语句的过程。 在编译期间,在依赖关系表中检测和维护语句之间的任何依赖关系。 该方法还包括执行该过程。 在执行期间,该方法包括在检测到语句之间的依赖性的变化时,更新依赖关系表中的依赖性。 根据具有更新的依赖关系的依赖关系表,这些语句被安排为OOO执行。

    Database functions-defined network switch and database system

    公开(公告)号:US10805241B2

    公开(公告)日:2020-10-13

    申请号:US15408206

    申请日:2017-01-17

    Abstract: A computer-implemented method and system are provided, including executing an application programming interface (API) in a network switch to define at least one of one or more database functions, performing, using one or more processors, the one or more database functions on at least a portion of data contained in a data message received at the switch, to generate result data, and routing the result data to one or more destination nodes. A database function-defined network switch includes a network switch and one or more processors to perform a pre-defined database function on query data contained in data messages received at the switch, to produce result data, wherein the pre-defined database function is performed on the query data in a first mode of operation to a state of full completion, generating complete result data and no skipped query data, or to a state of partial completion, generating partially completed result data and skipped query data.

    REDISTRIBUTING TABLE DATA IN A DATABASE CLUSTER

    公开(公告)号:US20190163773A1

    公开(公告)日:2019-05-30

    申请号:US15827660

    申请日:2017-11-30

    Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.

    SYSTEM AND METHOD FOR DATA CACHING IN PROCESSING NODES OF A MASSIVELY PARALLEL PROCESSING (MPP) DATABASE SYSTEM
    8.
    发明申请
    SYSTEM AND METHOD FOR DATA CACHING IN PROCESSING NODES OF A MASSIVELY PARALLEL PROCESSING (MPP) DATABASE SYSTEM 有权
    用于数据处理的数字缓存系统和方法在大规模并行处理(MPP)数据库系统的处理中

    公开(公告)号:US20170010968A1

    公开(公告)日:2017-01-12

    申请号:US14794750

    申请日:2015-07-08

    Abstract: The present technology relates to managing data caching in processing nodes of a massively parallel processing (MPP) database system. A directory is maintained that includes a list and a storage location of the data pages in the MPP database system. Memory usage is monitored in processing nodes by exchanging memory usage information with each other. Each of the processing nodes manages a list and a corresponding amount of available memory in each of the processing nodes based on the memory usage information. Data pages are read from a memory of the processing nodes in response to receiving a request to fetch the data pages, and a remote memory manager is queried for available memory in each of the processing nodes in response to receiving the request. The data pages are distributed to the memory of the processing nodes having sufficient space available for storage during data processing.

    Abstract translation: 本技术涉及在大规模并行处理(MPP)数据库系统的处理节点中管理数据缓存。 维护一个包含MPP数据库系统中数据页的列表和存储位置的目录。 通过彼此交换内存使用信息,在处理节点中监视内存使用情况。 每个处理节点基于存储器使用信息管理每个处理节点中的列表和相应的可用存储器量。 响应于接收到提取数据页的请求,从处理节点的存储器读取数据页面,并且响应于接收到请求,在每个处理节点中查询远程存储器管理器以查找可用存储器。 在数据处理期间,将数据页分配给具有足够空间的处理节点的存储器。

    Methods and Systems for Dynamically Allocating Resources and Tasks Among Database Work Agents in an SMP Environment
    9.
    发明申请
    Methods and Systems for Dynamically Allocating Resources and Tasks Among Database Work Agents in an SMP Environment 审中-公开
    在SMP环境中动态分配资源和数据库工作代理任务的方法和系统

    公开(公告)号:US20150227586A1

    公开(公告)日:2015-08-13

    申请号:US14175489

    申请日:2014-02-07

    Abstract: Dynamically re-allocating tasks and/or memory quotas amongst work agents in symmetric multiprocessing (SMP) systems can significantly mitigate delays and inefficiencies associated with data skew. For example, unfinished tasks can be reallocated from a busy work agent to an idle work agent upon determining that the idle work agent has finished processing its originally assigned set of tasks. Alternatively, a portion of a memory quota assigned to an idle work agent can be reallocated to a busy work agent for use in processing the remaining tasks. Memory quotas can be re-assigned by releasing the memory quota back into a memory pool once the idle work agent has finished processing its originally assigned tasks, and then reallocating some or all of the memory quota to the busy work agent.

    Abstract translation: 对称多处理(SMP)系统中的工作代理之间的动态重新分配任务和/或内存配额可以显着地减轻与数据偏移相关的延迟和低效率。 例如,未确定的任务可以在确定空闲工作代理已经完成处理其原始分配的一组任务之后,从繁忙的工作代理重新分配到空闲工作代理。 或者,分配给空闲工作代理的内存配额的一部分可以被重新分配给忙工作代理,以用于处理剩余的任务。 一旦空闲工作代理完成处理其最初分配的任务,然后将部分或全部内存配额重新分配给忙工作代理,则可以通过将内存配额释放回内存池来重新分配内存配额。

    Redistributing table data in a database cluster

    公开(公告)号:US11151111B2

    公开(公告)日:2021-10-19

    申请号:US15827660

    申请日:2017-11-30

    Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.

Patent Agency Ranking