Database functions-defined network switch and database system

    公开(公告)号:US10805241B2

    公开(公告)日:2020-10-13

    申请号:US15408206

    申请日:2017-01-17

    Abstract: A computer-implemented method and system are provided, including executing an application programming interface (API) in a network switch to define at least one of one or more database functions, performing, using one or more processors, the one or more database functions on at least a portion of data contained in a data message received at the switch, to generate result data, and routing the result data to one or more destination nodes. A database function-defined network switch includes a network switch and one or more processors to perform a pre-defined database function on query data contained in data messages received at the switch, to produce result data, wherein the pre-defined database function is performed on the query data in a first mode of operation to a state of full completion, generating complete result data and no skipped query data, or to a state of partial completion, generating partially completed result data and skipped query data.

    SYSTEM AND METHOD FOR DATA CACHING IN PROCESSING NODES OF A MASSIVELY PARALLEL PROCESSING (MPP) DATABASE SYSTEM
    2.
    发明申请
    SYSTEM AND METHOD FOR DATA CACHING IN PROCESSING NODES OF A MASSIVELY PARALLEL PROCESSING (MPP) DATABASE SYSTEM 有权
    用于数据处理的数字缓存系统和方法在大规模并行处理(MPP)数据库系统的处理中

    公开(公告)号:US20170010968A1

    公开(公告)日:2017-01-12

    申请号:US14794750

    申请日:2015-07-08

    Abstract: The present technology relates to managing data caching in processing nodes of a massively parallel processing (MPP) database system. A directory is maintained that includes a list and a storage location of the data pages in the MPP database system. Memory usage is monitored in processing nodes by exchanging memory usage information with each other. Each of the processing nodes manages a list and a corresponding amount of available memory in each of the processing nodes based on the memory usage information. Data pages are read from a memory of the processing nodes in response to receiving a request to fetch the data pages, and a remote memory manager is queried for available memory in each of the processing nodes in response to receiving the request. The data pages are distributed to the memory of the processing nodes having sufficient space available for storage during data processing.

    Abstract translation: 本技术涉及在大规模并行处理(MPP)数据库系统的处理节点中管理数据缓存。 维护一个包含MPP数据库系统中数据页的列表和存储位置的目录。 通过彼此交换内存使用信息,在处理节点中监视内存使用情况。 每个处理节点基于存储器使用信息管理每个处理节点中的列表和相应的可用存储器量。 响应于接收到提取数据页的请求,从处理节点的存储器读取数据页面,并且响应于接收到请求,在每个处理节点中查询远程存储器管理器以查找可用存储器。 在数据处理期间,将数据页分配给具有足够空间的处理节点的存储器。

    Apparatus and Method for Using Parameterized Intermediate Representation for Just-In-Time Compilation in Database Query Execution Engine
    3.
    发明申请
    Apparatus and Method for Using Parameterized Intermediate Representation for Just-In-Time Compilation in Database Query Execution Engine 审中-公开
    在数据库查询执行引擎中使用参数化中间表示进行即时编译的设备和方法

    公开(公告)号:US20160306847A1

    公开(公告)日:2016-10-20

    申请号:US14687473

    申请日:2015-04-15

    CPC classification number: G06F16/24542 G06F8/44 G06F8/443 G06F9/4552

    Abstract: Embodiments are provided herein for using parameterized Intermediate Representation (IR) for just-in-time (JIT) compilation in database query execution engines. In an embodiment, a method supporting query JIT compilation and execution in a database management system includes identifying a central processing unit (CPU) intensive function in a query, and identifying, in the CPU intensive function, one or more parameters. The one or more parameters represent variables with values changeable at different query instances. The CPU intensive function tis compiled to a parameterized IR including the one or more parameters. The parameterized IR of the CPU intensive function is saved in a catalog of parameterized IRs.

    Abstract translation: 本文提供了在数据库查询执行引擎中使用参数化中间表示(IR)用于即时(JIT)编译的实施例。 在一个实施例中,支持数据库管理系统中的查询JIT编译和执行的方法包括识别查询中的中央处理单元(CPU)密集型功能,以及在CPU密集功能中识别一个或多个参数。 一个或多个参数表示在不同查询实例中值可更改的变量。 CPU密集功能被编译为包括一个或多个参数的参数化IR。 CPU密集型功能的参数化IR保存在参数化IR的目录中。

    Apparatus and method for managing storage of a primary database and a replica database

    公开(公告)号:US12189657B2

    公开(公告)日:2025-01-07

    申请号:US17836105

    申请日:2022-06-09

    Abstract: System and method embodiments are provided for using different storage formats for a primary database and its replicas in a database managed replication (DMR) system. As such, the advantages of both formats can be combined with suitable design complexity and implementation. In an embodiment, data is arranged in a sequence of rows and stored in a first storage format at the primary database. The data arranged in the sequence of rows is also stored in a second storage format at the replica database. The sequence of rows is determined according to the first storage format or the second storage format. The first storage format is a row store (RS) and the second storage format is a column store (CS), or vice versa. In an embodiment, the sequence of rows is determined to improve compression efficiency at the CS.

    Systems and methods to optimize multi-version support in indexes

    公开(公告)号:US10706036B2

    公开(公告)日:2020-07-07

    申请号:US15475990

    申请日:2017-03-31

    Inventor: Guogen Zhang

    Abstract: System and method embodiments are provided for multi-version support in indexes in a database. The embodiments enable substantially optimized multi-version support in index and avoid backfill of commit log sequence number (LSN) for a transaction identifier (TxID). In an embodiment, a method in a data processing system for managing a database includes determining with the data processing system whether a record is deleted according to a delete indicator in an index leaf page record corresponding to the record; and determining with the data processing system, when the record is not deleted, whether the record is visible according to a new record indicator in the index leaf page record and according to a comparison of a system commit TxID at the transaction start with a record commit TxID obtained from the index leaf page record.

    System and method for database query

    公开(公告)号:US10031940B2

    公开(公告)日:2018-07-24

    申请号:US14864289

    申请日:2015-09-24

    Abstract: A method includes receiving, by a database system, a query statement and forming a runtime plan tree in accordance with the query statement. The method also includes traversing the runtime plan tree including determining whether a function node of the runtime plan tree is qualified for just-in-time (JIT) compilation. Additionally, the method includes, upon determining that the function node is a qualified for JIT compilation producing a string key in accordance with a function of the function node and determining whether a compiled object corresponding to the string key is stored in a compiled object cache.

    Systems and methods to optimize multi-version support in indexes

    公开(公告)号:US09645844B2

    公开(公告)日:2017-05-09

    申请号:US14229638

    申请日:2014-03-28

    Inventor: Guogen Zhang

    Abstract: System and method embodiments are provided for multi-version support in indexes in a database. The embodiments enable substantially optimized multi-version support in index and avoid backfill of commit log sequence number (LSN) for a transaction identifier (TxID). In an embodiment, a method in a data processing system for managing a database includes determining with the data processing system whether a record is deleted according to a delete indicator in an index leaf page record corresponding to the record; and determining with the data processing system, when the record is not deleted, whether the record is visible according to a new record indicator in the index leaf page record and according to a comparison of a system commit TxID at the transaction start with a record commit TxID obtained from the index leaf page record.

    Efficient methods and systems for consistent read in record-based multi-version concurrency control
    8.
    发明授权
    Efficient methods and systems for consistent read in record-based multi-version concurrency control 有权
    高效的方法和系统,用于一致性读取基于记录的多版本并发控制

    公开(公告)号:US09430274B2

    公开(公告)日:2016-08-30

    申请号:US14229519

    申请日:2014-03-28

    Inventor: Guogen Zhang

    Abstract: System and method embodiments are provided for consistent read in a record-based multi-version concurrency control (MVCC) in database (DB) management systems. In an embodiment, a method in a record-based multi-version concurrent control (MVCC) database (DB) management system for a snapshot consistent read includes copying a system commit transaction identifier (TxID) and a current log record sequence number (LSN) from a transaction log at a start of a reader without backfilling of a commit LSN of a transaction to records that are changed and without copying an entire transaction table by the reader; and determining whether a record is visible according to a record TxID, the commit TxID and a current LSN, wherein a transaction table is consulted only when the record TxID is equal to or larger than a commit TxID at a transaction start.

    Abstract translation: 提供了系统和方法实施例,用于在数据库(DB)管理系统中的基于记录的多版本并发控制(MVCC)中的一致性读取。 在一个实施例中,用于快照一致性读取的基于记录的多版本并发控制(MVCC)数据库(DB)管理系统中的方法包括复制系统提交事务标识符(TxID)和当前日志记录序列号(LSN) 从读写器的开始处的事务日志中,没有将交易的提交LSN重新填充到已经改变的记录,并且读者没有复制整个事务表; 以及根据记录TxID,提交TxID和当前LSN来确定记录是否可见,其中只有当事务开始时记录TxID等于或大于提交TxID时才查阅事务表。

    Query plan and operation-aware communication buffer management

    公开(公告)号:US10360267B2

    公开(公告)日:2019-07-23

    申请号:US14736050

    申请日:2015-06-10

    Abstract: Data messages having different priorities may be stored in different communication buffers of a network node. The data messages may then be forwarded from the communication buffers to working buffers as space becomes available in the working buffers. After being forwarded to the working buffers, the data messages may be available to be processed by upper-layer operations of the network node. Priorities may be assigned to the data messages based on a priority level of a query associated with the data messages, a priority level of an upper-layer operation assigned to process the data messages, or combinations thereof.

Patent Agency Ranking