PERFORMING IN-MEMORY COLUMNAR ANALYTIC QUERIES ON EXTERNALLY RESIDENT DATA

    公开(公告)号:WO2019067964A1

    公开(公告)日:2019-04-04

    申请号:PCT/US2018/053523

    申请日:2018-09-28

    Abstract: Techniques herein use in-memory column vectors to process data that is external to a database management system (DBMS) and logically join the external data with data that is native to the DBMS. In an embodiment, a computer maintains a data dictionary for native data that is durably stored in an DBMS and external data that is not durably stored in the DBMS. From a client through a connection to the DBMS, the computer receives a query. The computer loads the external data into an in-memory column vector that resides in random access memory of the DBMS. Based on the query and the data dictionary, the DBMS executes a data join of the in-memory column vector with the native data. To the client through said connection, the computer returns results of the query based on the data join.

    FLEXIBLE IN-MEMORY COLUMN STORE PLACEMENT
    3.
    发明申请
    FLEXIBLE IN-MEMORY COLUMN STORE PLACEMENT 审中-公开
    灵活的记忆中柱存储放置

    公开(公告)号:WO2018063723A1

    公开(公告)日:2018-04-05

    申请号:PCT/US2017/049464

    申请日:2017-08-30

    Abstract: Techniques are described herein for distributing distinct portions of a database object across volatile memories of selected nodes of a plurality of nodes in a clustered database system. The techniques involve storing a unit-to-service mapping that associates a unit (a database object or portion thereof) to one or more database services. The one or more database services are mapped to one or more nodes. The nodes to which a service is mapped may include nodes in disjoint database systems, so long as those database systems have access to a replica of the unit. The database object is treated as in-memory enabled by nodes that are associated with the service, and are treated as not in-memory enabled by nodes that are not associated with the service.

    Abstract translation: 本文描述了用于跨集群数据库系统中的多个节点的选定节点的易失性存储器分布数据库对象的不同部分的技术。 这些技术涉及存储将单元(数据库对象或其部分)与一个或多个数据库服务相关联的单元到服务的映射。 一个或多个数据库服务被映射到一个或多个节点。 服务映射到的节点可以包括不相交数据库系统中的节点,只要这些数据库系统可以访问该单元的副本。 数据库对象被视为内存使能,由与服务关联的节点启用,并被视为不与服务关联的节点启用内存。

    EFFICIENT HARDWARE INSTRUCTIONS FOR SINGLE INSTRUCTION MULTIPLE DATA PROCESSORS
    6.
    发明申请
    EFFICIENT HARDWARE INSTRUCTIONS FOR SINGLE INSTRUCTION MULTIPLE DATA PROCESSORS 审中-公开
    高效的硬件指令,用于单个指令多个数据处理器

    公开(公告)号:WO2014150913A2

    公开(公告)日:2014-09-25

    申请号:PCT/US2014/024523

    申请日:2014-03-12

    Abstract: A method and apparatus for efficiently processing data in various formats in a single instruction multiple data ("SIMD") architecture is presented. Specifically, a method to unpack a fixed-width bit values in a bit stream to a fixed width byte stream in a SIMD architecture is presented. A method to unpack variable-length byte packed values in a byte stream in a SIMD architecture is presented. A method to decompress a run length encoded compressed bit-vector in a SIMD architecture is presented. A method to return the offset of each bit set to one in a bit-vector in a SIMD architecture is presented. A method to fetch bits from a bit-vector at specified offsets relative to a base in a SIMD architecture is presented. A method to compare values stored in two SIMD registers is presented.

    Abstract translation: 提出了一种在单指令多数据(“SIMD”)结构中有效处理各种格式的数据的方法和装置。 具体地说,提出了一种在SIMD架构中将位流中的固定宽度位值解包为固定宽度字节流的方法。 介绍了一种解码SIMD架构中字节流中可变长度字节打包值的方法。 提出了一种在SIMD架构中解压缩运行长度编码的压缩位向量的方法。 提出了一种在SIMD架构中将位设置为1的偏移量返回到位向量中的方法。 提出了一种从SIMD架构中相对于基址指定的偏移量从位向量获取位的方法。 提出了一种比较存储在两个SIMD寄存器中的值的方法。

    TECHNIQUES FOR MAINTAINING COLUMN VECTORS OF RELATIONAL DATA WITHIN VOLATILE MEMORY
    7.
    发明申请
    TECHNIQUES FOR MAINTAINING COLUMN VECTORS OF RELATIONAL DATA WITHIN VOLATILE MEMORY 审中-公开
    维护相关数据在波形存储器中的列向量的技术

    公开(公告)号:WO2013086384A1

    公开(公告)日:2013-06-13

    申请号:PCT/US2012/068531

    申请日:2012-12-07

    CPC classification number: G06F17/30315 G06F9/3887 G06F17/30339 G06F17/30595

    Abstract: Techniques are provided for more efficiently using the bandwidth of the I/O path between a CPU and volatile memory during the performance of database operation. Relational data from a relational table is stored in volatile memory as column vectors, where each column vector contains values for a particular column of the table. A binary-comparable format may be used to represent each value within a column vector, regardless of the data type associated with the column. The column vectors may be compressed and/or encoded while in volatile memory, and decompressed/decoded on-the-fly within the CPU. Alternatively, the CPU may be designed to perform operations directly on the compressed and/or encoded column vector data. In addition, techniques are described that enable the CPU to perform vector processing operations on the column vector values.

    Abstract translation: 在执行数据库操作期间,提供了技术来更有效地使用CPU和易失性存储器之间的I / O路径的带宽。 来自关系表的关系数据作为列向量存储在易失性存储器中,其中每个列向量包含表的特定列的值。 可以使用二进制可比较的格式来表示列向量中的每个值,而不管与列相关联的数据类型如何。 列向量可以在易失性存储器中被压缩和/或编码,并且在CPU内部实时解压缩/解码。 或者,CPU可以被设计为直接对压缩和/或编码的列向量数据执行操作。 另外,描述使CPU能够对列向量值执行向量处理操作的技术。

    ONE-SIDED RELIABLE REMOTE DIRECT MEMORY OPERATIONS

    公开(公告)号:WO2020033012A1

    公开(公告)日:2020-02-13

    申请号:PCT/US2019/023264

    申请日:2019-03-20

    Abstract: Techniques are provided to allow more sophisticated operations to be performed remotely by machines that are not fully functional. Operations that can be performed reliably by a machine that has experienced a hardware and/or software error are referred to herein as Remote Direct Memory Operations or "RDMOs". Unlike RDMAs, which typically involve trivially simple operations such as the retrieval of a single value from the memory of a remote machine, RDMOs may be arbitrarily complex. The techniques described herein can help applications run without interruption when there are software faults or glitches on a remote system with which they interact.

    MATERIALIZING EXPRESSIONS WITHIN IN-MEMORY VIRTUAL COLUMN UNITS TO ACCELERATE ANALYTIC QUERIES
    10.
    发明申请
    MATERIALIZING EXPRESSIONS WITHIN IN-MEMORY VIRTUAL COLUMN UNITS TO ACCELERATE ANALYTIC QUERIES 审中-公开
    在存储器虚拟色谱单元中进行表示以加速分析查询

    公开(公告)号:WO2017019881A1

    公开(公告)日:2017-02-02

    申请号:PCT/US2016/044523

    申请日:2016-07-28

    Abstract: Techniques are described for materializing pre-computed results of expressions. In an embodiment, a set of one or more column units are stored in volatile or non-volatile memory. Each column unit corresponds to a column that belongs to an on-disk table within a database managed by a database server instance and includes data items from the corresponding column. A set of one or more virtual column units, and data that associates the set of one or more column units with the set of one or more virtual column units, are also stored in memory. The set of one or more virtual column units includes a particular virtual column unit storing results that are derived by evaluating an expression on at least one column of the on-disk table.

    Abstract translation: 描述了用于实现预先计算的表达式结果的技术。 在一个实施例中,一组一个或多个列单元存储在易失性或非易失性存储器中。 每个列单元对应于属于由数据库服务器实例管理的数据库中的磁盘表上的列,并包括来自相应列的数据项。 一组一个或多个虚拟列单元以及将一个或多个列单元的集合与一个或多个虚拟列单元的集合相关联的数据也存储在存储器中。 一个或多个虚拟列单元的集合包括存储通过评估磁盘表的至少一列上的表达而导出的结果的特定虚拟列单元。

Patent Agency Ranking