Apparatus and Method for Managing Storage of a Primary Database and a Replica Database

    公开(公告)号:US20170097972A1

    公开(公告)日:2017-04-06

    申请号:US14872811

    申请日:2015-10-01

    CPC classification number: G06F16/27 G06F16/258

    Abstract: System and method embodiments are provided for using different storage formats for a primary database and its replicas in a database managed replication (DMR) system. As such, the advantages of both formats can be combined with suitable design complexity and implementation. In an embodiment, data is arranged in a sequence of rows and stored in a first storage format at the primary database. The data arranged in the sequence of rows is also stored in a second storage format at the replica database. The sequence of rows is determined according to the first storage format or the second storage format. The first storage format is a row store (RS) and the second storage format is a column store (CS), or vice versa. In an embodiment, the sequence of rows is determined to improve compression efficiency at the CS.

    System and Method for Database Query
    12.
    发明申请

    公开(公告)号:US20170091269A1

    公开(公告)日:2017-03-30

    申请号:US14864289

    申请日:2015-09-24

    CPC classification number: G06F17/30442 G06F17/30463 G06F17/30569

    Abstract: A method includes receiving, by a database system, a query statement and forming a runtime plan tree in accordance with the query statement. The method also includes traversing the runtime plan tree including determining whether a function node of the runtime plan tree is qualified for just-in-time (JIT) compilation. Additionally, the method includes, upon determining that the function node is a qualified for JIT compilation producing a string key in accordance with a function of the function node and determining whether a compiled object corresponding to the string key is stored in a compiled object cache.

    DATA PLACEMENT CONTROL FOR DISTRIBUTED COMPUTING ENVIRONMENT
    13.
    发明申请
    DATA PLACEMENT CONTROL FOR DISTRIBUTED COMPUTING ENVIRONMENT 审中-公开
    用于分布式计算环境的数据放置控制

    公开(公告)号:US20170031988A1

    公开(公告)日:2017-02-02

    申请号:US14813668

    申请日:2015-07-30

    CPC classification number: G06F17/30466 G06F17/30486

    Abstract: A method includes dividing a dataset into partitions by hashing a specified key, selecting a set of distributed file system nodes as a primary node group for storage of the partitions, and causing a primary copy of the partitions to be stored on the primary node group by a distributed storage system file server such that the location of each partition is known by hashing of the specified key.

    Abstract translation: 一种方法包括通过散列指定的密钥将数据集分成分区,选择一组分布式文件系统节点作为主节点组以存储分区,并使分区的主副本存储在主节点组上, 分布式存储系统文件服务器,使得每个分区的位置通过散列指定的密钥而已知。

    System and method for adaptive vector size selection for vectorized query execution
    15.
    发明授权
    System and method for adaptive vector size selection for vectorized query execution 有权
    用于向量化查询执行的自适应向量大小选择的系统和方法

    公开(公告)号:US09436732B2

    公开(公告)日:2016-09-06

    申请号:US13798680

    申请日:2013-03-13

    CPC classification number: G06F17/30463

    Abstract: System and method embodiments are provided for adaptive vector size selection for vectorized query execution. The adaptive vector size selection is implemented in two stages. In a query planning stage, a suitable vector size is estimated for a query by a query planner. The planning stage includes analyzing a query plan tree, segmenting the tree into different segments, and assigning to the query execution plan an initial vector size to each segment. In a subsequent query execution stage, an execution engine monitors hardware performance indicators, and adjusts the vector size according to the monitored hardware performance indicators. Adjusting the vector size includes trying different vector sizes and observing related processor counters to increase or decrease the vector size, wherein the vector size is increased to improve hardware performance according to the processor counters, and wherein the vector size is decreased when the processor counters indicate a decrease in hardware performance.

    Abstract translation: 为向量化查询执行的自适应向量大小选择提供了系统和方法实施例。 自适应向量大小选择分两个阶段实现。 在查询计划阶段,由查询计划器为查询估计合适的向量大小。 规划阶段包括分析查询计划树,将树分割成不同的段,并将查询执行计划分配给每个段的初始向量大小。 在随后的查询执行阶段,执行引擎监视硬件性能指标,并根据监视的硬件性能指标调整向量大小。 调整矢量大小包括尝试不同的矢量大小并观察相关处理器计数器以增加或减少矢量大小,其中根据处理器计数器增加矢量大小以提高硬件性能,并且其中当处理器计数器指示 硬件性能下降。

    FLEXIBLE TASK SCHEDULER FOR MULTIPLE PARALLEL PROCESSING OF DATABASE DATA

    公开(公告)号:US20170228422A1

    公开(公告)日:2017-08-10

    申请号:US15040747

    申请日:2016-02-10

    CPC classification number: G06F16/24532 G06F16/24542 G06F16/2455

    Abstract: A system and method of responding to a database query. A query is received for MPP database data stored on a plurality of processing systems. A total splits number of the database data, each split containing at least a portion of the database, is determined. If the total splits number splits is greater than a splits threshold number, partial task maps are created and streamed to the processing systems after compiling the query. If the total splits number is less than the splits threshold number, a complete task map for all splits is created and output to the plurality of processing systems.

    Systems and Methods for Parallelizing Hash-based Operators in SMP Databases
    18.
    发明申请
    Systems and Methods for Parallelizing Hash-based Operators in SMP Databases 审中-公开
    在SMP数据库中并行化基于哈希的运算符的系统和方法

    公开(公告)号:US20160378824A1

    公开(公告)日:2016-12-29

    申请号:US14749098

    申请日:2015-06-24

    CPC classification number: G06F16/24532 G06F16/2255

    Abstract: A system and method for parallelizing hash-based operators in symmetric multiprocessing (SMP) databases is provided. In an embodiment, a method in a device for performing hash based database operations includes receiving at the device an database query; creating a plurality of execution workers to process the query; and building by the execution workers a hash table from a database table, the database table comprising one of a plurality of partitions and a plurality of scan units, the hash table shared by the execution workers, each execution worker scanning a corresponding partition and adding entries to the hash table if the database table is partitioned, each execution worker scanning an unprocessed scan unit and adding entries to the hash table according to the scan unit if the database table comprises scan units, and the workers performing the scanning and the adding in a parallel manner.

    Abstract translation: 提供了一种用于在对称多处理(SMP)数据库中并行化基于散列算子的系统和方法。 在一个实施例中,用于执行基于散列的数据库操作的设备中的方法包括在所述设备处接收数据库查询; 创建多个执行人员来处理查询; 并且由执行工作者构建来自数据库表的散列表,所述数据库表包括多个分区和多个扫描单元之一,所述散列表由执行工作者共享,每个执行工作人员扫描相应的分区并添加条目 如果数据库表被分区,则每个执行人员扫描未处理的扫描单元,并且如果数据库表包括扫描单元,则根据扫描单元将条目添加到散列表,并且执行扫描和添加的工作人员 并行方式

    System and Method for Massively Parallel Processing Database
    19.
    发明申请
    System and Method for Massively Parallel Processing Database 有权
    大规模并行处理数据库的系统与方法

    公开(公告)号:US20150293966A1

    公开(公告)日:2015-10-15

    申请号:US14249558

    申请日:2014-04-10

    Inventor: Le Cai Guogen Zhang

    Abstract: In one embodiment, a method of performing point-in-time recovery (PITR) in a massively parallel processing (MPP) database includes receiving, by a data node from a coordinator, a PITR recovery request and reading a log record of the MPP database. The method also includes determining a type of the log record and updating a transaction table when the type of the log record is an abort transaction or a commit transaction.

    Abstract translation: 在一个实施例中,在大规模并行处理(MPP)数据库中执行时间点恢复(PITR)的方法包括从协调器接收数据节点PITR恢复请求并读取MPP数据库的日志记录 。 该方法还包括当日志记录的类型是中止事务或提交事务时,确定日志记录的类型和更新事务表。

    Systems and Methods to Optimize Multi-version Support in Indexes
    20.
    发明申请
    Systems and Methods to Optimize Multi-version Support in Indexes 有权
    优化索引中多版本支持的系统和方法

    公开(公告)号:US20150278270A1

    公开(公告)日:2015-10-01

    申请号:US14229638

    申请日:2014-03-28

    Inventor: Guogen Zhang

    Abstract: System and method embodiments are provided for multi-version support in indexes in a database. The embodiments enable substantially optimized multi-version support in index and avoid backfill of commit log sequence number (LSN) for a transaction identifier (TxID). In an embodiment, a method in a data processing system for managing a database includes determining with the data processing system whether a record is deleted according to a delete indicator in an index leaf page record corresponding to the record; and determining with the data processing system, when the record is not deleted, whether the record is visible according to a new record indicator in the index leaf page record and according to a comparison of a system commit TxID at the transaction start with a record commit TxID obtained from the index leaf page record.

    Abstract translation: 提供了系统和方法实施例用于数据库中的索引中的多版本支持。 实施例实现了索引中的基本上优化的多版本支持,并避免了交易标识符(TxID)的提交日志序列号(LSN)的补余。 在一个实施例中,用于管理数据库的数据处理系统中的方法包括:根据与该记录相对应的索引叶页面记录中的删除指示符,利用数据处理系统确定是否删除记录; 并且利用数据处理系统确定记录何时未被删除,根据索引叶页面记录中的新记录指示符是否可见记录,并且根据事务开始处的系统提交TxID与记录提交的比较 从索引叶页面记录中获取TxID。

Patent Agency Ranking