Execution plan stitching
    3.
    Granted patent

    Publication No.: US10810202B2

    Publication date: 2020-10-20

    Application No.: US16008905

    Filing date: 2018-06-14

    Abstract: Systems, methods, and computer-executable instructions for creating a query execution plan for a query of a database include receiving, from the database, a set of previously executed query execution plans for the query. Each previously executed query execution plan includes subplans. Each subplan indicates a tree of physical operators. Physical operators that executed in the set of previously executed query execution plans are determined. For each physical operator, an execution cost is determined. Physical operators from the previously executed query execution plans that are invalid for the database are removed. Equivalent subplans from the previously executed query execution plans are identified based on physical properties and logical expressions of the subplans. A constrained search space is created based on the equivalent subplans. A query execution plan for the query is constructed from the constrained search space based on the execution cost. The constructed query execution plan is not within the previously executed query execution plans.
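
The steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Node` class, the string encoding of logical expressions and physical properties, the `is_valid` callback, and the per-operator costs are all hypothetical. Subplans with the same (logical expression, physical properties) key are treated as interchangeable, and the cheapest combination is picked by memoized recursion over those groups.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    op: str               # physical operator, e.g. "HashJoin" (illustrative name)
    logical: str          # logical expression the subplan computes (assumed encoding)
    props: str            # physical properties of the output, e.g. a sort order
    cost: float           # measured cost of this operator alone (assumed available)
    children: tuple = ()


def stitch(plans, is_valid=lambda node: True):
    """Return (total_cost, plan) for the cheapest plan in the constrained space."""
    # Group valid subplans by equivalence key: (logical expression, physical properties).
    groups = {}

    def collect(node):
        if is_valid(node):
            groups.setdefault((node.logical, node.props), []).append(node)
        for child in node.children:
            collect(child)

    for plan in plans:
        collect(plan)

    memo = {}

    def best(key):
        # Assumes a child never has the same key as its parent (no cycles).
        if key in memo:
            return memo[key]
        candidates = []
        for node in groups.get(key, []):
            subs = [best((c.logical, c.props)) for c in node.children]
            if all(s is not None for s in subs):
                total = node.cost + sum(s[0] for s in subs)
                tree = Node(node.op, node.logical, node.props, node.cost,
                            tuple(s[1] for s in subs))
                candidates.append((total, tree))
        memo[key] = min(candidates, key=lambda c: c[0]) if candidates else None
        return memo[key]

    root = plans[0]
    return best((root.logical, root.props))


# Two earlier plans for the same query, each with one cheap and one expensive leaf.
plan1 = Node("HashJoin", "A JOIN B", "", 5.0,
             (Node("IndexSeek", "A", "", 1.0), Node("TableScan", "B", "", 10.0)))
plan2 = Node("HashJoin", "A JOIN B", "", 5.0,
             (Node("TableScan", "A", "", 9.0), Node("IndexSeek", "B", "", 2.0)))

total, stitched = stitch([plan1, plan2])
```

In this toy input the stitched plan takes the index seek on `A` from the first plan and the index seek on `B` from the second, so it matches neither original plan.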

    LEVERAGING QUERY EXECUTIONS TO IMPROVE INDEX RECOMMENDATIONS

    Publication No.: US20200272667A1

    Publication date: 2020-08-27

    Application No.: US16282116

    Filing date: 2019-02-21

    Abstract: Systems and techniques for leveraging query executions to improve index recommendations are described herein. In an example, a machine learning model is adapted to receive a first query plan and a second query plan for performing a query with a database, where the first query plan is different from the second query plan. The machine learning model may be further adapted to determine execution cost efficiency between the first query plan and the second query plan. The machine learning model is trained using relative execution cost comparisons between a set of pairs of query plans for the database. The machine learning model is further adapted to output a ranking of the first query plan and second query plan, where the first query plan and second query plan are ranked based on execution cost efficiency.
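
A small Python sketch of the pairwise idea follows. The abstract does not name a model type, so logistic regression on feature differences is used here purely as a stand-in. The feature layout (rows scanned in thousands, number of sort operators), the training pairs, and the function names are invented for illustration.

```python
import math


def train_pairwise(pairs, epochs=200, lr=0.1):
    """pairs: (features_a, features_b, label); label is 1 if plan a ran cheaper."""
    dim = len(pairs[0][0])
    w = [0.0] * dim
    for _ in range(epochs):
        for fa, fb, y in pairs:
            diff = [a - b for a, b in zip(fa, fb)]
            z = sum(wi * di for wi, di in zip(w, diff))
            p = 1.0 / (1.0 + math.exp(-z))      # predicted P(plan a is cheaper)
            for i in range(dim):
                w[i] += lr * (y - p) * diff[i]  # gradient step on the log-likelihood
    return w


def rank(w, first, second):
    """Order two plans from more to less cost-efficient according to the model."""
    score = sum(wi * (a - b) for wi, a, b in zip(w, first, second))
    return ("first", "second") if score > 0 else ("second", "first")


# Hypothetical relative comparisons observed from executions (not real data).
training_pairs = [
    ([1, 0], [5, 1], 1),
    ([4, 1], [2, 0], 0),
    ([3, 0], [1, 1], 0),
    ([2, 1], [6, 0], 1),
]
weights = train_pairwise(training_pairs)
ranking = rank(weights, [1, 0], [3, 0])
```

Training on relative comparisons rather than absolute costs means the model only has to learn which of two plans is cheaper, not how expensive each one is.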

    Finding data in connected corpuses using examples

    Publication No.: US10140366B2

    Publication date: 2018-11-27

    Application No.: US14659303

    Filing date: 2015-03-16

    Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system then identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.
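
The lookup and linking steps can be sketched in Python as follows. This is only an illustration under stated assumptions: the catalog is modeled as a dictionary of datasets whose columns hold sets of values, two datasets count as directly linked when they share a column name (standing in for a shared domain), and all dataset names and values are made up.

```python
from collections import deque


def find_datasets(catalog, example):
    """Names of datasets in which some column contains the example value."""
    return {name for name, columns in catalog.items()
            if any(example in values for values in columns.values())}


def linked(catalog, a, b):
    """Directly linked here means sharing a column name (assumed domain proxy)."""
    return bool(set(catalog[a]) & set(catalog[b]))


def connecting_path(catalog, start, goal):
    """Breadth-first search for a chain of linked datasets from start to goal."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for candidate in catalog:
            if candidate not in seen and linked(catalog, path[-1], candidate):
                seen.add(candidate)
                queue.append(path + [candidate])
    return None


# Hypothetical catalog: dataset -> column -> set of values.
catalog = {
    "customers": {"customer_id": {"c1", "c2"}, "city": {"Oslo", "Lima"}},
    "orders": {"order_id": {"o1"}, "customer_id": {"c1"}, "product_id": {"p9"}},
    "products": {"product_id": {"p9"}, "product_name": {"kettle"}},
}

city_hits = find_datasets(catalog, "Oslo")
product_hits = find_datasets(catalog, "kettle")
path = connecting_path(catalog, "customers", "products")
```

Here the two example values land in datasets that share no column, and the search finds the indirect link through `orders`.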

    JOINING TABLES BY LEVERAGING TRANSFORMATIONS

    Publication No.: US20180157706A1

    Publication date: 2018-06-07

    Application No.: US15481055

    Filing date: 2017-04-06

    CPC classification number: G06F16/24537 G06F16/2456 G06F16/258

    Abstract: Methods and systems for joining two tables are provided. At least two tables to be joined are received. A joinable row pair between the at least two tables is determined. The determined joinable row pair includes a first row from a first table having a common string value with a second row from a second table of the at least two tables. A transformation model is generated from the determined joinable row pair. A column of the first table is transformed based on the generated transformation model. The transformed first table is joined with the second table.
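
As a rough Python sketch of this flow, assume the transformation model is nothing more than "split on a delimiter and take the token at a fixed position". That restriction, the delimiter list, the function names, and the sample rows are all assumptions made for illustration, not the model described in the application.

```python
def find_joinable_pair(left_values, right_values):
    """First (left, right) pair in which the right value occurs inside the left."""
    for left in left_values:
        for right in right_values:
            if right in left:
                return left, right
    return None


def learn_transform(source, target, delimiters=("_", "-", " ", ",")):
    """Find (delimiter, index) such that splitting source yields target."""
    for delimiter in delimiters:
        for index, part in enumerate(source.split(delimiter)):
            if part.strip() == target:
                return delimiter, index
    return None


def apply_transform(value, transform):
    delimiter, index = transform
    parts = value.split(delimiter)
    return parts[index].strip() if index < len(parts) else None


def transform_join(left, right, left_key, right_key):
    pair = find_joinable_pair([row[left_key] for row in left],
                              [row[right_key] for row in right])
    transform = learn_transform(*pair) if pair else None
    if transform is None:
        return []
    lookup = {row[right_key]: row for row in right}
    joined = []
    for row in left:
        key = apply_transform(row[left_key], transform)  # transformed join column
        if key in lookup:
            joined.append({**row, **lookup[key]})
    return joined


# Hypothetical tables whose key columns are formatted differently.
files = [{"file": "report_A17_2017"}, {"file": "report_B02_2018"}]
owners = [{"code": "A17", "owner": "Ana"}, {"code": "B02", "owner": "Raj"}]
result = transform_join(files, owners, "file", "code")
```

A single joinable pair is enough to learn the rule in this toy case. A real system would presumably check a learned transformation against more than one pair before trusting it.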

    EFFICIENT TRANSFORMATION PROGRAM GENERATION
    7.
    Patent application

    Publication No.: US20180113923A1

    Publication date: 2018-04-26

    Application No.: US15331690

    Filing date: 2016-10-21

    CPC classification number: G06F16/258 G06F16/90344

    Abstract: Methods, computer systems, computer-storage media, and graphical user interfaces are provided for facilitating data transformations, according to embodiments of the present invention. In one embodiment, a transformation function is executed using an example input value to obtain an initial output value. Thereafter, a plurality of supplemental transformation tools is applied to the initial output value to generate a plurality of intermediary output values. Based on a comparison of each of the intermediary output values to an example output value, the supplemental transformation tool that generated an intermediary output value having a greatest extent of similarity to the example output values is identified. The identified supplemental transformation tool and the transformation function are used to generate a transformation program that transforms the example input values to the desired form in which to transform data.
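
The selection step can be sketched in Python as below. The similarity measure is an assumption: the abstract only speaks of a "greatest extent of similarity", so `difflib.SequenceMatcher.ratio()` from the standard library is used as a stand-in. The base function, the tool names, and the example values are hypothetical.

```python
from difflib import SequenceMatcher


def build_program(base, tools, example_in, example_out):
    """Pick the supplemental tool whose output is closest to the example output."""
    initial = base(example_in)  # initial output value from the transformation function

    def similarity(name):
        intermediary = tools[name](initial)
        return SequenceMatcher(None, intermediary, example_out).ratio()

    best = max(tools, key=similarity)
    # The generated program composes the base function with the chosen tool.
    return best, (lambda value: tools[best](base(value)))


def local_part(address):
    """Hypothetical base transformation: keep the text before '@'."""
    return address.split("@")[0]


supplemental_tools = {
    "upper": str.upper,
    "title": str.title,
    "strip_digits": lambda s: "".join(ch for ch in s if not ch.isdigit()),
}

chosen, program = build_program(local_part, supplemental_tools,
                                "ada.lovelace@example.com", "ADA.LOVELACE")
output = program("grace.hopper@example.org")
```

Scoring each tool's intermediary output against the example avoids searching over arbitrary compositions, which is where the efficiency claim in the title would come from in a sketch like this one.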

    EXTENSIBLE DATA TRANSFORMATIONS
    8.
    Patent application

    Publication No.: US20180107724A1

    Publication date: 2018-04-19

    Application No.: US15295858

    Filing date: 2016-10-17

    CPC classification number: G06F16/258 G06F21/604

    Abstract: Methods, computer systems, computer-storage media, and graphical user interfaces are provided for facilitating data transformations, according to embodiments of the present invention. In one embodiment, a set of example values are received. A repository of transformation tools is searched to identify a new transformation tool as relevant to a data transformation associated with the received set of example values. The repository includes annotations associated with the new transformation tool. The new transformation tool is used to generate a transformation program that produces transformed output values. Additional annotations are generated for the new transformation tool based on the transformed output values.

    Isolating Resources and Performance in a Database Management System
    9.
    Patent application
    Status: under examination, published

    Publication No.: US20160371307A1

    Publication date: 2016-12-22

    Application No.: US15252972

    Filing date: 2016-08-31

    Abstract: Techniques for tenant performance isolation in a multiple-tenant database management system are described. These techniques may include providing a reservation of server resources. The server resources reservation may include a reservation of a central processing unit (CPU), a reservation of Input/Output throughput, and/or a reservation of buffer pool memory or working memory. The techniques may also include a metering mechanism that determines whether the resource reservation is satisfied. The metering mechanism may be independent of an actual resource allocation mechanism associated with the server resource reservation.
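
One way to picture a meter that is independent of the allocator is the Python sketch below. The accounting rule is an assumption, not taken from the abstract: a shortfall against the CPU reservation counts as a violation only for the time the tenant actually had work waiting. The `Interval` fields and the numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Interval:
    length_ms: float       # length of the metering interval
    cpu_used_ms: float     # CPU time the tenant received
    cpu_waiting_ms: float  # time the tenant had runnable work but no CPU


def violation_ms(interval, reserved_fraction):
    """CPU time owed to the tenant under the assumed rule; 0.0 means satisfied."""
    promised = reserved_fraction * interval.length_ms
    shortfall = max(0.0, promised - interval.cpu_used_ms)
    # An idle tenant is not owed CPU it never asked for.
    return min(shortfall, interval.cpu_waiting_ms)


idle_tenant = violation_ms(Interval(1000, 300, 0), 0.5)
starved_tenant = violation_ms(Interval(1000, 300, 400), 0.5)
served_tenant = violation_ms(Interval(1000, 600, 100), 0.5)
```

The meter reads only observed usage and wait time, so it can audit any scheduler without knowing how that scheduler allocates the CPU.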

    Finding Data in Connected Corpuses Using Examples
    10.
    Patent application
    Status: under examination, published

    Publication No.: US20150193533A1

    Publication date: 2015-07-09

    Application No.: US14659303

    Filing date: 2015-03-16

    Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system then identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.
