MULTI-PARTITION OPERATION IN COMBINATION OPERATIONS

    公开(公告)号:US20190095493A1

    公开(公告)日:2019-03-28

    申请号:US15713976

    申请日:2017-09-25

    Applicant: Splunk Inc.

    Abstract: In an environment where multiple datasets are to be combined, systems and methods are disclosed for allocating a group of data entries from at least one dataset into multiple partitions. For a particular partition, the subgroup in the partition can be combined with data entries from the other dataset. In some cases, groups of data entries from each dataset are assigned to different partitions. For a particular partition, a subgroup is duplicated, some of the data entries of the subgroup are reassigned to other partitions, the subgroup is reformed to include data entries from other partitions, and the reformed subgroup is combined with the subgroup from the other dataset(s).

    QUERY PROCESSING USING QUERY-RESOURCE USAGE AND NODE UTILIZATION DATA

    公开(公告)号:US20180089269A1

    公开(公告)日:2018-03-29

    申请号:US15665148

    申请日:2017-07-31

    Applicant: Splunk Inc.

    CPC classification number: G06F16/24542 G06F16/24554 G06F16/258

    Abstract: Systems and methods are disclosed for processing queries against one or more dataset sources. The system tracks query resource data and resource utilization data. The query-resource usage data can indicate resources used to execute queries. The node resource utilization data can indicate current utilization of nodes in the system. Upon receipt of a query that identifies a set of data to be processed and a manner of processing the set of data, the system can use the query-resource usage data and the resource utilization data to define a query processing scheme. The query can then be executed using the query processing scheme. In some cases, the query coordinator can dynamically allocate partitions operating on worker nodes to execute the query.

    EXTERNAL DATASET CAPABILITY COMPENSATION
    14.
    发明申请

    公开(公告)号:US20180089259A1

    公开(公告)日:2018-03-29

    申请号:US15665248

    申请日:2017-07-31

    Applicant: Splunk Inc.

    CPC classification number: G06F16/2425 G06F16/2282

    Abstract: Systems and methods are disclosed for processing queries against an external data source utilizing dynamically allocated partitions operating on one or more worker nodes. The external data source can include data that has not been processed by the system. To query the external data source, a query coordinator can generate a subquery for the external data source based on determined functionality of the data source. The subquery can identify data in the external data source for processing and a manner for processing the data. In addition, the query coordinator can dynamically allocate partitions operating on worker nodes to retrieve and intake results of the subquery. In some cases, number of partitions allocated can be based on a number of partitions supported by the external data source.

    Multi-phased execution of a search query

    公开(公告)号:US11625404B2

    公开(公告)日:2023-04-11

    申请号:US16687158

    申请日:2019-11-18

    Applicant: Splunk Inc.

    Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes receiving a search query by a search head, defining a search process for applying the search query to indexers, delegating a first portion of the search process to indexers and a second portion of the search process to intermediary node(s) communicatively coupled to the search head and the indexers. The first portion can define a search scope for obtaining partial search results of the indexers and the second portion can define operations for combining the partial search results by the intermediary node(s) to produce a combination of the partial search results. The search head then receives the combination of the partial search results, and outputs final search results for the search query, where the final search results are based on the combination of the partial search results.

Patent Agency Ranking