-
公开(公告)号:US20190095493A1
公开(公告)日:2019-03-28
申请号:US15713976
申请日:2017-09-25
Applicant: Splunk Inc.
Inventor: Arindam Bhattacharjee , Sourav Pal , Christopher Pride
Abstract: In an environment where multiple datasets are to be combined, systems and methods are disclosed for allocating a group of data entries from at least one dataset into multiple partitions. For a particular partition, the subgroup in the partition can be combined with data entries from the other dataset. In some cases, groups of data entries from each dataset are assigned to different partitions. For a particular partition, a subgroup is duplicated, some of the data entries of the subgroup are reassigned to other partitions, the subgroup is reformed to include data entries from other partitions, and the reformed subgroup is combined with the subgroup from the other dataset(s).
-
公开(公告)号:US20180089290A1
公开(公告)日:2018-03-29
申请号:US15339912
申请日:2016-10-31
Applicant: Splunk Inc.
Inventor: Thomas Allan Haggie , Clint Sharp , Alexander Douglas James , David Ryan Marquardt , Hailun Yan , Christopher Pride , Vishal Patel , Amrittpal Singh Bath , Pratiksha Shah , Murugan Kandaswamy , Steve Yu Zhang , Ledion Bitincka , David E. Simmen , Marc Andre Chene , Esguerra Ma Kharisma , Igor Stojanovski
IPC: G06F17/30
CPC classification number: G06F16/248 , G06F3/0481 , G06F16/22 , G06F16/2228 , G06F16/2255 , G06F16/2425 , G06F16/2455 , G06F16/24568 , G06F16/2462 , G06F16/2477 , G06F16/25 , G06F16/285 , G06F16/8373 , G06F16/901 , G06F16/90335 , G06F16/9038 , G06F16/951 , G06F16/9535 , G06T11/206 , G06T2200/24 , H04L43/08 , H04L67/02 , H04L67/025
Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes ingesting each metric including at least one key value and a measured value taken of a computing resource, and storing each metric in an index of a metrics store, where the index defines at least one dimension populated with the at least one key value and a measure populated with the measured value. The method further includes cataloging metadata in a metrics catalog, where the metadata is related to the metrics stored in the metrics store, performing an analysis of metrics data included in the metrics store and/or the metrics catalog to obtain results, and causing display of the results or an indication of the results on a display device.
-
公开(公告)号:US20180089269A1
公开(公告)日:2018-03-29
申请号:US15665148
申请日:2017-07-31
Applicant: Splunk Inc.
Inventor: Sourav Pal , Arindam Bhattacharjee , Christopher Pride
IPC: G06F17/30
CPC classification number: G06F16/24542 , G06F16/24554 , G06F16/258
Abstract: Systems and methods are disclosed for processing queries against one or more dataset sources. The system tracks query resource data and resource utilization data. The query-resource usage data can indicate resources used to execute queries. The node resource utilization data can indicate current utilization of nodes in the system. Upon receipt of a query that identifies a set of data to be processed and a manner of processing the set of data, the system can use the query-resource usage data and the resource utilization data to define a query processing scheme. The query can then be executed using the query processing scheme. In some cases, the query coordinator can dynamically allocate partitions operating on worker nodes to execute the query.
-
公开(公告)号:US20180089259A1
公开(公告)日:2018-03-29
申请号:US15665248
申请日:2017-07-31
Applicant: Splunk Inc.
Inventor: Alexander Douglas James , Sourav Pal , Arindam Bhattacharjee , Christopher Pride
IPC: G06F17/30
CPC classification number: G06F16/2425 , G06F16/2282
Abstract: Systems and methods are disclosed for processing queries against an external data source utilizing dynamically allocated partitions operating on one or more worker nodes. The external data source can include data that has not been processed by the system. To query the external data source, a query coordinator can generate a subquery for the external data source based on determined functionality of the data source. The subquery can identify data in the external data source for processing and a manner for processing the data. In addition, the query coordinator can dynamically allocate partitions operating on worker nodes to retrieve and intake results of the subquery. In some cases, number of partitions allocated can be based on a number of partitions supported by the external data source.
-
公开(公告)号:US20240086471A1
公开(公告)日:2024-03-14
申请号:US18470251
申请日:2023-09-19
Applicant: Splunk Inc.
Inventor: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC: G06F16/951 , G06F16/21 , G06F16/2455 , G06F16/2458 , G06F16/248 , G06F16/25 , G06F16/27 , G06F16/901 , G06F16/903 , G06F16/9038 , G06F16/904
CPC classification number: G06F16/951 , G06F16/211 , G06F16/212 , G06F16/2455 , G06F16/2471 , G06F16/248 , G06F16/252 , G06F16/258 , G06F16/27 , G06F16/9024 , G06F16/90335 , G06F16/9038 , G06F16/904
Abstract: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
-
公开(公告)号:US11874691B1
公开(公告)日:2024-01-16
申请号:US16000664
申请日:2018-06-05
Applicant: Splunk Inc.
Inventor: Alexandros Batsakis , Ashish Mathew , Christopher Pride , Bharath Kishore Reddy Aleti , Sourav Pal , Arindam Bhattacharjee , James Monschke
IPC: G06F16/2453 , G06F16/22
CPC classification number: G06F16/24542 , G06F16/2272
Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives a query identifying a set of data to be processed and a manner of processing the set of data. The data intake and query system identifies buckets that are to be searched and search nodes to execute the query. The data intake and query system maps the identified buckets to the search nodes and executes the query using the identified bucket and search nodes.
-
公开(公告)号:US11860874B2
公开(公告)日:2024-01-02
申请号:US18051470
申请日:2022-10-31
Applicant: Splunk Inc.
Inventor: Arindam Bhattacharjee , Sourav Pal , Christopher Pride
CPC classification number: G06F16/24554 , G06F7/5324 , G06F11/3006 , G06F11/3086 , G06F11/3433 , G06F16/278 , G06F2201/835 , G06F2201/86
Abstract: Systems and methods are disclosed for processing and executing queries against one or more dataset. As part of processing the query, the system determines whether the query is susceptible to a significantly imbalanced partition. In the event, the query is susceptible to an imbalanced partition, the system monitors the query and determines whether to perform a multi-partitioning determination to avoid a significantly imbalanced partition.
-
公开(公告)号:US11625404B2
公开(公告)日:2023-04-11
申请号:US16687158
申请日:2019-11-18
Applicant: Splunk Inc.
Inventor: Sourav Pal , Ashish Mathew , Xiaowei Wang , Christopher Pride
IPC: G06F16/2455 , G06F16/248 , G06F16/951
Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes receiving a search query by a search head, defining a search process for applying the search query to indexers, delegating a first portion of the search process to indexers and a second portion of the search process to intermediary node(s) communicatively coupled to the search head and the indexers. The first portion can define a search scope for obtaining partial search results of the indexers and the second portion can define operations for combining the partial search results by the intermediary node(s) to produce a combination of the partial search results. The search head then receives the combination of the partial search results, and outputs final search results for the search query, where the final search results are based on the combination of the partial search results.
-
公开(公告)号:US11314758B2
公开(公告)日:2022-04-26
申请号:US16777592
申请日:2020-01-30
Applicant: Splunk Inc.
Inventor: David Ryan Marquardt , Hailun Yan , Christopher Pride , Vishal Patel
IPC: G06F16/00 , G06F16/248 , G06F16/22 , G06F16/25 , G06F16/28 , G06F16/901 , G06F16/951 , G06F16/242 , G06F16/2455 , G06F16/2458 , G06F16/835 , G06F16/9038 , G06F16/9535 , G06F16/903 , H04L67/025 , G06F3/0481 , G06T11/20 , H04L67/02 , H04L43/08
Abstract: The disclosed embodiments include a method performed by a data intake and query system to store and query metrics data. The method includes ingesting metrics, where each metric includes key values and numerical value indicative of a measured characteristic of a computing resource. The method further includes populating a first portion of a metric-series index (msidx) file with the key values and a second portion of the msidx file with numerical values indicative of a measured characteristic, where the first portion is distinct from the second portion. The method further includes receiving a query including criteria, evaluating the query by applying the criteria to the first portion of the msidx file to obtain query results indicative of metrics that satisfy the criteria, and displaying, on a display device, the query results or data indicative of the query results.
-
公开(公告)号:US11176208B2
公开(公告)日:2021-11-16
申请号:US16570545
申请日:2019-09-13
Applicant: Splunk Inc.
Inventor: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC: G06F16/00 , G06F16/951 , G06F16/21 , G06F16/25 , G06F16/904 , G06F16/901 , G06F16/9038 , G06F16/903 , G06F16/248 , G06F16/2458 , G06F16/27 , G06F16/2455
Abstract: Disclosed is a technique that can be performed in a distributed computer network. The technique can include a data index and query system that receives a search query, defines a search scheme for applying the search query on distributed data storage systems including an internal data storage system of the data index and query system and an external data storage system. The internal data storage system stores data as time-indexed events including respective segments of raw machine data. The data index and query system can transfer a portion of the search scheme to a search service, which can return search results obtained by application of the search scheme to the distributed data storage systems including the internal data storage system and the external data storage system. Lastly, the search results or data indicative of the search results can be output on a display device to the user.
-
-
-
-
-
-
-
-
-