-
公开(公告)号:US20190171678A1
公开(公告)日:2019-06-06
申请号:US16264462
申请日:2019-01-31
Applicant: Splunk Inc.
Inventor: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC: G06F16/951 , G06F16/21 , G06F16/903 , G06F16/9038 , G06F16/904 , G06F16/25 , G06F16/901
Abstract: Disclosed is a technique that can be performed in a distributed computer network. The technique can include a worker node that receives search instructions defined by a search service based on at least a portion of a search scheme defined by a data intake and query system, to cause the worker node to obtain search results from distributed data storage systems communicatively coupled to the worker node over a network. The distributed data storage systems include an external data storage system and/or an internal data storage system of the data intake and query system. The worker node obtains the search results by searching the distributed data storage systems in accordance with the search instructions, and communicating, over the network to the search service, a combination of search results based on the search results to cause an output by the data intake and query system in accordance with the search scheme.
-
公开(公告)号:US20190163840A1
公开(公告)日:2019-05-30
申请号:US15339840
申请日:2016-10-31
Applicant: Splunk Inc.
Inventor: Sourav Pal , Arindam Bhattacharjee , Christopher Pride
IPC: G06F17/30
CPC classification number: G06F16/951 , G06F16/211 , G06F16/212 , G06F16/2455 , G06F16/2471 , G06F16/248 , G06F16/252 , G06F16/258 , G06F16/27 , G06F16/9024 , G06F16/90335 , G06F16/9038 , G06F16/904
Abstract: The disclosed embodiments include techniques for organizing and presenting search results obtained from within a big data ecosystem via a data intake and query system. In particular, a data intake and query system may cause output of the search results or data indicative of the search results on a display device.
-
公开(公告)号:US20190163824A1
公开(公告)日:2019-05-30
申请号:US15339853
申请日:2016-10-31
Applicant: Splunk Inc.
Inventor: Sourav Pal , Arindam Bhattacharjee , Christopher Pride
IPC: G06F17/30
CPC classification number: G06F16/951 , G06F16/211 , G06F16/212 , G06F16/2455 , G06F16/2471 , G06F16/248 , G06F16/252 , G06F16/258 , G06F16/27 , G06F16/9024 , G06F16/90335 , G06F16/9038 , G06F16/904
Abstract: The disclosed embodiments include a technique to obtain search results from the application of transformation operations on partial search results obtained from across internal and/or external data sources. Examples of transformation operations include arithmetic operations such as an average, mean, count, or the like. Examples of reporting transformations include join operations, statistics, sort, top head. Hence, the search results of a search query can be derived from partial search result rather than include the actual partial search results. In this case, the ordering of the search results may be nonessential. An example of a search query that requires a transformation operation is a “batch” or “reporting” search query. The related disclosed techniques involve obtaining data stored in the bid data ecosystem, and returning that data or data derived from that data.
-
公开(公告)号:US20190163823A1
公开(公告)日:2019-05-30
申请号:US15339835
申请日:2016-10-31
Applicant: Splunk Inc.
Inventor: Arindam Bhattacharjee , Sourav Pal , Xiaowei Wang , Christopher Pride , James Alasdair Robert Hodge
IPC: G06F17/30
CPC classification number: G06F16/951 , G06F16/211 , G06F16/212 , G06F16/2455 , G06F16/2471 , G06F16/248 , G06F16/252 , G06F16/258 , G06F16/27 , G06F16/9024 , G06F16/90335 , G06F16/9038 , G06F16/904
Abstract: The disclosed embodiments include techniques for exporting partial search results in parallel from peer indexers of a data intake and query system to the worker nodes. In particular, partial search results (e.g., time-indexed events) obtained from peer indexers can be exported in parallel from the peer indexers to worker nodes. Exporting the partial search results from the peer indexers in parallel can improve the rate at which the partial search results are transferred to the worker nodes for subsequent combination with partial search results of the external data systems. As such, the rate at which the search results of a search query can be obtained from the distributed data system can be improved by implementing parallel export techniques.
-
公开(公告)号:US20190095494A1
公开(公告)日:2019-03-28
申请号:US15714029
申请日:2017-09-25
Applicant: Splunk Inc.
Inventor: Arindam Bhattacharjee , Sourav Pal , Christopher Pride
Abstract: Systems and methods are disclosed for processing and executing queries against one or more dataset. As part of processing the query, the system determines whether the query is susceptible to a significantly imbalanced partition. In the event, the query is susceptible to an imbalanced partition, the system monitors the query and determines whether to perform a multi-partitioning determination to avoid a significantly imbalanced partition.
-
公开(公告)号:US12204593B2
公开(公告)日:2025-01-21
申请号:US18470251
申请日:2023-09-19
Applicant: Splunk Inc.
Inventor: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC: G06F16/00 , G06F16/21 , G06F16/2455 , G06F16/2458 , G06F16/248 , G06F16/25 , G06F16/27 , G06F16/901 , G06F16/903 , G06F16/9038 , G06F16/904 , G06F16/951
Abstract: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
-
公开(公告)号:US11797618B2
公开(公告)日:2023-10-24
申请号:US17810165
申请日:2022-06-30
Applicant: Splunk Inc.
Inventor: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC: G06F16/00 , G06F16/951 , G06F16/21 , G06F16/25 , G06F16/904 , G06F16/901 , G06F16/9038 , G06F16/903 , G06F16/248 , G06F16/2458 , G06F16/27 , G06F16/2455
CPC classification number: G06F16/951 , G06F16/211 , G06F16/212 , G06F16/248 , G06F16/2455 , G06F16/2471 , G06F16/252 , G06F16/258 , G06F16/27 , G06F16/904 , G06F16/9024 , G06F16/9038 , G06F16/90335
Abstract: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
-
公开(公告)号:US20220405331A1
公开(公告)日:2022-12-22
申请号:US17810165
申请日:2022-06-30
Applicant: Splunk Inc.
Inventor: Sourav Pal , Christopher Pride , Arindam Bhattacharjee , Xiaowei Wang , James Alasdair Robert Hodge , Mustafa Ahamed
IPC: G06F16/951 , G06F16/21 , G06F16/25 , G06F16/904 , G06F16/901 , G06F16/9038 , G06F16/903 , G06F16/248 , G06F16/2458 , G06F16/27 , G06F16/2455
Abstract: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
-
公开(公告)号:US11461334B2
公开(公告)日:2022-10-04
申请号:US15665197
申请日:2017-07-31
Applicant: Splunk Inc.
Inventor: Arindam Bhattacharjee , Sourav Pal , Alexander Douglas James , Christopher Pride
IPC: G06F16/2455 , H04L43/08 , G06F11/20 , H04L43/12 , H04L69/22 , H04L67/1097 , G06F16/27 , G06F16/951 , G06F16/2458 , G06F16/903 , H04L43/028 , H04L43/00 , G06F3/06 , G06F11/34
Abstract: Systems and methods are disclosed for processing queries against one or more dataset sources utilizing dynamically allocated partitions operating on one or more worker nodes. The results of the processing are stored in a dataset destination. The queries can identify data in the one or more dataset sources for processing and a manner for processing the data. In addition, the queries can identify the dataset destination for storing results of the query. To process the query, a query coordinator can dynamically allocate partitions operating on worker nodes to retrieve data for processing, process the data, and communicate the data to the dataset sources. In addition, the query coordinator can dynamically allocate partitions based on an identification of the dataset destination.
-
公开(公告)号:US11232100B2
公开(公告)日:2022-01-25
申请号:US15665187
申请日:2017-07-31
Applicant: Splunk Inc.
Inventor: Arindam Bhattacharjee , Sourav Pal , Christopher Pride
IPC: G06F16/242 , G06F16/22 , G06F16/2453
Abstract: Systems and methods are disclosed for processing queries against multiple dataset sources. One dataset source can include indexers that index and store data. The system can receive a query that identifies a set of data to be processed and a manner of processing the set of data. The set of data can include a first dataset that is accessible by one or more indexers and a second dataset that is accessible by one or more other dataset sources. A query coordinator can define a query processing scheme for obtaining and processing the set of data that includes a dynamic allocation of multiple layers of partitions. The partitions can operate on multiple worker nodes. The query can then be executed based on the query processing scheme.
-
-
-
-
-
-
-
-
-