-
公开(公告)号:US11294890B2
公开(公告)日:2022-04-05
申请号:US16365219
申请日:2019-03-26
Applicant: Snowflake Inc.
Inventor: Jiansheng Huang , Jiaxing Liang , Scott Ziegler , Haowei Yu , Benoit Dageville , Varun Ganesh
Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.
-
公开(公告)号:US10997163B2
公开(公告)日:2021-05-04
申请号:US16943251
申请日:2020-07-30
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Varun Ganesh , Jiansheng Huang , Jiaxing Liang , Haowei Yu , Scott Ziegler
Abstract: The subject technology at a data system, an ingest request to ingest one or more files into a table. The subject technology, after obtaining the ingest request and prior to the ingesting of the one or more files, persists the one or more files in a first file queue that corresponds to the table, the first file queue further corresponding to a client account, and the data system further comprising a second file queue that corresponds to both a second client account and a second table. The subject technology ingests, by one or more execution nodes, the one or more files into one or more micro-partitions of the table, each of the one or more micro-partitions comprising contiguous units of storage of a storage device.
-
公开(公告)号:US10977245B2
公开(公告)日:2021-04-13
申请号:US16942421
申请日:2020-07-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Varun Ganesh , Jiansheng Huang , Jiaxing Liang , Haowei Yu , Scott Ziegler
Abstract: The subject technology obtains, at a database system, an ingest request to ingest one or more files into a table of a database. The subject technology, after obtaining the ingest request and prior to the ingesting of the one or more files, persists the one or more files in a file queue that corresponds to the table. The subject technology assigns the one or more files to one or more execution nodes to be ingested into the table. The subject technology operates an ingest puller to poll the file queue. The subject technology ingests, by the one or more execution nodes, the one or more files into one or more micro-partitions of the table via one or more pipes.
-
公开(公告)号:US11494386B2
公开(公告)日:2022-11-08
申请号:US17646905
申请日:2022-01-04
Applicant: Snowflake Inc.
Inventor: Bing Li , Edward Ma , Mingli Rui , Haowei Yu , Andong Zhan
Abstract: A shared database platform can interface with a cluster computing platform over a network through a connector. The data transferred over the network can include metadata result packages that can be distributed to worker nodes of the cluster computing platform, which receive the metadata objects and access the result data for further processing on a staging platform, such as a scalable storage platform.
-
公开(公告)号:US11295009B2
公开(公告)日:2022-04-05
申请号:US17352005
申请日:2021-06-18
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Derek Denny-Brown , Isaac Kunen , Soumitr Rajiv Pandey , Jacob Salassi , Srinath Shankar , Haowei Yu , Andong Zhan
Abstract: The subject technology receives, in a computing process, a user defined function, the user defined function including code related to at least one operation to be performed. The subject technology determines by a security manager whether performing the at least one operation is permitted, the security manager determines restrictions, based at least in part on a security policy. The subject technology performs the at least one operation. The subject technology sends a result of the at least one operation to the computing process, where sending the result of the at least one operation utilizes a data transport mechanism that supports a network transfer of columnar data.
-
公开(公告)号:US10719517B1
公开(公告)日:2020-07-21
申请号:US16719218
申请日:2019-12-18
Applicant: Snowflake Inc.
Inventor: Bing Li , Edward Ma , Mingli Rui , Haowei Yu , Andong Zhan
Abstract: A shared database platform can interface with a cluster computing platform over a network through a connector. The data transferred over the network can include metadata result packages that can be distributed to worker nodes of the duster computing platform, which receive the metadata objects and access the result data for further processing on a staging platform, such as a scalable storage platform.
-
公开(公告)号:US11055280B2
公开(公告)日:2021-07-06
申请号:US16201854
申请日:2018-11-27
Applicant: Snowflake Inc.
Inventor: Jiansheng Huang , Jiaxing Liang , Scott Ziegler , Haowei Yu , Benoit Dageville , Varun Ganesh
Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.
-
公开(公告)号:US10896172B2
公开(公告)日:2021-01-19
申请号:US16720418
申请日:2019-12-19
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Varun Ganesh , Jiansheng Huang , Jiaxing Liang , Haowei Yu , Scott Ziegler
Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.
-
公开(公告)号:US20240364744A1
公开(公告)日:2024-10-31
申请号:US18428890
申请日:2024-01-31
Applicant: Snowflake Inc.
Inventor: Brandon S. Baker , Derek Denny-Brown , Michael A. Halcrow , Sven Tenzing Choden Konigsmark , Niranjan Kumar Sharma , Nitya Kumar Sharma , Haowei Yu , Andong Zhan
IPC: H04L9/40
CPC classification number: H04L63/20 , H04L63/0245 , H04L63/101
Abstract: Systems and methods are disclosed for securely executing user-defined functions within a cloud data platform. A method involves receiving, via hardware processors, a request to execute a user-defined function (UDF) contained within a sandbox process. The UDF comprises code for performing specified operations that necessitate access to external resources. To facilitate this access, a secure egress path is established using an overlay network designed to isolate the UDF's network traffic from other processes. Authentication and authorization details for the UDF are managed externally to the sandbox process, ensuring that the UDF's functionality remains orthogonal to the cloud data platform's operations. This approach enables the secure and controlled execution of UDFs, allowing them to interact with external systems while maintaining the integrity and security of the cloud data platform environment.
-
公开(公告)号:US20220129467A1
公开(公告)日:2022-04-28
申请号:US17646905
申请日:2022-01-04
Applicant: Snowflake Inc.
Inventor: Bing Li , Edward Ma , Mingli Rui , Haowei Yu , Andong Zhan
IPC: G06F16/2455 , G06F16/25 , G06F21/62 , G06F16/28 , G06F16/27
Abstract: A shared database platform can interface with a cluster computing platform over a network through a connector. The data transferred over the network can include metadata result packages that can be distributed to worker nodes of the cluster computing platform, which receive the metadata objects and access the result data for further processing on a staging platform, such as a scalable storage platform.
-
-
-
-
-
-
-
-
-