-
公开(公告)号:US12135745B2
公开(公告)日:2024-11-05
申请号:US18461897
申请日:2023-09-06
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/00 , G06F16/22 , G06F16/2455 , G06F16/901 , G06F16/908 , G06F16/955
Abstract: Disclosed herein are embodiments of systems and methods for synchronizing file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.
-
公开(公告)号:US20230214405A1
公开(公告)日:2023-07-06
申请号:US18119775
申请日:2023-03-09
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Polita Paulus , Sahaj Saini , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
IPC: G06F16/27 , G06F16/955 , G06F16/25
CPC classification number: G06F16/27 , G06F16/9566 , G06F16/254
Abstract: The embodiments described herein provide means for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with a set of storage locations, wherein the storage integration comprises a base URL for each of the set of storage locations and wherein each storage location identifies a remote deployment where a cloud platform is hosted and a geographic region of the remote deployment. An external stage object may be bound to the storage integration, wherein the external stage object facilitates a data loading operation that is currently in progress on the first storage location. In response to detecting an outage at the first storage location, the data loading operation that was in progress on the first storage location may be continued at the second storage location using the storage integration.
-
公开(公告)号:US11620307B2
公开(公告)日:2023-04-04
申请号:US17396576
申请日:2021-08-06
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Polita Paulus , Sahaj Saini , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
IPC: G06F16/20 , G06F16/27 , G06F16/955 , G06F16/25
Abstract: Described herein are techniques for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with multiple storage locations that it can point to, as well as a designation of an active storage location. The storage integration may also be defined with base file paths for each storage location as well as a relative file path which together may serve to synchronize data loading operations between deployments when e.g., a fail-over occurs from one deployment to another. The storage integration may be replicated from a first deployment to a second deployment, and when database replication occurs, an external stage may be replicated to the second deployment and bound to the replicated storage integration. Thus, a fail-over to the second deployment may result in a seamless transition of data loading processes to the second deployment.
-
公开(公告)号:US11522860B2
公开(公告)日:2022-12-06
申请号:US17587829
申请日:2022-01-28
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Peter Povinec , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.
-
公开(公告)号:US11494438B2
公开(公告)日:2022-11-08
申请号:US17645415
申请日:2021-12-21
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/00 , G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: Disclosed herein are systems and methods for population of a file-catalog table for a file stage in a user account on a data platform. In an embodiment, a data platform receives, from a client associated with a user account, a request to populate a file-catalog table of the user account based on a plurality of files stored in a file stage of the user account. The data platform responsively executes a list-files table function with respect to the file stage to generate a database-table object having a row for each file stored in the file stage. The data platform populates the file-catalog table of the user account based on the database-table object generated by the list-files table function.
-
公开(公告)号:US11423081B1
公开(公告)日:2022-08-23
申请号:US17390969
申请日:2021-08-01
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Sukruth Komarla Sukumar , Isaac Kunen , Ju-Yi Kuo , Jonathan Leang , Edward Ma , Schuyler James Manchester , Polita Paulus , Saurin Shah , Igor Zinkovsky
IPC: G06F16/00 , G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to a execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.
-
公开(公告)号:US20220237231A1
公开(公告)日:2022-07-28
申请号:US17657548
申请日:2022-03-31
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Sukruth Komarla Sukumar , Isaac Kunen , Ju-yi Kuo , Jonathan Lee Leang , Edward Ma , Schuyler James Manchester , Polita Paulus , Saurin Shah , Igor Zinkovsky
IPC: G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF interfaces and file reference objects that can be called by other users. Upon a UDF being called, files on a stage, one or more interface objects (e.g., InputStream), and file reference objects can be implemented by execution nodes of the distributed database system. The execution nodes can implement multiple threads that are authenticated and can download file data from a staging location concurrently.
-
公开(公告)号:US20210406310A1
公开(公告)日:2021-12-30
申请号:US17244349
申请日:2021-04-29
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/901 , G06F16/908 , G06F16/2455 , G06F16/955 , G06F16/22
Abstract: Disclosed herein are systems and methods for implementing a file-catalog table for a file stage. In an embodiment, a data platform receives, from a client, a request that the data platform create a file-catalog table for a file stage of a user account associated with the client, where the file stage contains one or more files at a storage location. The user account includes a file-stage object that corresponds to the file stage and that includes an indication of the storage location. In response to receiving the request, the data platform creates the file-catalog table for the file stage. The file-catalog table lists the one or more files in the file stage, and includes at least two different metadata properties of the files listed in the file-catalog table.
-
公开(公告)号:US20250068676A1
公开(公告)日:2025-02-27
申请号:US18933173
申请日:2024-10-31
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/901 , G06F16/22 , G06F16/2455 , G06F16/908 , G06F16/955
Abstract: Disclosed herein are embodiments of systems and methods for synchronizing file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.
-
10.
公开(公告)号:US20240427755A1
公开(公告)日:2024-12-26
申请号:US18517744
申请日:2023-11-22
Applicant: Snowflake Inc.
Inventor: Selcuk Aya , Thierry Cruanes , Istvan Cseri , Benoit Dageville , Marcia Feitel , Steven P. Herbert , Dennis Huo , Xinglian Liu , Nithin Mahesh , James Malone , Subramanian Muralidhar , Muthunagappan Muthuraman , Ronald Lee Ortloff , Polita Paulus , Marianne Shaw , Nileema Shingte , Wai Sing Wong , Jiaqi Yan
IPC: G06F16/22 , G06F16/215 , G06F16/2457
Abstract: The subject technology provides embodiments for supporting a unified table which may be a managed table or an unmanaged table. Managed tables are those where the subject technology manages the metastore/catalog for the table, whereas unmanaged tables are tables where an external catalog controls the table and the subject technology integrates with that catalog to work with the table, but does not assume control of the table.
-
-
-
-
-
-
-
-
-