Batch data ingestion in database systems

    公开(公告)号:US11294890B2

    公开(公告)日:2022-04-05

    申请号:US16365219

    申请日:2019-03-26

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.

    Data ingestion using file queues
    2.
    发明授权

    公开(公告)号:US10997163B2

    公开(公告)日:2021-05-04

    申请号:US16943251

    申请日:2020-07-30

    Applicant: Snowflake Inc.

    Abstract: The subject technology at a data system, an ingest request to ingest one or more files into a table. The subject technology, after obtaining the ingest request and prior to the ingesting of the one or more files, persists the one or more files in a first file queue that corresponds to the table, the first file queue further corresponding to a client account, and the data system further comprising a second file queue that corresponds to both a second client account and a second table. The subject technology ingests, by one or more execution nodes, the one or more files into one or more micro-partitions of the table, each of the one or more micro-partitions comprising contiguous units of storage of a storage device.

    Batch data ingestion
    3.
    发明授权

    公开(公告)号:US10977245B2

    公开(公告)日:2021-04-13

    申请号:US16942421

    申请日:2020-07-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology obtains, at a database system, an ingest request to ingest one or more files into a table of a database. The subject technology, after obtaining the ingest request and prior to the ingesting of the one or more files, persists the one or more files in a file queue that corresponds to the table. The subject technology assigns the one or more files to one or more execution nodes to be ingested into the table. The subject technology operates an ingest puller to poll the file queue. The subject technology ingests, by the one or more execution nodes, the one or more files into one or more micro-partitions of the table via one or more pipes.

    Batch data ingestion in database systems

    公开(公告)号:US11055280B2

    公开(公告)日:2021-07-06

    申请号:US16201854

    申请日:2018-11-27

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.

    Batch data ingestion in database systems

    公开(公告)号:US10896172B2

    公开(公告)日:2021-01-19

    申请号:US16720418

    申请日:2019-12-19

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.

    SECURE EXECUTION OF USER-DEFINED FUNCTIONS
    9.
    发明公开

    公开(公告)号:US20240364744A1

    公开(公告)日:2024-10-31

    申请号:US18428890

    申请日:2024-01-31

    Applicant: Snowflake Inc.

    CPC classification number: H04L63/20 H04L63/0245 H04L63/101

    Abstract: Systems and methods are disclosed for securely executing user-defined functions within a cloud data platform. A method involves receiving, via hardware processors, a request to execute a user-defined function (UDF) contained within a sandbox process. The UDF comprises code for performing specified operations that necessitate access to external resources. To facilitate this access, a secure egress path is established using an overlay network designed to isolate the UDF's network traffic from other processes. Authentication and authorization details for the UDF are managed externally to the sandbox process, ensuring that the UDF's functionality remains orthogonal to the cloud data platform's operations. This approach enables the secure and controlled execution of UDFs, allowing them to interact with external systems while maintaining the integrity and security of the cloud data platform environment.

Patent Agency Ranking