SYSTEM AND METHOD FOR DETERMINING AN AMOUNT OF VIRTUAL MACHINES FOR USE WITH EXTRACT, TRANSFORM, LOAD (ETL) PROCESSES

    Publication No.: US20200334089A1

    Publication date: 2020-10-22

    Application No.: US16852509

    Application date: 2020-04-19

    Abstract: In accordance with an embodiment, described herein are systems and methods for determining or allocating an amount, quantity, or number of compute instances or virtual machines for use with extract, transform, load (ETL) processes. In an example embodiment, a particular (e.g., optimal) number of virtual machines (VMs) can be determined by predicting ETL completion times for customers, using historical data. ETL processes can be simulated with an initial/particular number of virtual machines. If the predicted duration is greater than the desired duration, the number of virtual machines can be incremented, and the simulation repeated. Actual completion times from ETL processes can be fed back, to update a determined number of compute instances or virtual machines. In accordance with an embodiment, the system can be used, for example, to generate alerts associated with customer service level agreements (SLAs).
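
The simulate-and-increment loop described in this abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration and not taken from the patent: `predict_duration` is a hypothetical stand-in predictor that spreads historical task durations over the VMs longest-first, and `determine_vm_count`, `initial_vms`, and `max_vms` are hypothetical names.

```python
from typing import Sequence


def predict_duration(task_minutes: Sequence[float], vm_count: int) -> float:
    """Hypothetical predictor: assign historical task durations to VMs
    longest-first, always onto the least-loaded VM, and return the
    finishing time of the busiest VM."""
    loads = [0.0] * vm_count
    for minutes in sorted(task_minutes, reverse=True):
        least_loaded = loads.index(min(loads))
        loads[least_loaded] += minutes
    return max(loads)


def determine_vm_count(
    task_minutes: Sequence[float],
    desired_minutes: float,
    initial_vms: int = 1,
    max_vms: int = 64,  # assumed safety cap, not part of the abstract
) -> int:
    """Increment the VM count until the predicted duration fits the target."""
    vms = initial_vms
    while vms <= max_vms:
        if predict_duration(task_minutes, vms) <= desired_minutes:
            return vms
        vms += 1  # predicted duration too long: add a VM and simulate again
    raise ValueError("desired duration not reachable within max_vms")


if __name__ == "__main__":
    history = [30, 30, 20, 20, 10, 10]  # made-up task durations in minutes
    print(determine_vm_count(history, desired_minutes=60))
```

The feedback step mentioned in the abstract would correspond to refreshing `task_minutes` with actual completion times and rerunning the loop.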

    SYSTEM AND METHOD FOR AUTOMATIC GENERATION OF EXTRACT, TRANSFORM, LOAD (ETL) ASSERTS

    Publication No.: US20200334267A1

    Publication date: 2020-10-22

    Application No.: US16851869

    Application date: 2020-04-17

    Abstract: In accordance with an embodiment, described herein are systems and methods for use with an analytic applications environment, for automatic generation of asserts in such environments. A data pipeline or process, such as, for example, an extract, transform, load (ETL) process, can operate in accordance with an analytic applications schema adapted to address particular analytics use cases or best practices, to receive data from a customer's (tenant's) enterprise software application or data environment, for loading into a data warehouse instance. Each customer (tenant) can additionally be associated with a customer tenancy and a customer schema. During the process of populating a data warehouse instance, the system can automatically generate dynamic data-driven ETL asserts, including determining a list of columns for tables in the data warehouse; determining a data type for each column; generating an assert for each determined data type; validating the generated assert; and maintaining the generated assert.
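
As a rough illustration of the column-to-assert steps listed in this abstract, the sketch below maps each column's data type to a type check and runs the checks against sample rows. The type names, the `TYPE_CHECKS` table, and the functions `generate_asserts` and `run_asserts` are all hypothetical; the abstract does not specify what form an assert takes or how it is validated.

```python
from datetime import date
from typing import Any, Callable, Dict, List

# Hypothetical mapping from a column data type to a value check.
TYPE_CHECKS: Dict[str, Callable[[Any], bool]] = {
    "NUMBER": lambda v: isinstance(v, (int, float)) and not isinstance(v, bool),
    "VARCHAR": lambda v: isinstance(v, str),
    "DATE": lambda v: isinstance(v, date),
}


def generate_asserts(columns: Dict[str, str]) -> Dict[str, Callable[[Any], bool]]:
    """Build one assert per column from its data type; unknown types are skipped."""
    return {
        name: TYPE_CHECKS[dtype]
        for name, dtype in columns.items()
        if dtype in TYPE_CHECKS
    }


def run_asserts(
    asserts: Dict[str, Callable[[Any], bool]],
    rows: List[Dict[str, Any]],
) -> List[str]:
    """Apply each assert to each non-null value and collect failure messages."""
    failures = []
    for index, row in enumerate(rows):
        for column, check in asserts.items():
            value = row.get(column)
            if value is not None and not check(value):
                failures.append(f"row {index}: {column}={value!r}")
    return failures


if __name__ == "__main__":
    columns = {"ORDER_ID": "NUMBER", "STATUS": "VARCHAR", "ORDER_DATE": "DATE"}
    rows = [
        {"ORDER_ID": 17, "STATUS": "OPEN", "ORDER_DATE": date(2020, 4, 17)},
        {"ORDER_ID": "x17", "STATUS": "OPEN", "ORDER_DATE": None},
    ]
    print(run_asserts(generate_asserts(columns), rows))
```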

    SYSTEM AND METHOD FOR RANKING OF DATABASE TABLES FOR USE WITH EXTRACT, TRANSFORM, LOAD PROCESSES

    Publication No.: US20210049183A1

    Publication date: 2021-02-18

    Application No.: US17076164

    Application date: 2020-10-21

    Abstract: In accordance with various embodiments, described herein are systems and methods for use with an analytic applications environment, for ranking of database tables for use in controlling extract, transform, load (ETL) processes. In accordance with an embodiment, the system uses a ranking algorithm or process to rank database tables and/or table columns associated with a set of data. The table/column rankings can then be used to prioritize ETL processing of a customer's data for use with a data warehouse or other data analytics environment. In accordance with an embodiment, the method includes determining a global rank; a business rank; and a tenant or customer-specific rank, for a plurality of tables and columns in a customer's database; and aggregating or otherwise using the determined rankings to control the ETL process for a particular customer (tenant), to load their data into the data warehouse.
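
One way to picture the aggregation of the three ranks is a weighted sum that orders tables for ETL processing. This is a sketch under stated assumptions: the weighted sum, the weight values, the score scale, and the names `TableRank`, `aggregate_rank`, and `etl_order` are hypothetical, since the abstract only says the rankings are aggregated or otherwise used.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class TableRank:
    table: str
    global_rank: float    # assumed: importance across all customers
    business_rank: float  # assumed: importance to the business area
    tenant_rank: float    # assumed: importance to this tenant


def aggregate_rank(
    rank: TableRank,
    weights: Tuple[float, float, float] = (0.2, 0.3, 0.5),  # hypothetical weights
) -> float:
    """Combine the three ranks into one score via a weighted sum."""
    w_global, w_business, w_tenant = weights
    return (
        w_global * rank.global_rank
        + w_business * rank.business_rank
        + w_tenant * rank.tenant_rank
    )


def etl_order(ranks: Sequence[TableRank]) -> List[str]:
    """Return table names with the highest aggregated score first."""
    ordered = sorted(ranks, key=aggregate_rank, reverse=True)
    return [r.table for r in ordered]


if __name__ == "__main__":
    ranks = [
        TableRank("INVOICES", 0.9, 0.8, 0.2),
        TableRank("ORDERS", 0.5, 0.6, 0.9),
        TableRank("AUDIT_LOG", 0.3, 0.1, 0.1),
    ]
    print(etl_order(ranks))
```

Weighting the tenant rank most heavily is only one possible design choice; it lets a customer's own usage override the global ordering.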

    System and method for ranking of database tables for use with extract, transform, load processes

    Publication No.: US12248490B2

    Publication date: 2025-03-11

    Application No.: US17076164

    Application date: 2020-10-21

    Abstract: In accordance with various embodiments, described herein are systems and methods for use with an analytic applications environment, for ranking of database tables for use in controlling extract, transform, load (ETL) processes. In accordance with an embodiment, the system uses a ranking algorithm or process to rank database tables and/or table columns associated with a set of data. The table/column rankings can then be used to prioritize ETL processing of a customer's data for use with a data warehouse or other data analytics environment. In accordance with an embodiment, the method includes determining a global rank; a business rank; and a tenant or customer-specific rank, for a plurality of tables and columns in a customer's database; and aggregating or otherwise using the determined rankings to control the ETL process for a particular customer (tenant), to load their data into the data warehouse.

    SYSTEM AND METHOD FOR DETERMINATION OF RECOMMENDATIONS AND ALERTS IN AN ANALYTICS ENVIRONMENT

    Publication No.: US20200334608A1

    Publication date: 2020-10-22

    Application No.: US16851872

    Application date: 2020-04-17

    Abstract: In accordance with an embodiment, described herein are systems and methods for use with an analytic applications environment, for determination of recommendations and alerts in such environments. A data pipeline or process can operate in accordance with an analytic applications schema adapted to address particular analytics use cases or best practices, to receive data from a customer's (tenant's) enterprise software application or data environment, for loading into a data warehouse instance. When provided as part of a software-as-a-service (SaaS) or cloud environment, the data sourced from a plurality of organizations can be aggregated, to leverage information gleaned from the collective or shared data. The system can be used to generate semantic alerts, including obtaining permission from, and analyzing the collective data of, the plurality of organizations, to determine operational advantages indicated by the data, and providing alerts associated with those operational advantages.
