-
公开(公告)号:US12038906B2
公开(公告)日:2024-07-16
申请号:US18054870
申请日:2022-11-11
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta , Neal Fachan , Samuel James McKelvie , Laurion Darrell Burchall , Christopher Richard Newcombe , Pradeep Jnana Madhavarapu , Benjamin Tobler , James McClellan Corey
CPC classification number: G06F16/2365 , G06F11/1451 , G06F11/1471 , G06F16/23 , G06F11/2094 , G06F2201/80
Abstract: A database system may include a database service and a separate distributed storage service. The database service (or a database engine head node thereof) may be responsible for query parsing, optimization, and execution, transactionality, and consistency, while the storage service may be responsible for generating data pages from redo log records and for durability of those data pages. For example, in response to a write request directed to a particular data page, the database engine head node may generate a redo log record and send it, but not the data page, to a storage service node. The storage service node may store the redo log record and return a write acknowledgement to the database service prior to applying the redo log record. The server node may apply the redo log record and other redo log records to a previously stored version of the data page to create a current version.
-
公开(公告)号:US12013856B2
公开(公告)日:2024-06-18
申请号:US16102545
申请日:2018-08-13
Applicant: Amazon Technologies, Inc.
Inventor: Bhaven Avalani , Andrew Edward Caldwell , Naresh Chainani , Martin Grund , Anurag Windlass Gupta , Frederick Ryan Johnson , Ippokratis Pandis , Michail Petropoulos , Srividhya Srinivasan
IPC: G06F16/24 , G06F16/2453 , G06F16/2455 , G06F16/28
CPC classification number: G06F16/2455 , G06F16/2453 , G06F16/283 , G06F16/285
Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.
-
3.
公开(公告)号:US11941016B2
公开(公告)日:2024-03-26
申请号:US17687492
申请日:2022-03-04
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Jones , Andrew Borthwick , Sergei Dobroshinsky , Shehzad Qureshi , Stephen Michael Ash , Pedrito Uriah Maynard-Zhang , Chethan Kommaranahalli Rudramuni , Abhishek Sharma , Juliana Saussy , Adam Lawrence Joseph Heinermann , Alaykumar Navinchandra Desai , Mehul A. Shah , Mehul Y. Shah , Anurag Windlass Gupta , Prajakta Datta Damle
CPC classification number: G06F16/254 , G06F9/543 , G06N20/00
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model to as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
-
公开(公告)号:US11494339B2
公开(公告)日:2022-11-08
申请号:US16029430
申请日:2018-07-06
Applicant: Amazon Technologies, Inc.
Inventor: Stefano Stefani , Anurag Windlass Gupta
IPC: G06F16/174 , H03M7/30 , G06F16/22
Abstract: Data to be stored in a data block for a columnar database table may be compressed according to a multi-level compression scheme. Data to be stored in the data block may be received. The data may be compressed according a column-specific compression technique to produce compressed data. The compressed data may then be compressed according to a second compression technique different than the column-specific compression technique to produce multi-level compressed data. The multi-level compressed data may be stored in the data block. When reading from the data block, multi-level compressed data may be decompressed according to the column-specific compression technique and the default compression technique applied to the data.
-
公开(公告)号:US20220279049A1
公开(公告)日:2022-09-01
申请号:US17694585
申请日:2022-03-14
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Supratik Chakraborty , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: H04L67/00 , H04L67/10 , G06F9/4401 , G06F9/50 , H04L67/01
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
-
公开(公告)号:US11429630B2
公开(公告)日:2022-08-30
申请号:US16870761
申请日:2020-05-08
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta , Andrew Edward Caldwell
Abstract: Tiered storage may be implemented for processing data. Data processors may maintain some of a data set, including user data and metadata describing the user data, locally. The data set is also maintained a data store remote to the data processor. When processing requests are received, a determination is made as to whether the local portions of the data set can execute the processing request or one or more additional portions of the data set are needed from the remote data store. If additional portions of the data set are needed, then a request may be sent to the data store for the additional portions. Once received, the data processor may execute the processing request utilizing the additional portions. Portions of the data set maintained locally at the data processor may be selected and flushed from local storage to the remote data store.
-
公开(公告)号:US11269846B2
公开(公告)日:2022-03-08
申请号:US16543132
申请日:2019-08-16
Applicant: Amazon Technologies, Inc.
Inventor: Samuel James McKelvie , Anurag Windlass Gupta
IPC: G06F16/23
Abstract: A database system may implement efficient database journaling using non-volatile system memory. An update to a database may be received, and log records indicating the update may be generated. A non-volatile portion of the system memory may be directly accessed to store the generated log records such that in the event of a failure of the database, the log records may be still maintained. Log records stored to the non-volatile portion of the system memory may be identified as durably persisted, in some embodiments. Log records may be identified and sent to block-based storage, in some embodiments. Recovery from a database system failure, in some embodiments, by obtaining log records from the non-volatile portion of system memory.
-
公开(公告)号:US10909091B1
公开(公告)日:2021-02-02
申请号:US15360787
申请日:2016-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Aakash Shah , Kamal Kant Gupta , Alexandre Olegovich Verbitski , Sailesh Krishnamurthy , Hyungsoo Jung , Anurag Windlass Gupta , Zhaohui Zhang , Tengiz Kharatishvili , Stefano Stefani
Abstract: A data store may implement on-demand data schema modifications. Request to change a schema for a data set in a data store may be received and a description of the change recorded as part of a schema history for the data set. The request to change the schema may then be acknowledged. When access requests directed to the data set are received at the data store, the schema history for the data set may be evaluated. If the schema history indicates that data that is to be accessed in order to service the data store needs to include one or more changes in the schema history, then a version of the schema that includes the one or more changes may be applied to the data.
-
公开(公告)号:US10860562B1
公开(公告)日:2020-12-08
申请号:US14484991
申请日:2014-09-12
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta , Mengchu Cai , Andrew Scott Richardson
IPC: G06F16/22 , G06F16/2453
Abstract: A data store may implement dynamic query predicates for processing queries. Query predicate indexes may be maintained for individual storage locations in a data store. The query predicate indexes may indicate data values not stored in the storage location that satisfy a query predicate in the query predicate index. The query predicate index may be populated when queries are received at the data store which include new query predicates. When queries are received that include query predicates that correspond to query predicates in the query predicate index, the index values for each of the storage locations may be evaluated. Based, at least in part, on the evaluation, particular storage locations may be identified to read for servicing the queries.
-
公开(公告)号:US10776368B1
公开(公告)日:2020-09-15
申请号:US15650704
申请日:2017-07-14
Applicant: Amazon Technologies, Inc.
Inventor: George Constantin Caragea , Andrew Edward Caldwell , Anurag Windlass Gupta , Michail Petropoulos
IPC: G06F17/00 , G06F16/2458 , G06F16/2453
Abstract: Cardinality values can be derived from an approximate quantile summary. An approximate quantile summary can be generated for a column of a database table at data ingestion, data update, upon request, in response to a query, and in various other scenarios. When a query is received that includes a predicate directed to the column of the approximate quantile summary, a cardinality value may be derived from the boundary values of one or more quantiles that include the predicate. The cardinality value may then be used to select a query plan. The query may be performed according to the selected query plan.
-
-
-
-
-
-
-
-
-