-
公开(公告)号:US11068501B2
公开(公告)日:2021-07-20
申请号:US15464272
申请日:2017-03-20
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta , Jakub Kulesza , Don Johnson , Deepak Agarwal , Tushar Jain
Abstract: A distributed database system may perform a single phase commit for transactions involving updates to multiple databases of the distributed database system. A client request may be received that involves updates to multiple database of the distributed database system. The updates may be performed at a front-end database and a back-end database. Log records indicating the updates to the front-end database may be sent to the back-end database. The log records and the updates performed at the back-end database may be committed together as a single phase commit at the back-end database. In the event of a system failure of the front-end database, log records may be requested and received from the back-end database. A restoration of the front-end database may be performed based, at least in part, on the received log records.
-
公开(公告)号:US10885023B1
公开(公告)日:2021-01-05
申请号:US14480335
申请日:2014-09-08
Applicant: Amazon Technologies, Inc.
Abstract: A database system may implement asynchronous processing for synchronous requests received at the database. A pool of request processing threads may be maintained. As access requests for the database are received from clients, an available request processing thread in the pool may process the access request. The access request may be dependent on a persistent storage I/O operation, such that processing of the access request waits at least until the persistent storage I/O operation is complete. The request processing thread may perform processing operations for the access request so that persistent storage I/O operation is performed and become available to process other access requests. A response processing thread may determine that the persistent storage I/O operation for the access request is complete, and send a response to the client.
-
公开(公告)号:US10831759B2
公开(公告)日:2020-11-10
申请号:US16179802
申请日:2018-11-02
Applicant: Amazon Technologies, Inc.
IPC: G06F17/00 , G06F16/2455 , G06F16/27
Abstract: A probabilistic counting structure such as a hyperloglog may be formed during a table scan for each of a selected set of columns. The columns may be selected based on an initial estimate of relatedness, which may be based on data types of the respective columns. An estimated cardinality of an intersection or union of columns may be formed based on an intersection of the probabilistic data structures. A join path may be determined based on the estimated cardinality of an intersection or union of the columns.
-
公开(公告)号:US10831614B2
公开(公告)日:2020-11-10
申请号:US15676933
申请日:2017-08-14
Applicant: Amazon Technologies, Inc.
Inventor: Elizabeth Sanocki , Jonathan James Pezzino , Manish Dalwadi , Anurag Windlass Gupta
Abstract: A database may provide a visualization of restoration operation granularity. Information may be obtained describing possible restoration points within a given period of time for a database. The information may include a respective state of the database associated with a point in time and a respective restoration operation type for the possible restoration points. A graphical representation may be generated illustrating the possible restoration points ordered within the given time period according to the points in time associated with the respective states of the database. The graphical representation may be provided via network-based interface to a client of the database. The graphical representation of the possible restoration points may be updated in response to changes to the given time period or database volume events that modify the possible restoration points.
-
公开(公告)号:US10747746B2
公开(公告)日:2020-08-18
申请号:US13873467
申请日:2013-04-30
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta , Pradeep Jnana Madhavarapu , Samuel James McKelvie , Yan Valerie Leshinsky
Abstract: A database system may receive a write request that specifies a modification to be made to a particular data record stored by the database system. A log record representing the modification to be made to the particular data record may be sent to a storage service of the database system. An indication (e.g., log record or other indication) that indicates a cached version of the particular data record stored in a read replica's cache is stale may be sent to a read replica. For a subsequent read of the particular data record received by the read replica, the read replica may request the particular data record from the storage service.
-
公开(公告)号:US10545979B2
公开(公告)日:2020-01-28
申请号:US15385789
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: George Steven McPherson , Mehul A. Shah , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta
IPC: G06F16/25 , G06F16/23 , G06F16/2455
Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
-
公开(公告)号:US10387399B1
公开(公告)日:2019-08-20
申请号:US14070347
申请日:2013-11-01
Applicant: Amazon Technologies, Inc.
Inventor: Samuel James McKelvie , Anurag Windlass Gupta
Abstract: A database system may implement efficient database journaling using non-volatile system memory. An update to a database may be received, and log records indicating the update may be generated. A non-volatile portion of the system memory may be directly accessed to store the generated log records such that in the event of a failure of the database, the log records may be still maintained. Log records stored to the non-volatile portion of the system memory may be identified as durably persisted, in some embodiments. Log records may be identified and sent to block-based storage, in some embodiments. Recovery from a database system failure, in some embodiments, by obtaining log records from the non-volatile portion of system memory.
-
公开(公告)号:US10180951B2
公开(公告)日:2019-01-15
申请号:US14201512
申请日:2014-03-07
Applicant: Amazon Technologies, Inc.
IPC: G06F17/30
Abstract: A database system may maintain a plurality of log records at a distributed storage system. Each of the plurality of log records may be associated with a respective change to a data page. A snapshot may be generated that is usable to read the data as of a state corresponding to the snapshot. Generating the snapshot may include generating metadata that is indicative of a particular log identifier of a particular one of the log records. Generating the snapshot may be performed without additional reading, copying, or writing of the data.
-
公开(公告)号:US10120905B2
公开(公告)日:2018-11-06
申请号:US14578841
申请日:2014-12-22
Applicant: Amazon Technologies, Inc.
Abstract: A probabilistic counting structure such as a hyperloglog may be formed during a table scan for each of a selected set of columns. The columns may be selected based on an initial estimate of relatedness, which may be based on data types of the respective columns. An estimated cardinality of an intersection or union of columns may be formed based on an intersection of the probabilistic data structures. A join path may be determined based on the estimated cardinality of an intersection or union of the columns.
-
公开(公告)号:US20180165340A1
公开(公告)日:2018-06-14
申请号:US15893496
申请日:2018-02-09
Applicant: Amazon Technologies, Inc.
Inventor: Deepak Agarwal , Anurag Windlass Gupta , Jakub Kulesza
Abstract: A distributed data warehouse system maintains data blocks on behalf of clients, and stores primary and secondary copies of data blocks on different disks or nodes in a cluster. The data warehouse system may back up data blocks in a key-value backup storage system. In response to a query targeting a data block previously stored in the cluster, the data warehouse system may determine whether a consistent, uncorrupted copy of the data block is available in the cluster (e.g., by applying a consistency check). If not (e.g., if a disk or node failed), the data warehouse system may automatically initiate an operation to restore the data block from the backup storage system, using a unique identifier of the data block to access a backup copy. The target data may be returned in a query response prior to restoring primary and secondary copies of the data block in the cluster.
-
-
-
-
-
-
-
-
-