Document store with non-uniform memory access aware high performance query processing

    Publication number: US10678793B2

    Publication date: 2020-06-09

    Application number: US15354841

    Application date: 2016-11-17

    Applicant: SAP SE

    Abstract: Systems and methods are described for implementation by a computing system having a non-uniform memory access (NUMA) architecture comprising a plurality of NUMA nodes. A query of a document store storing a collection of slices each comprising one or more documents is received from a client. Thereafter, it is determined which of the slices within the document store are required for execution of the query. An execution plan is then generated, using the query, that comprises a plurality of nodes each specifying at least one database operation to execute a portion of the query. The execution plan assigns one of the plurality of NUMA nodes to each slice determined to be required for execution of the query. The database operations specified by the nodes of the execution plan are then executed using the corresponding assigned NUMA nodes for the associated slice. Data responsive to the query is then provided to the client.
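
The slice-to-NUMA-node assignment described in this abstract can be illustrated with a minimal sketch. The abstract does not say how a NUMA node is chosen for a slice, so the sketch assumes each slice is assigned to the node whose local memory already holds it. `PlanNode`, `build_plan`, `group_by_numa`, and the `slice_home` mapping are hypothetical names, not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlanNode:
    """One node of the execution plan: an operation on one slice."""
    slice_id: int
    numa_node: int
    operation: str


def build_plan(required_slices, slice_home, operation):
    # Assumption: a slice is assigned to the NUMA node that holds its memory.
    return [PlanNode(s, slice_home[s], operation) for s in sorted(required_slices)]


def group_by_numa(plan):
    # Group plan nodes so each NUMA node only runs work on its own slices.
    groups = {}
    for node in plan:
        groups.setdefault(node.numa_node, []).append(node)
    return groups


# Hypothetical layout: four slices spread over two NUMA nodes.
slice_home = {0: 0, 1: 1, 2: 0, 3: 1}
# Suppose the query only needs slices 0, 2, and 3.
plan = build_plan({0, 2, 3}, slice_home, "filter")
groups = group_by_numa(plan)
```

In a real system each group would be handed to worker threads pinned to the corresponding NUMA node; the sketch stops at the planning step.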

    Management of transactions spanning different database types

    Publication number: US10565187B2

    Publication date: 2020-02-18

    Application number: US15354900

    Application date: 2016-11-17

    Applicant: SAP SE

    Abstract: A database system can receive, from a client, a query specifying at least one database operation implicating data within a first database and within a second database. The first database can have a first transaction manager and the second database can have a second transaction manager. Thereafter, a plurality of transactions are initiated that are responsive to the query to access the data within the first database and the second database. The first transaction manager and the second transaction manager together coordinate execution of the plurality of transactions. All changes specified by the at least one database operation are persisted in persistence of the first database including changes implicating data stored within the second database. Data responsive to the query can then optionally be provided to the client. Related apparatus, systems, techniques and articles are also described.

    Database systems architecture incorporating distributed log

    Publication number: US10474668B2

    Publication date: 2019-11-12

    Application number: US15354883

    Application date: 2016-11-17

    Applicant: SAP SE

    Abstract: A query is received from a client that specifies at least one database operation implicating data within a first database and within a second database. The first database can have a first transaction manager and the second database can have a second transaction manager. Thereafter, a plurality of transactions responsive to the query are initiated to access the data within the first database and the second database. Execution of the plurality of transactions is then coordinated using the first transaction manager and the second transaction manager. All changes specified by the at least one database operation are logged in a distributed log in the first database including changes implicating data stored within the second database. The distributed log is configured such that it is accessible by both of the first transaction manager and the second transaction manager.

    DOCUMENT STORE EXPORT/IMPORT
    Type: Patent application

    Publication number: US20190340278A1

    Publication date: 2019-11-07

    Application number: US15969124

    Application date: 2018-05-02

    Applicant: SAP SE

    Abstract: Provided are systems and methods to acquire a manifest data file and a binary copy of a document store slice log having a first filename and comprising a plurality of entries, each of the plurality of entries associated with a respective database transaction and a commit identifier, and the manifest data file indicating one or more data partitions and one or more slices of each of the one or more data partitions, acquire a partition log for one of the indicated one or more data partitions, the generated partition log comprising a log entry associated with a transaction to create a slice and including a second filename associated with the slice, sort the plurality of entries of the document store slice log based on their associated commit identifiers, change the commit identifiers of each of the plurality of entries to a same commit identifier, and store the sorted and changed entries in a second document store slice log having the second filename.
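
The sort-and-rewrite step at the end of this abstract can be sketched as follows. This is only an illustration of that one step: reading the manifest and the partition log is left out, and `LogEntry`, `rewrite_slice_log`, and the field names are hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class LogEntry:
    transaction_id: int
    commit_id: int
    payload: str


def rewrite_slice_log(entries, import_commit_id):
    # Sort by the original commit identifiers so replay order is preserved,
    # then give every entry the same commit identifier.
    ordered = sorted(entries, key=lambda e: e.commit_id)
    return [replace(e, commit_id=import_commit_id) for e in ordered]


# Hypothetical exported slice log, deliberately out of commit order.
exported = [
    LogEntry(7, 30, "update"),
    LogEntry(5, 10, "insert"),
    LogEntry(6, 20, "delete"),
]
imported = rewrite_slice_log(exported, import_commit_id=1)
```

The resulting entries would then be written to the second slice log under the filename taken from the partition log.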

    SYSTEM AND METHODS FOR PROVIDING A MEMORY SLICE DATA STRUCTURE FOR USE WITH DIVERSE MEMORY AND STORAGE MEDIA

    Publication number: US20190339894A1

    Publication date: 2019-11-07

    Application number: US15972467

    Application date: 2018-05-07

    Applicant: SAP SE

    Abstract: A data storage and retrieval system for a computer memory including a memory slice formed of segments and adapted to contain one or more documents and a checkpoint adapted to persist the memory slice. The checkpoint includes a document vector containing a document pointer corresponding to a document. The document pointer includes a segment identifier identifying a logical segment of the memory slice and an offset value defining a relative memory location of the first document within the identified segment. There are checkpoint memory blocks, each storing a copy of a corresponding segment of the memory slice. The segment identifier of the document pointer identifies a checkpoint memory block and the offset value of the document pointer defines a relative location of the document within the checkpoint memory block.
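
A small sketch may help show why a pointer made of a segment identifier and an offset works unchanged against both the in-memory slice and its checkpoint. The names `DocumentPointer` and `read_document` are hypothetical, and passing an explicit document length is an assumption, since the abstract does not say how the end of a document is determined.

```python
from typing import NamedTuple


class DocumentPointer(NamedTuple):
    segment_id: int  # logical segment of the memory slice
    offset: int      # position of the document relative to the segment start


def read_document(blocks, pointer, length):
    # The same pointer resolves against in-memory segments or checkpoint
    # blocks, because it stores no absolute address.
    block = blocks[pointer.segment_id]
    return block[pointer.offset:pointer.offset + length]


# In-memory slice made of two segments.
segments = {0: b'{"a":1}{"b":2}', 1: b'{"c":3}'}
# Checkpoint: one block per segment, each a byte-for-byte copy.
checkpoint_blocks = {seg_id: bytes(data) for seg_id, data in segments.items()}

ptr = DocumentPointer(segment_id=0, offset=7)
from_memory = read_document(segments, ptr, 7)
from_checkpoint = read_document(checkpoint_blocks, ptr, 7)
```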

    Interleaved storage of dictionary blocks in a page chain

    Publication number: US10289709B2

    Publication date: 2019-05-14

    Application number: US14949635

    Application date: 2015-11-23

    Applicant: SAP SE

    Abstract: A first block in a page chain can be assigned to a first dictionary for a first column of a database. The page chain can include a plurality of dictionary blocks allocated to a plurality of dictionaries of a plurality of columns including the first column such that the plurality of dictionary blocks is interleaved in the page chain. A new unique value to be added to the first column can be appended to an end of the first dictionary. The plurality of dictionaries can be loaded from the page chain into memory of one or more computers performing operations of a database management system. The loading can include scanning the page chain a single time and assigning the plurality of dictionary blocks to the respective dictionaries of the plurality of dictionaries.
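
The single-scan load described above can be sketched by modelling the page chain as a list of blocks, each tagged with the column whose dictionary it belongs to. This representation and the function name `load_dictionaries` are assumptions made for illustration only.

```python
def load_dictionaries(page_chain):
    # One pass over the chain: each block is appended to the dictionary of
    # the column it is tagged with, so interleaved blocks end up in order.
    dictionaries = {}
    for column, values in page_chain:
        dictionaries.setdefault(column, []).extend(values)
    return dictionaries


# Hypothetical page chain with blocks of two dictionaries interleaved.
page_chain = [
    ("city", ["Berlin", "Paris"]),
    ("name", ["Ann"]),
    ("city", ["Rome"]),
    ("name", ["Bob", "Cy"]),
]
dictionaries = load_dictionaries(page_chain)
```

Because new values are only ever appended, the position of a value in its list can serve as a stable value identifier in this sketch.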

    TRANSACTION COMMIT PROTOCOL WITH RECOVERABLE COMMIT IDENTIFIER

    Publication number: US20180232412A1

    Publication date: 2018-08-16

    Application number: US15430091

    Application date: 2017-02-10

    Applicant: SAP SE

    Abstract: Technologies are described for facilitating database system recovery in a distributed database environment having a master node and at least one slave node. The master node maintains an in-memory reference that includes transaction identifiers and their corresponding commit identifiers. When a slave node requests that a commit identifier be assigned to a transaction, the transaction and commit identifiers for the transaction are added to the in-memory reference. The commit identifier is sent by the master node to the slave node. The slave node writes a log entry to a commit log that includes the transaction identifier and the assigned commit identifier. If the database system is interrupted before the slave node writes the commit log entry, the slave node can request the commit identifier for the transaction from the master node, which retrieves the identifier from the in-memory reference.
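
The recovery path in this abstract can be sketched with two small classes. This is a single-process illustration under stated assumptions: there is no networking or durable storage, the interruption is simulated with a flag, and `MasterNode`, `SlaveNode`, and their methods are hypothetical names.

```python
class MasterNode:
    def __init__(self):
        self._next_commit_id = 1
        self._reference = {}  # in-memory reference: transaction id -> commit id

    def assign_commit_id(self, txn_id):
        # Record the pair before handing the commit identifier out.
        if txn_id not in self._reference:
            self._reference[txn_id] = self._next_commit_id
            self._next_commit_id += 1
        return self._reference[txn_id]

    def lookup_commit_id(self, txn_id):
        return self._reference.get(txn_id)


class SlaveNode:
    def __init__(self, master):
        self.master = master
        self.commit_log = []  # list of (transaction id, commit id)

    def commit(self, txn_id, interrupted=False):
        commit_id = self.master.assign_commit_id(txn_id)
        if interrupted:
            return None  # simulated interruption before the log write
        self.commit_log.append((txn_id, commit_id))
        return commit_id

    def recover(self, txn_id):
        # Ask the master for the identifier it already assigned.
        commit_id = self.master.lookup_commit_id(txn_id)
        if commit_id is not None and (txn_id, commit_id) not in self.commit_log:
            self.commit_log.append((txn_id, commit_id))
        return commit_id


master = MasterNode()
slave = SlaveNode(master)
first = slave.commit(100)
lost = slave.commit(101, interrupted=True)
recovered = slave.recover(101)
```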

    Document Store Utilizing Partial Object Compression

    Publication number: US20180137176A1

    Publication date: 2018-05-17

    Application number: US15354807

    Application date: 2016-11-17

    Applicant: SAP SE

    Abstract: A query is received by a document store from a client. The query specifies at least one database operation implicating documents within the document store. At least one of the implicated documents has partially compressed objects in which only a portion of the document is compressed. Thereafter, the documents are scanned to identify the compressed objects and to identify decompression keys. The scanned compressed objects are then decompressed using the corresponding decompression keys. Data responsive to the query can then be provided to the client. Related apparatus, systems, techniques and articles are also described.

    FLEXIBLE TEXT SEARCHING FOR DATA OBJECTS OF OBJECT NOTATION

    Publication number: US20170177672A1

    Publication date: 2017-06-22

    Application number: US14971883

    Application date: 2015-12-16

    Applicant: SAP SE

    CPC classification number: G06F17/30424 G06F17/30418

    Abstract: Flexible text searching for data objects can be implemented. For example, inexact matching of key name terms can be supported. An inverted index can be used to find data objects having key names with terms that are a superset of the key name term being searched. A similar index can be used for values, and the search results can be determined via an intersection. Pattern searching and other features can be supported. Text searching for schema-less collections of data objects can be supported for use in enterprise computing scenarios.
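
The combination of a key-name index, a value index, and an intersection can be sketched as below. This is a simplified illustration, not the patented method: the indexes are kept per object rather than per key-value pair, key names are split into terms on non-alphanumeric characters, and all function names are hypothetical.

```python
import re
from collections import defaultdict


def terms(text):
    # Split on anything that is not a letter or digit: "first_name" -> {"first", "name"}.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def build_indexes(objects):
    key_index = defaultdict(set)    # key-name term -> object ids
    value_index = defaultdict(set)  # value term -> object ids
    for obj_id, obj in objects.items():
        for key, value in obj.items():
            for t in terms(key):
                key_index[t].add(obj_id)
            for t in terms(str(value)):
                value_index[t].add(obj_id)
    return key_index, value_index


def lookup(index, query):
    # Objects whose indexed terms are a superset of the query terms.
    sets = [index.get(t, set()) for t in terms(query)]
    return set.intersection(*sets) if sets else set()


def search(key_index, value_index, key_query, value_query):
    return lookup(key_index, key_query) & lookup(value_index, value_query)


objects = {
    1: {"first_name": "Ada", "city": "London"},
    2: {"last_name": "Ada"},
    3: {"first_name": "Bob"},
}
key_index, value_index = build_indexes(objects)
loose = search(key_index, value_index, "name", "ada")
strict = search(key_index, value_index, "first name", "ada")
```

Searching for the key term "name" matches both `first_name` and `last_name`, which is the inexact key-name matching the abstract mentions; adding "first" narrows the result.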

    ENCODED DATA OBJECT NOTATION PERSISTENCE FORMAT

    Publication number: US20170177604A1

    Publication date: 2017-06-22

    Application number: US14971852

    Application date: 2015-12-16

    Applicant: SAP SE

    CPC classification number: G06F8/4434

    Abstract: An encoded data object notation persistence format can be used in a variety of scenarios to advantage. The encoded format accommodates key name-value pairs and can employ dictionary compression to reduce the amount of storage needed to represent a data object. Searching and querying a collection of encoded data objects can be done more efficiently by taking advantage of the format.
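
Dictionary compression of key names can be sketched in a few lines. The abstract does not specify the encoding layout, so the sketch assumes the simplest variant: each distinct key name is stored once and replaced by a small integer in every object. `encode` and `decode` are hypothetical names.

```python
def encode(objects):
    dictionary = {}  # key name -> integer id, shared by the whole collection
    encoded = []
    for obj in objects:
        encoded.append(
            {dictionary.setdefault(key, len(dictionary)): value
             for key, value in obj.items()}
        )
    return dictionary, encoded


def decode(dictionary, encoded):
    names = {key_id: name for name, key_id in dictionary.items()}
    return [{names[key_id]: value for key_id, value in obj.items()}
            for obj in encoded]


objects = [{"name": "Ada", "city": "London"}, {"name": "Bob"}]
dictionary, encoded = encode(objects)
restored = decode(dictionary, encoded)
```

A query on a key name can then be translated to its integer id once and compared against the encoded objects without decoding them, which is one way the format could make searching cheaper.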
