LARGE SCALE UNSTRUCTURED DATABASE SYSTEMS
    52.
    发明申请

    公开(公告)号:US20200301941A1

    公开(公告)日:2020-09-24

    申请号:US16895340

    申请日:2020-06-08

    Applicant: MongoDB, Inc.

    Abstract: Systems, methods, and computer-readable media provide techniques for on-demand access to object data hosted in distributed object storage. A query is received for the distributed object storage database, wherein the distributed object storage database includes first object data stored in a first data center at a first geographical location, and second object data stored in a second data center at a second geographical location. It is determined that the query is for at least a portion of the first object data stored by the first data center. The query is transmitted to a first computing device in the first data center. Results are received from the first computing device for the query, and the results are provided in response to the received query.

    METHOD AND APPARATUS FOR READING AND WRITING COMMITTED DATA

    公开(公告)号:US20200285549A1

    公开(公告)日:2020-09-10

    申请号:US16883653

    申请日:2020-05-26

    Applicant: MongoDB, Inc.

    Abstract: According to some aspects, a database system comprising a processor configured to execute a plurality of system components is provided. The plurality of system components may include an interface component configured to receive a write commit command and provide a write commit confirmation, a snapshot component configured to generate a plurality of snapshots of data stored in a data storage node of a plurality of data storage nodes and identify a committed snapshot representative of data that has been replicated on a majority of the plurality of data storage nodes, and a command processing component configured to modify a data element based on the write commit command, determine whether the majority of the plurality of storage nodes have replicated the modification using the committed snapshot, and generate the write commit confirmation responsive to a determination that the majority of the plurality of data storage nodes have replicated the modification.

    System and method for optimizing data migration in a partitioned database

    公开(公告)号:US10740355B2

    公开(公告)日:2020-08-11

    申请号:US15654590

    申请日:2017-07-19

    Applicant: MongoDB, Inc.

    Abstract: According to one aspect, provided is a horizontally scaled database architecture. Partition a database enables efficient distribution of data across a number of systems reducing processing costs associated with multiple machines. According to some aspects, the partitioned database can be managed as a single source interface to handle client requests. Further, it is realized that by identifying and testing key properties, horizontal scaling architectures can be implemented and operated with minimal overhead. In one embodiment, databases can be partitioned in an order preserving manner such that the overhead associated with moving the data for a given partition can be minimized during management of the data and/or database. In one embodiment, splits and migrations operations prioritize zero cost partitions, thereby, reducing computational burden associated with managing a partitioned database.

    Systems and methods for managing distributed database deployments

    公开(公告)号:US10740353B2

    公开(公告)日:2020-08-11

    申请号:US15627613

    申请日:2017-06-20

    Applicant: MongoDB, Inc.

    Abstract: Various aspects provide for implementation of a cloud service for running, monitoring, and maintaining cloud distributed database deployments and in particular examples, provides cloud based services to run, monitor and maintain deployments of the known MongoDB database. Various embodiments provide services, interfaces, and manage provisioning of dedicated servers for the distributed database instances (e.g., MongoDB instances). Further aspects, including providing a database as a cloud service that eliminates the design challenges associated with many distributed database implementations, while allowing the client's input on configuration choices in building the database. In some implementations, clients can simply identity a number of database nodes, capability of the nodes, and within minutes have a fully functioning, scalable, replicated, and secure distributed database in the cloud.

    Systems and methods for generating partial indexes in distributed databases

    公开(公告)号:US10585867B2

    公开(公告)日:2020-03-10

    申请号:US15605196

    申请日:2017-05-25

    Applicant: MongoDB, Inc.

    Abstract: According to one aspect, methods and systems are provided for creating partial indexes in a distributed database environment. The database includes an index engine configured to receive at least one index field, a criteria field, and a criteria condition, wherein the criteria field is not included in the at least one index field; and generate an index comprising the at least one index field from at least one record of the plurality of records and a pointer to the at least one record of the plurality of records, wherein the criteria field of the at least one record of the plurality of records satisfies the criteria condition. The database further includes a query engine configured to receive a search query containing the at least one index field; and search the index for the at least one index field.

    System and method for determining exact location results using hash encoding of multi-dimensioned data

    公开(公告)号:US10572465B2

    公开(公告)日:2020-02-25

    申请号:US15482419

    申请日:2017-04-07

    Applicant: MongoDB, Inc.

    Abstract: Aspects of the present invention are directed to system and methods for optimizing identification of locations within a search area using hash values. A hash value represents location information in a single dimension format. Computing points around some location includes calculating an identification boundary that surrounds the location of interest based on the location's hash value. The identification boundary is expanded until it exceeds a search area defined by the location and a distance. Points around the location can be identified based on having associated hash values that fall within the identification boundary. Hashing operations let a system reduce the geometric work (i.e. searching inside boundaries) and processing required, by computing straightforward operations on hash quantities (e.g. searching a linear range of geohashes), instead of, for example, point to point comparisons.

    NON-BLOCKING SECONDARY READS
    58.
    发明申请

    公开(公告)号:US20190391881A1

    公开(公告)日:2019-12-26

    申请号:US16442355

    申请日:2019-06-14

    Applicant: MongoDB, Inc.

    Inventor: Eliot Horowitz

    Abstract: Described herein are embodiments of a database system. The database system receives a read command to read at least some stored data. The database system may generate a plurality of snapshots of data stored in a first data storage node of a plurality of data storage nodes. The database system may determine whether data is being written to the first data storage node. Responsive to determining that data is not being written to the first data storage node, the database system may process the read command at least in part by reading at least some data stored on the first data storage node. Responsive to determining that data is being written to the first data storage node, the database system may process the read command at least in part by reading at least some data from a snapshot of the plurality of snapshots.

    Systems and methods for data conversion and comparison

    公开(公告)号:US10430433B2

    公开(公告)日:2019-10-01

    申请号:US15390351

    申请日:2016-12-23

    Applicant: MongoDB, Inc.

    Abstract: According to one embodiment, a translation component is configured to operate on document encoded data to translate the document encoded data into a canonical format comprising a plurality of canonical types that fold together into a byte stream. The translation component is configured to accept any storage format of data (e.g., column store, row store, LSM tree, etc. and/or data from any storage engine, WIREDTIGER, MMAP, AR tree, Radix tree, etc.) and translate that data into a byte stream to enable efficient comparison. When executing searches and using the translated data to provide comparisons there is necessarily a trade-off based on the cost of translating the data and how much the translated data can be leveraged to increase comparison efficiency.

    LARGE DISTRIBUTED DATABASE CLUSTERING SYSTEMS AND METHODS

    公开(公告)号:US20180096045A1

    公开(公告)日:2018-04-05

    申请号:US15721176

    申请日:2017-09-29

    Applicant: MongoDB, Inc.

    CPC classification number: G06F16/278 G06F16/273

    Abstract: Systems and methods are provided for managing asynchronous replication in a distributed database environment, while providing for scaling of the distributed database. A cluster of nodes can be assigned roles for managing partitions of data within the database and processing database requests. In one embodiment, each cluster includes a node with a primary role to process write operations and mange permit asynchronous replication of the operations to at least one secondary node. Each cluster or set of nodes can host one or more partitions of database data, and grouping of the clusters can be implemented a shard cluster that hosts the data of the distributed database. Each shard can be configured to manage the size of any hosted partitions, splitting database partitions, migrated partitions, and managing expansion of shard clusters to encompass new systems.

Patent Agency Ranking