Invention Grant
- Patent Title: Moving window data deduplication in distributed storage
-
Application No.: US17007495Application Date: 2020-08-31
-
Publication No.: US11442911B2Publication Date: 2022-09-13
- Inventor: Pavlo Padinker , Pavan Edara , Bigang Li
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Lerner, David, Littenberg, Krumholz & Mentlik, LLP
- Main IPC: G06F16/215
- IPC: G06F16/215 ; G06F16/22 ; G06F16/23 ; G06F12/0804

Abstract:
The present disclosure describes a service which provides primary in-line deduplication. A streaming application program interface (API) may allow for streaming records into a storage system with high throughput and low latency. As part of this process, the API allows user to add identifiers as a field used for data deduplication. The deduplication service keeps a moving window of the identifiers in memory and does in-line deduplication by quickly determining whether data is a duplicate. Keeping only deduplication keys in memory reduces the cost of running the service. Moreover, the real-time nature of the moving window approach allows for storing deduplication information alongside the data and accessing it immediately on read. In this regard, read after write consistency is supported, and costs are reduced.
Public/Granted literature
- US20220067006A1 Moving Window Data Deduplication in Distributed Storage Public/Granted day:2022-03-03
Information query