Invention Grant
- Patent Title: Memory-efficient streaming count estimation for multisets
-
Application No.: US16828188Application Date: 2020-03-24
-
Publication No.: US11314730B1Publication Date: 2022-04-26
- Inventor: Andrew Borthwick , Stephen Michael Ash
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Nicholson De Vos Webster & Elliott LLP
- Main IPC: G06F16/23
- IPC: G06F16/23 ; G06F16/22

Abstract:
Techniques for memory-efficient streaming count estimation for multisets are described. A method for memory-efficient streaming count estimation for multisets may include obtaining data from a plurality of data sources, and estimating a count for one or more attributes of the data using a telescoping count-min sketch (CMS) data structure, the telescoping CMS including at least a first table and a second table, wherein count values for the data are stored in a plurality of cells of the first table and when a cell of the first table is saturated, the count values for that cell are stored in a corresponding cell of the second table determined based at least on the cell of the first table.
Information query