Invention Publication
- Patent Title: WORD AWARE CONTENT DEFINED CHUNKING
-
Application No.: US18392968Application Date: 2023-12-21
-
Publication No.: US20240126733A1Publication Date: 2024-04-18
- Inventor: Philip N. Shilane
- Applicant: EMC IP Holding Company LLC
- Applicant Address: US MA Hopkinton
- Assignee: EMC IP Holding Company LLC
- Current Assignee: EMC IP Holding Company LLC
- Current Assignee Address: US MA Hopkinton
- Main IPC: G06F16/215
- IPC: G06F16/215 ; G06F16/22 ; G06F16/242

Abstract:
One example method includes, in a data buffer that includes one or more words and whitespaces, calculating a hash value of data in a window that is movable within the data buffer, comparing the hash value to a mask, and when the hash value matches the mask, identifying a position of the window in the data buffer as a chunk anchor position, searching for a whitespace nearest the chunk anchor position, and designating an offset of the whitespace as a segment boundary.
Public/Granted literature
- US12265513B2 Word aware content defined chunking Public/Granted day:2025-04-01
Information query