Invention Grant
- Patent Title: Computerized methods of data compression and analysis
-
Application No.: US16428675Application Date: 2019-05-31
-
Publication No.: US11269810B2Publication Date: 2022-03-08
- Inventor: Takashi Suzuki
- Applicant: Takashi Suzuki
- Applicant Address: US AZ Scottsdale
- Assignee: Takashi Suzuki
- Current Assignee: Takashi Suzuki
- Current Assignee Address: US AZ Scottsdale
- Agency: Snell & Wilmer L.L.P.
- Main IPC: G06F16/174
- IPC: G06F16/174 ; H03M7/30 ; G06F16/951 ; G06F16/22 ; G06F16/00

Abstract:
A computerized method and apparatus compresses symbolic information, such as text. Symbolic information is compressed by recursively identifying pairs of symbols (e.g., pairs of words or characters) and replacing each pair with a respective replacement symbol. The number of times each symbol pair appears in the uncompressed text is counted, and pairs are only replaced if they appear more than a threshold number of times. In recursive passes, each replaced pair can include a previously substituted replacement symbol. The method and apparatus can achieve high compression especially for large datasets. Metadata, such as the number of times each pair appears, generated during compression of the documents can be used to analyze the documents and find similarities between two documents.
Public/Granted literature
- US20190286618A1 Computerized methods of data compression and analysis Public/Granted day:2019-09-19
Information query