Invention Grant
- Patent Title: Big data processing method based on direct computation of compressed data
- Application No.: US17744833
- Application Date: 2022-05-16
- Publication No.: US11755539B2
- Publication Date: 2023-09-12
- Inventor: Feng Zhang, Xiaoyong Du
- Applicant: Renmin University of China
- Applicant Address: CN Beijing
- Assignee: RENMIN UNIVERSITY OF CHINA
- Current Assignee: RENMIN UNIVERSITY OF CHINA
- Current Assignee Address: CN Beijing
- Agency: HAUPTMAN HAM, LLP
- Priority: CN 2110301350.1 2021.03.22
- Main IPC: G06F16/174
- IPC: G06F16/174

Abstract:
A big data processing method based on direct computation of compressed data. The method includes: 1) compressing the original input data with a modified Sequitur compression method, according to a smallest compression granularity given by a user, and transforming the data into a directed acyclic graph (DAG) consisting of digits; and 2) determining an optimal traversal pattern and, based on that pattern, performing a top-down or a bottom-up traversal of the DAG from step 1) so that the compressed data can be processed directly. With the modified Sequitur algorithm and the top-down and bottom-up traversal strategies of the disclosure, compressed data is processed directly, yielding significant improvements in time and space with broad applicability; representations for more advanced document analytics can also be derived on this basis.
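The two traversal directions named in the abstract can be illustrated with a word count computed directly on a grammar-style DAG. The following is a minimal sketch under stated assumptions, not the patented implementation: the grammar is built by hand rather than by a Sequitur compressor, and the names `RULES`, `count_top_down`, `count_bottom_up`, and `expand` are hypothetical. Words are assumed to be mapped to integers, and a reference to another rule is written as `("R", id)`.

```python
from collections import Counter

# Hypothetical hand-built grammar standing in for compressor output.
# Plain integers are word IDs (0 = "a", 1 = "b", 2 = "c");
# ("R", n) is a reference to rule n. Rule 0 is the root.
RULES = {
    0: [("R", 1), 2, ("R", 1), ("R", 2)],  # root -> R1 c R1 R2
    1: [0, 1],                              # R1   -> a b
    2: [("R", 1), 2],                       # R2   -> R1 c
}


def is_rule(sym):
    return isinstance(sym, tuple)


def count_top_down(rules, root=0):
    """Push occurrence weights from the root down to the sub-rules,
    then count each rule's own words once, scaled by its weight."""
    # Number of incoming references per rule (with multiplicity).
    indeg = Counter()
    for body in rules.values():
        for sym in body:
            if is_rule(sym):
                indeg[sym[1]] += 1
    weight = Counter({root: 1})
    totals = Counter()
    ready = [root]  # rules whose final weight is known
    while ready:
        r = ready.pop()
        for sym in rules[r]:
            if is_rule(sym):
                child = sym[1]
                weight[child] += weight[r]
                indeg[child] -= 1
                if indeg[child] == 0:
                    ready.append(child)
            else:
                totals[sym] += weight[r]
    return totals


def count_bottom_up(rules, root=0):
    """Compute each rule's word counts once from its children's
    counts, and read the answer off the root."""
    memo = {}

    def visit(r):
        if r not in memo:
            local = Counter()
            for sym in rules[r]:
                if is_rule(sym):
                    local.update(visit(sym[1]))
                else:
                    local[sym] += 1
            memo[r] = local
        return memo[r]

    return visit(root)


def expand(rules, r=0):
    """Reference only: fully decompress a rule into its word IDs."""
    out = []
    for sym in rules[r]:
        out.extend(expand(rules, sym[1]) if is_rule(sym) else [sym])
    return out
```

In this sketch both traversals visit each rule body once and never materialize the decompressed sequence; `expand` exists only to check the result against the uncompressed data. Which direction is cheaper depends on the shape of the DAG and the task, which is presumably why the method selects a traversal pattern before processing.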
Public/Granted literature
- US20220300465A1 BIG DATA PROCESSING METHOD BASED ON DIRECT COMPUTATION OF COMPRESSED DATA Public/Granted day: 2022-09-22