- Patent Title: Automatic generation of structured data from semi-structured data
-
Application No.: US15583966Application Date: 2017-05-01
-
Publication No.: US10467244B2Publication Date: 2019-11-05
- Inventor: Ravikiran Krishnan , Ayush Parashar , Sudeep Sarkar
- Applicant: Unifi Software
- Applicant Address: US CA San Mateo
- Assignee: UNIFI SOFTWARE, INC.
- Current Assignee: UNIFI SOFTWARE, INC.
- Current Assignee Address: US CA San Mateo
- Agency: Evergreen Valley Law Group
- Agent Kanika Radhakrishnan
- Main IPC: G06F7/02
- IPC: G06F7/02 ; G06F16/00 ; G06F16/25 ; G06F16/21 ; G06F16/22 ; G06F16/84

Abstract:
A method and system for generating structured data from semi-structured data are provided. The method includes reading a plurality of records from a data file including semi-structured data. Further, the method includes obtaining aligned delimiters in a list for every record that has been read. The method also includes selecting a most occurring delimiter from the list. The method then includes constructing a regular expression using the selected delimiter to split the records into different fields. The method also includes reconstructing the records for the regular expression to fit and split into fields. In addition, the method includes displaying the records split into the fields.
Public/Granted literature
- US20170316070A1 AUTOMATIC GENERATION OF STRUCTURED DATA FROM SEMI-STRUCTURED DATA Public/Granted day:2017-11-02
Information query