Computer implemented systems and methods for processing semi-structured documents
Abstract:
The present disclosure includes techniques pertaining to computer-implemented systems and methods for processing semi-structured data. In one embodiment, semi-structured data in a data source is analyzed and structural information about the data is generated. The data may comprise a plurality of semi-structured documents, each comprising a plurality of fields having associated values. The documents may be analyzed to determine the particular fields that appear in the plurality of semi-structured documents. Statistics may be generated to determine, for example, the frequency with which each particular field appears in the plurality of semi-structured documents. The particular fields, their associated frequencies, and other statistics may be stored in a memory.
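The field-frequency analysis described above can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes the semi-structured documents are JSON-like Python dictionaries, that nested fields are identified by dotted paths, and that "frequency" means the fraction of documents in which a field appears at least once. The names `field_paths` and `field_frequencies` are hypothetical and do not come from the disclosure.

```python
from collections import Counter
from typing import Any, Dict, Iterable, Mapping, Set


def field_paths(doc: Mapping[str, Any], prefix: str = "") -> Set[str]:
    """Collect the dotted path of every field in one document (assumed dict-like)."""
    paths: Set[str] = set()
    for key, value in doc.items():
        path = f"{prefix}.{key}" if prefix else key
        paths.add(path)
        # Descend into nested objects so that sub-fields are counted too.
        if isinstance(value, Mapping):
            paths |= field_paths(value, path)
    return paths


def field_frequencies(docs: Iterable[Mapping[str, Any]]) -> Dict[str, float]:
    """Return, for each field, the fraction of documents that contain it."""
    docs = list(docs)
    counts: Counter = Counter()
    for doc in docs:
        # Each document contributes at most one count per field.
        counts.update(field_paths(doc))
    total = len(docs)
    if total == 0:
        return {}
    return {field: n / total for field, n in counts.items()}


# Illustrative input only; these documents are made up for the example.
docs = [
    {"name": "a", "address": {"city": "x"}},
    {"name": "b", "email": "b@example.com"},
    {"name": "c", "address": {"city": "y", "zip": "1"}},
    {"name": "d"},
]
frequencies = field_frequencies(docs)
```

With these four sample documents, `frequencies["name"]` is 1.0, `frequencies["address.city"]` is 0.5, and `frequencies["email"]` is 0.25. The resulting dictionary stands in for the stored fields and statistics; a real system might persist it to a database or catalog instead.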