Automatic NER dictionary generation from structured business data
Abstract:
The present disclosure involves systems, software, and computer implemented methods for generating NER dictionaries from structured business data. In one example, a method may include identifying a set of structured data for use in generating a NER dictionary, identifying a particular collection within the identified structured data to use in the dictionary generation, the particular identified collection including a plurality of values, identifying at least one variant of the value for each value in the particular identified collection, each of the at least one variants of the value including at least a portion of the underlying value, determining a set of unique variants from the identified at least one variants across each of the values in the particular identified subset, adding the determined set of unique variants to the generated dictionary, each unique variant added to the generated dictionary being associated with the value from which the unique variant was derived.
Public/Granted literature
Information query
Patent Agency Ranking
0/0