Identification of domain information for use in machine learning models
Abstract:
A device may analyze a set of unstructured documents of an organization associated with a domain to identify a first set of entities. The device may analyze a set of semi-structured documents of the organization to determine a second set of entities. The device may filter the first set of entities using the second set of entities. Filtering the first set of entities may include removing, from the first set of entities, one or more entities that do not satisfy a threshold level of similarity with entities included in the second set of entities. The device may consolidate the filtered first set of entities and the second set of entities to identify a set of key entities. The device may provide the set of key entities to a user device to allow the set of key entities to be annotated and used for one or more machine learning models.
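The filtering and consolidation steps described above may be sketched as follows. This is a minimal illustration, not the claimed implementation: the abstract does not name a similarity measure, so the sketch assumes character-level string similarity from Python's standard `difflib` module. The function names (`similarity`, `filter_entities`, `consolidate`), the 0.8 threshold, and the sample entities are all hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (assumed stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def filter_entities(first, second, threshold=0.8):
    """Keep entities from `first` that meet the similarity threshold
    with at least one entity in `second`; the rest are removed."""
    return [
        entity for entity in first
        if any(similarity(entity, ref) >= threshold for ref in second)
    ]


def consolidate(filtered_first, second):
    """Merge both sets into one list of key entities, dropping
    case-insensitive duplicates and preserving first-seen order."""
    seen = set()
    key_entities = []
    for entity in list(second) + list(filtered_first):
        key = entity.lower()
        if key not in seen:
            seen.add(key)
            key_entities.append(entity)
    return key_entities


# Hypothetical entities: the first set from unstructured documents,
# the second set from semi-structured documents.
unstructured = ["invoice number", "purchase order", "coffee machine", "Vendor name"]
semi_structured = ["Invoice Number", "Purchase Order", "Vendor Name"]

filtered = filter_entities(unstructured, semi_structured)
key_entities = consolidate(filtered, semi_structured)
```

In this sketch, "coffee machine" is removed because it does not resemble any entity from the semi-structured documents, and the remaining entities collapse into a single consolidated list. A real system might instead compare entities with embeddings or domain ontologies; the abstract leaves that choice open.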