Invention Grant
- Patent Title: Methods, apparatus and systems for annotation of text documents
-
Application No.: US16815873Application Date: 2020-03-11
-
Publication No.: US11263391B2Publication Date: 2022-03-01
- Inventor: Christopher Potts , Evan Lin , Andrew Maas , Abhilash Itharaju , Kevin Reschke , Jordan Vincent
- Applicant: PAREXEL International, LLC
- Applicant Address: US MA Waltham
- Assignee: PAREXEL International, LLC
- Current Assignee: PAREXEL International, LLC
- Current Assignee Address: US MA Waltham
- Agency: Wolf, Greenfield & Sacks, P.C.
- Main IPC: G06F40/169
- IPC: G06F40/169 ; G06F3/04842 ; G16H10/60

Abstract:
Methods and apparatus to facilitate annotation projects to extract structured information from free-form text using NLP techniques. Annotators explore text documents via automated preannotation functions, flexibly formulate annotation schemes and guidelines, annotate text, and adjust annotation labels, schemes and guidelines in real-time as a project evolves. NLP models are readily trained on iterative annotations of sample documents by domain experts in an active learning workflow. Trained models are then employed to automatically annotate a larger body of documents in a project dataset. Experts in a variety of domains can readily develop an annotation project for a specific use-case or business question. In one example, documents relating to the health care domain are effectively annotated and employed to train sophisticated NLP models that provide valuable insights regarding many facets of health care. In another example, annotation methods are enhanced by utilizing domain-specific information derived from a novel knowledge graph architecture.
Public/Granted literature
- US20200293712A1 METHODS, APPARATUS AND SYSTEMS FOR ANNOTATION OF TEXT DOCUMENTS Public/Granted day:2020-09-17
Information query