Identifying information in plain text narratives in EMRs
Abstract:
A clinical information extraction and training mechanism is provided for automatically extracting and identifying information in plain text narratives in a set of electronic medical records (EMRs). The mechanism segments each clinical note in a plurality of clinical notes into one or more identified sections, labels each identified section with an associated tag, and generates a tag data structure utilizing explicitly tagged sequences of sentences and associated tags. The mechanism performs statistical analysis of the identified sections that contain tags identified in the tag data structure to identify one or more valid stop/start conditions, extracts a first set of positive examples of sentences for a selected type of information, and then trains a cognitive system, using the positive examples of sentences for different types of information, to identify sentences in the plurality of clinical notes that fail to have a tag associated with the selected type.
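The segmentation, tagging, and positive-example steps described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: it assumes, purely for illustration, that a section is opened by a header line ending in a colon (for example "Allergies:"), and the names `HEADER`, `segment_note`, `build_tag_index`, and `positive_examples` are hypothetical. The statistical analysis of stop/start conditions and the training of the cognitive system are not shown.

```python
import re
from collections import defaultdict

# Assumption: a line such as "Allergies:" explicitly opens a tagged section.
HEADER = re.compile(r"^([A-Za-z][A-Za-z /]*):\s*$")

def segment_note(note):
    """Split one clinical note into (tag, sentences) pairs.

    Text that appears before any header is kept with tag None (untagged).
    """
    sections = []
    tag, lines = None, []
    for raw in note.splitlines():
        line = raw.strip()
        match = HEADER.match(line)
        if match:
            # A new header closes the section collected so far.
            if lines:
                sections.append((tag, lines))
            tag, lines = match.group(1).lower(), []
        elif line:
            lines.append(line)
    if lines:
        sections.append((tag, lines))
    return sections

def build_tag_index(notes):
    """Build a tag data structure: tag -> explicitly tagged sentences."""
    index = defaultdict(list)
    for note in notes:
        for tag, sentences in segment_note(note):
            if tag is not None:
                index[tag].extend(sentences)
    return index

def positive_examples(index, tag):
    """Return the sentences explicitly tagged with the selected type."""
    return list(index.get(tag, []))
```

In this sketch, the sentences returned by `positive_examples` would serve as training data for a classifier that labels sentences in notes lacking the corresponding header; the untagged sections (tag `None`) are the candidates such a classifier would be applied to.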