Invention Grant
- Patent Title: Auto-generating ground truth on clinical text by leveraging structured electronic health record data
-
Application No.: US16814896Application Date: 2020-03-10
-
Publication No.: US11782942B2Publication Date: 2023-10-10
- Inventor: Jennifer J Liang , Diwakar Mahajan
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Otterstedt & Kammer PLLC
- Agent Kristofer Haggerty
- Main IPC: G06F16/25
- IPC: G06F16/25 ; G16H10/60 ; G06F16/332 ; G06F16/242 ; G06N20/00 ; G06F40/30 ; G06F40/295 ; G06F40/40 ; G06F16/383

Abstract:
A method improves performance of natural language processing by automatically generating ground truth from electronic health records comprising unstructured clinical notes and structured data comprising entries each having respective values for fields. The method includes: linking a given one of the notes to a given one of the entries responsive to determining that a specified field within the given entry matches an item of metadata for the given note; determining an initial set of the notes which satisfy criteria selected such that the criteria are a proxy for the ground truth, wherein the given note is determined to satisfy the criteria based at least in part on the given entry linked thereto; and designating at least a portion of the initial set of notes which satisfy the criteria, and the entries linked to the portion of the initial set of notes which satisfy the criteria, as the ground truth.
Public/Granted literature
- US20210286821A1 AUTO-GENERATING GROUND TRUTH ON CLINICAL TEXT BY LEVERAGING STRUCTURED ELECTRONIC HEALTH RECORD DATA Public/Granted day:2021-09-16
Information query