Invention Grant
- Patent Title: Systems and methods for extracting patient diagnostics from disparate
-
Application No.: US16855682Application Date: 2020-04-22
-
Publication No.: US11887731B1Publication Date: 2024-01-30
- Inventor: Michael Gallagher , Michael Capstick , Matthew Moran
- Applicant: Select Rehabilitation, Inc.
- Applicant Address: US IL Glenview
- Assignee: SELECT REHABILITATION, INC.
- Current Assignee: SELECT REHABILITATION, INC.
- Current Assignee Address: US IL Glenview
- Agency: Baker, Donelson, Bearman, Caldwell & Berkowitz, P.C.
- Main IPC: G06F40/20
- IPC: G06F40/20 ; G16H50/20 ; G16H10/60 ; G16H10/20 ; G16H10/40 ; G16H70/20 ; G06N3/08 ; G06F40/30 ; G06V30/413 ; G06N3/045 ; G06N3/047 ; G06V30/18

Abstract:
A method is described herein that comprises receiving scanned documents, wherein the scanned documents comprise unstructured data. The method includes performing optical character recognition of the scanned documents to produce text data for each page of the scanned documents, wherein the text data for each page comprises a sequence of words stored together with their location. The method includes dividing each page of the scanned documents into subsections. The method includes using the text data to identify a structure type of each subsection of a page, wherein the structure type includes at least one of a table and text paragraph. The method includes using the text data to label each subsection of a page with a semantic type, wherein the semantic type defines a context surrounding collection of information in a subsection. The method includes using the text data for each subsection of a page to identify medical concepts.
Information query