Invention Grant
- Patent Title: Searchable data structure for electronic documents
-
Application No.: US18054787Application Date: 2022-11-11
-
Publication No.: US12032605B2Publication Date: 2024-07-09
- Inventor: William McNeill
- Applicant: SparkCognition, Inc.
- Applicant Address: US TX Austin
- Assignee: SPARKCOGNITION, INC.
- Current Assignee: SPARKCOGNITION, INC.
- Current Assignee Address: US TX Austin
- Agency: Moore IP Law
- Main IPC: G06F16/31
- IPC: G06F16/31 ; G06F40/117 ; G06F40/137 ; G06F40/284 ; G06F40/30 ; G06V30/412 ; G06V30/414

Abstract:
A method includes obtaining, at a device, a hierarchical structure representing a graphical layout of content items of an electronic document, the content items including at least text. The method also includes generating a word embedding representing a word of the electronic document. The method further includes determining position information of a location of the word in the electronic document. The method also includes determining a descriptor that indicates a relationship of the location to the hierarchical structure. The method further includes providing input data to a machine learning model to generate a semantic region category label of a semantic region of the electronic document. The semantic region includes the word. The input data includes the word embedding, the position information, and the descriptor.
Public/Granted literature
- US20230153335A1 SEARCHABLE DATA STRUCTURE FOR ELECTRONIC DOCUMENTS Public/Granted day:2023-05-18
Information query