Invention Grant
- Patent Title: Generating tagged content from text of an electronic document
- Application No.: US18150924
- Application Date: 2023-01-06
- Publication No.: US12056434B2
- Publication Date: 2024-08-06
- Inventor: Vishank Bhatia, Xu Zhong, Thanh Long Duong, Mark Johnson, Srinivasa Phani Kumar Gadde, Vishal Vishnoi, King-Hwa Lee, Christopher Kennewick
- Applicant: Oracle International Corporation
- Applicant Address: US CA Redwood Shores
- Assignee: Oracle International Corporation
- Current Assignee: Oracle International Corporation
- Current Assignee Address: US CA Redwood Shores
- Agency: Invoke
- Main IPC: G06F40/117
- IPC: G06F40/117; G06F16/9538; G06F16/955; G06F40/134; G06F40/143; G06F40/205; G06T7/70

Abstract:
Techniques for generating formatting tags for textual content obtained from a source electronic document are disclosed. A system parses a digital file to obtain information about characters in an electronic document. The system applies tags to text generated based on the textual content of the electronic document by creating segments of textually-consecutive characters and applying corresponding text formatting style tags to the segments. The system further identifies segments of text overlapping bounding boxes in the electronic document. The system generates textual content including a segment of text and a corresponding hyperlink associated with the segment of text. The system further generates textual content by selectively applying line breaks from the source electronic document in the textual content.
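As a rough illustration of the segment-and-tag step the abstract describes, the sketch below groups textually consecutive characters that share a formatting style and wraps each group in a style tag. It is not the patented method. The `Char` record, its `bold` and `italic` fields, the `tag_segments` function, and the HTML-like `<b>`/`<i>` tags are all assumptions made for this example; the abstract does not name a data model or a tag vocabulary.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass(frozen=True)
class Char:
    """Hypothetical per-character record obtained by parsing a digital file."""
    text: str
    bold: bool = False
    italic: bool = False


def tag_segments(chars):
    """Group consecutive characters with the same style and tag each segment.

    The <b> and <i> tags are assumed for illustration only.
    """
    out = []
    # groupby merges only adjacent items with equal keys, so each run is a
    # segment of textually consecutive characters sharing one style.
    for (bold, italic), run in groupby(chars, key=lambda c: (c.bold, c.italic)):
        segment = "".join(c.text for c in run)
        if italic:
            segment = f"<i>{segment}</i>"
        if bold:
            segment = f"<b>{segment}</b>"
        out.append(segment)
    return "".join(out)


chars = [
    Char("H", bold=True),
    Char("i", bold=True),
    Char(" "),
    Char("y", italic=True),
    Char("o", italic=True),
]
print(tag_segments(chars))
```

Tagging whole runs rather than individual characters keeps the output compact: a style that spans many characters produces one pair of tags instead of one pair per character.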
Public/Granted literature
- US20240061992A1 GENERATING TAGGED CONTENT FROM TEXT OF AN ELECTRONIC DOCUMENT (Public/Granted day: 2024-02-22)