Invention Grant
- Patent Title: Identifying corrupted text segments
-
Application No.: US16001301Application Date: 2018-06-06
-
Publication No.: US10169398B2Publication Date: 2019-01-01
- Inventor: Chao Yuan Huang , Yi-Lin Tsai , Der-Joung Wang , Yen-Min Wu
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Stephen R. Yoder
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F17/24 ; G06F17/27

Abstract:
A computer program product for taking a corrective action upon determination of an existence of a corrupted text segment within a set of web pages. Determination includes: determining a language affinity indicator corresponding to text segments within the set of web pages; generating an indexing repository based on a set of text artefacts within the text segments; creating an occurrence table for the set of text artefacts; and determining compliance of the text artefacts and text segments based on the single language grouping on which the set of text segments are based.
Public/Granted literature
- US20180268021A1 IDENTIFYING CORRUPTED TEXT SEGMENTS Public/Granted day:2018-09-20
Information query