Invention Grant
- Patent Title: Enhancement of massive data ingestion by similarity linkage of documents
- Application No.: US15850674
- Application Date: 2017-12-21
- Publication No.: US11049024B2
- Publication Date: 2021-06-29
- Inventor: Paul R. Bastide, Matthew E. Broomhall, Robert E. Loredo, Dale M. Schultz
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agent: David Spalding
- Main IPC: G06F17/00
- IPC: G06F17/00; G06N5/02; G06F40/106

Abstract:
A method for ingesting a plurality of content according to a statistical similarity of at least one portion of the ingested plurality of content into an information handling system capable of answering questions, whereby the ingested plurality of content is based on a received topic and ingesting the plurality of content comprises ingesting a plurality of documents associated with the received topic is provided. The method may include determining at least one similarity between each document based on a similarity criteria. The method may also include applying a statistical model to characterize the determined at least one similarity between each document. The method may further include creating at least one pair-wise link for each document. The method may additionally include mapping the created at least one pair-wise link. The method may include generating a plurality of rules for ingesting a plurality of additional content.
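The first four steps named in the abstract (determine similarity, characterize it statistically, create pair-wise links, map the links) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the similarity criteria is taken to be token-set Jaccard overlap, the statistical model is taken to be the mean and population standard deviation of all pair scores, and the names `jaccard`, `link_documents`, and the parameter `z` are hypothetical. The rule-generation step is not sketched because the abstract gives no detail on it.

```python
from itertools import combinations
from statistics import mean, pstdev


def jaccard(a: str, b: str) -> float:
    """Token-set Jaccard overlap (assumed stand-in for the similarity criteria)."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a and not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def link_documents(docs: dict[str, str], z: float = 0.0):
    """Return (links, link_map) for a collection of named documents.

    links maps each linked pair (name_a, name_b) to its similarity score;
    link_map maps each document name to the set of names it is linked to.
    """
    # Step 1: similarity for every unordered pair of documents.
    scores = {
        (p, q): jaccard(docs[p], docs[q])
        for p, q in combinations(sorted(docs), 2)
    }
    link_map = {name: set() for name in docs}
    if not scores:
        return {}, link_map

    # Step 2: characterize the scores statistically (assumed model:
    # mean plus z population standard deviations as a cut-off).
    values = list(scores.values())
    threshold = mean(values) + z * pstdev(values)

    # Step 3: create a pair-wise link for each pair scoring above the cut-off.
    links = {pair: s for pair, s in scores.items() if s > threshold}

    # Step 4: map the links into an adjacency structure.
    for p, q in links:
        link_map[p].add(q)
        link_map[q].add(p)
    return links, link_map


if __name__ == "__main__":
    corpus = {
        "a": "solar panel energy output",
        "b": "solar panel energy storage",
        "c": "medieval bread recipes",
    }
    links, link_map = link_documents(corpus)
    print(links)
    print(link_map)
```

In this toy corpus only documents "a" and "b" share tokens, so they are the only pair that clears the cut-off. Raising `z` makes linking stricter; a real system would likely use a richer similarity measure than token overlap.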
Public/Granted literature
- US20180121812A1 ENHANCEMENT OF MASSIVE DATA INGESTION BY SIMILARITY LINKAGE OF DOCUMENTS (Public/Granted day: 2018-05-03)