Invention Grant
- Patent Title: Authorship enhanced corpus ingestion for natural language processing
-
Application No.: US15289265Application Date: 2016-10-10
-
Publication No.: US10795922B2Publication Date: 2020-10-06
- Inventor: Paul R. Bastide , Matthew E. Broomhall , Robert E. Loredo , Fang Lu
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Stephen J. Walder, Jr.; David B. Woycechowsky
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F16/33 ; G06F16/24 ; G06F40/205 ; G06F3/0482

Abstract:
Mechanisms for processing a corpus of information in a natural language processing system are provided. A corpus of information to process is identified and a set of author profiles associated with the corpus of information is retrieved. A content profile is generated for a portion of content of the corpus of information and the content profile is compared to the set of author profiles to generate an association of the content profile with at least one author profile in the set of author profiles. In addition, a processing operation of the natural language processing (NLP) system is controlled based on the association of the content profile with the at least one author profile.
Public/Granted literature
- US20170024463A1 Authorship Enhanced Corpus Ingestion for Natural Language Processing Public/Granted day:2017-01-26
Information query