Invention Grant
- Patent Title: Contextual interestingness ranking of documents for due diligence in the banking industry with entity grouping
- Application No.: US16198708
- Application Date: 2018-11-21
- Publication No.: US11593385B2
- Publication Date: 2023-02-28
- Inventor: Mandar Mutalikdesai, Arjun Das, Ratnanu Ghosh-Roy, Sudarsan Lakshminarayanan, Veerababu Moodu, Raunak Swarnkar, Anagha M, Shrishti Aggarwal, Lavina Durgani
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent: Mark Bergner
- Main IPC: G06F16/2457
- IPC: G06F16/2457; G06F40/30; G06F40/279

Abstract:
Documents that need to be analyzed for various reasons, such as the investigation of financial crimes, are ranked by examining the topicality and sentiment present in each document for a given subject of interest. In one approach, a given document is classified to determine its category, and entity recognition is used to identify the subject of interest. Passages from the document that relate to the entity are grouped and analyzed for sentiment to generate a sentiment score. Documents are then ranked based on the sentiment scores. In another approach, a classification probability score is computed for each passage, representing the likelihood that the passage relates to a category of interest, and the document is ranked based on the sentiment scores and the classification probability scores. The category classification uses an ensemble of natural language text classifiers. One of the classifiers is a naïve Bayes classifier with feature vectors generated using Word2Vec modeling.
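
The second approach in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how sentiment scores and classification probability scores are combined, so the formula below (the mean over passages of negative sentiment weighted by category probability) is an assumption made purely for illustration. The names `Passage`, `Document`, `interestingness`, and `rank_documents` are hypothetical, and the per-passage scores are assumed to have been produced already by upstream sentiment and classification models.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List


@dataclass
class Passage:
    text: str
    sentiment: float      # assumed range [-1, 1]; negative means adverse
    category_prob: float  # assumed range [0, 1]; P(passage is in the category of interest)


@dataclass
class Document:
    doc_id: str
    passages: List[Passage]  # passages already grouped for the entity of interest


def interestingness(doc: Document) -> float:
    """Illustrative score: mean of (adverse sentiment x category probability).

    The combining formula is an assumption; the source only states that
    both kinds of score feed into the ranking.
    """
    if not doc.passages:
        return 0.0
    return mean(max(0.0, -p.sentiment) * p.category_prob for p in doc.passages)


def rank_documents(docs: List[Document]) -> List[Document]:
    """Return documents ordered from most to least interesting."""
    return sorted(docs, key=interestingness, reverse=True)


if __name__ == "__main__":
    docs = [
        Document("a", [Passage("p1", -0.8, 0.9), Passage("p2", -0.4, 0.5)]),
        Document("b", [Passage("p3", 0.6, 0.9)]),
        Document("c", [Passage("p4", -0.9, 0.2)]),
    ]
    for d in rank_documents(docs):
        print(d.doc_id, round(interestingness(d), 3))
```

In this sketch a strongly adverse passage that is unlikely to belong to the category of interest contributes little, so a document is ranked highly only when adverse sentiment and topical relevance coincide. Other combinations (a weighted sum, a maximum over passages) would be equally consistent with the abstract.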