Interpreting text classification predictions through deterministic extraction of prominent n-grams

Invention Grant

US11462038B2 Interpreting text classification predictions through deterministic extraction of prominent n-grams (Active)

Patent Title: Interpreting text classification predictions through deterministic extraction of prominent n-grams
Application No.: US16740308

Application Date: 2020-01-10
Publication No.: US11462038B2

Publication Date: 2022-10-04
Inventor: Alexander Brooks, Gaurav Kumbhat
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Konrad Raynes Davda & Victor LLP
Agent: David W. Victor
Main IPC: G06V30/40
IPC: G06V30/40; G06V30/418; G06F40/20; G06V30/413; G06F40/30; G06F40/205; G06V30/414

Abstract:

Provided are a computer program product, system, and method for interpreting text classification predictions through deterministic extraction of prominent n-grams. A determination is made of n-gram vectors comprising word embeddings of n-grams in a document and of a document vector comprising word embeddings of the document. A label is received from the text classifier program, comprising a text classification of the document. A determination is made of a label vector comprising word embeddings of the label. The n-gram vectors, the document vector, and the label vector are used to determine n-grams that explain the text classification of the text classifier program.
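The abstract names the three inputs (n-gram vectors, a document vector, and a label vector) but does not say how they are combined. The following is a minimal sketch under stated assumptions, not the patented method: vectors are formed by averaging word embeddings, each n-gram is scored by the sum of its cosine similarity to the label vector and to the document vector, and the highest-scoring n-grams are returned. The embedding table, the scoring rule, and the names `mean_vector` and `prominent_ngrams` are all hypothetical.

```python
import math

# Toy 2-dimensional word embeddings, invented purely for illustration.
EMBEDDINGS = {
    "refund": [0.9, 0.1],
    "payment": [0.8, 0.2],
    "billing": [1.0, 0.0],
    "sunny": [0.0, 1.0],
    "weather": [0.1, 0.9],
    "today": [0.3, 0.7],
}


def mean_vector(words):
    """Average the embeddings of the known words (assumed pooling rule)."""
    vecs = [EMBEDDINGS[w] for w in words if w in EMBEDDINGS]
    if not vecs:
        return [0.0, 0.0]
    dim = len(vecs[0])
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def ngrams(tokens, n):
    """All contiguous n-grams of the token list, in document order."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def prominent_ngrams(tokens, label, n=2, top_k=2):
    """Rank n-grams by similarity to the label and to the whole document.

    The additive score is an assumption made for this sketch; the abstract
    only states that all three kinds of vectors are used.
    """
    doc_vec = mean_vector(tokens)
    label_vec = mean_vector(label.split())
    scored = []
    for gram in ngrams(tokens, n):
        gram_vec = mean_vector(gram)
        score = cosine(gram_vec, label_vec) + cosine(gram_vec, doc_vec)
        scored.append((score, gram))
    # sorted() is stable and there is no randomness, so the same input
    # always yields the same ranking.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [gram for _, gram in scored[:top_k]]


tokens = ["sunny", "weather", "today", "refund", "payment"]
explanation = prominent_ngrams(tokens, "billing", n=2, top_k=2)
```

Because the procedure involves no sampling or random perturbation, repeated calls on the same document and label return the same n-grams, which is the sense of "deterministic" illustrated here.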

Published/Granted documents

US20210216762A1 INTERPRETING TEXT CLASSIFICATION PREDICTIONS THROUGH DETERMINISTIC EXTRACTION OF PROMINENT N-GRAMS Publication date: 2021-07-15

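IPC classification:

G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V30/00	Character recognition; recognising digital ink; document-oriented image-based pattern recognition (scanning, transmission or reproduction of documents or the like H04N1/00)
G06V30/40	.Document-oriented image-based pattern recognition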