Interpreting text classification predictions through deterministic extraction of prominent n-grams
Abstract:
Provided are a computer program product, system, and method for interpreting text classification predictions through deterministic extraction of prominent n-grams. A determination is made of n-gram vectors comprising word embeddings of n-grams in a document and of a document vector comprising word embeddings of the document. A label is received from the text classifier program, comprising a text classification of the document. A determination is made of a label vector comprising word embeddings of the label. The n-gram vectors, the document vector, and the label vector are used to determine n-grams that explain the text classification of the text classifier program.
Information query
Patent Agency Ranking
0/0