Hybrid model for short text classification with imbalanced data

Invention Grant

US11328221B2 Hybrid model for short text classification with imbalanced data 有权

Please log in to see more content

Patent Title: Hybrid model for short text classification with imbalanced data
Application No.: US16379192

Application Date: 2019-04-09
Publication No.: US11328221B2

Publication Date: 2022-05-10
Inventor: Yang Yu , Ming Tan , Ravi Nair , Haoyu Wang , Saloni Potdar
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent Stephanie L. Carusillo
Main IPC: G06F15/16
IPC: G06F15/16 ; G06N20/00 ; G06F16/35

Hybrid model for short text classification with imbalanced data

Abstract:

A method of text classification includes generating a text embedding vector representing a text sample and applying weights of a regression layer to the text embedding vector to generate a first data model output vector. The method also includes generating a plurality of prototype embedding vectors associated with a respective classification labels and comparing the plurality of prototype embedding vectors to the text embedding vector to generate a second data model output vector. The method further includes assigning a particular classification label to the text sample based on the first data model output vector, the second data model output vector, and one or more weighting values.

Public/Granted literature

US20200327445A1 HYBRID MODEL FOR SHORT TEXT CLASSIFICATION WITH IMBALANCED DATA Public/Granted day:2020-10-15

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F15/00	通用数字计算机（零部件入G06F1/00至G06F13/00组）；通用数据处理设备
G06F15/16	.两个或多个数字计算机的组合，每台计算机至少具有一个运算单元、一个程序单元和一个寄存器，例如，用于数个程序的同时处理