Invention Grant
US08463053B1 Enhanced max margin learning on multimodal data mining in a multimedia database
有权
增强多媒体数据库中多模态数据挖掘的最大利润率学习
- Patent Title: Enhanced max margin learning on multimodal data mining in a multimedia database
- Patent Title (中): 增强多媒体数据库中多模态数据挖掘的最大利润率学习
-
Application No.: US12538845Application Date: 2009-08-10
-
Publication No.: US08463053B1Publication Date: 2013-06-11
- Inventor: Zhen Guo , Zhongfei (Mark) Zhang
- Applicant: Zhen Guo , Zhongfei (Mark) Zhang
- Applicant Address: US NY Binghamton
- Assignee: The Research Foundation of State University of New York
- Current Assignee: The Research Foundation of State University of New York
- Current Assignee Address: US NY Binghamton
- Agency: Ostrolenk Faber LLP
- Agent Steven M. Hoffberg
- Main IPC: G06K9/62
- IPC: G06K9/62

Abstract:
Multimodal data mining in a multimedia database is addressed as a structured prediction problem, wherein mapping from input to the structured and interdependent output variables is learned. A system and method for multimodal data mining is provided, comprising defining a multimodal data set comprising image information; representing image information of a data object as a set of feature vectors in a feature space; clustering in the feature space to group similar features; associating a non-image representation with a respective image data object based on the clustering; determining a joint feature representation of a respective data object as a mathematical weighted combination of a set of components of the joint feature representation; optimizing a weighting for a plurality of components of the mathematical weighted combination with respect to a prediction error between a predicted classification and a training classification; and employing the mathematical weighted combination for automatically classifying a new data object.
Information query