Invention Grant
- Patent Title: Cross-media search method
-
Application No.: US16314673Application Date: 2016-12-01
-
Publication No.: US10719664B2Publication Date: 2020-07-21
- Inventor: Wenmin Wang , Liang Han , Mengdi Fan , Ronggang Wang , Ge Li , Shengfu Dong , Zhenyu Wang , Ying Li , Hui Zhao , Wen Gao
- Applicant: Peking University Shenzhen Graduate School
- Applicant Address: CN Shenzhen
- Assignee: Peking University Shenzhen Graduate School
- Current Assignee: Peking University Shenzhen Graduate School
- Current Assignee Address: CN Shenzhen
- Agency: SV Patent Service
- Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@3b7318b
- International Application: PCT/CN2016/108196 WO 20161201
- International Announcement: WO2018/010365 WO 20180118
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06T11/60 ; G06F40/30 ; G06F40/216 ; G06F40/284 ; G06N20/00 ; G06K9/00 ; G06K9/62 ; G06N3/08 ; G06N7/00

Abstract:
A cross-media search method using a VGG convolutional neural network (VGG net) to extract image features. The 4096-dimensional feature of a seventh fully-connected layer (fc7) in the VGG net, after processing by a ReLU activation function, serves as image features. A Fisher Vector based on Word2vec is utilized to extract text features. Semantic matching is performed on heterogeneous images and the text features by means of logistic regression. A correlation between the two heterogeneous features, which are images and text, is found by means of semantic matching based on logistic regression, and thus cross-media search is achieved. The feature extraction method can effectively indicate deep semantics of image and text, improve cross-media search accuracy, and thus greatly improve the cross-media search effect.
Public/Granted literature
- US20190205393A1 A cross-media search method Public/Granted day:2019-07-04
Information query