Invention Grant
- Patent Title: Topic-guided model for image captioning system
- Application No.: US16473898
- Application Date: 2017-03-20
- Publication No.: US11042782B2
- Publication Date: 2021-06-22
- Inventor: Zhou Su, Jianguo Li, Anbang Yao, Yurong Chen
- Applicant: INTEL CORPORATION
- Applicant Address: US CA Santa Clara
- Assignee: INTEL CORPORATION
- Current Assignee: INTEL CORPORATION
- Current Assignee Address: US CA Santa Clara
- Agency: Hanley, Flight & Zimmerman
- International Application: PCT/CN2017/077280 WO 20170320
- International Announcement: WO2018/170671 WO 20180927
- Main IPC: G06K9/00
- IPC: G06K9/00; G06K9/62; G06T11/60; G06N3/08; G06K9/72

Abstract:
Techniques are provided for training and operation of a topic-guided image captioning system. A methodology implementing the techniques according to an embodiment includes generating image feature vectors, for an image to be captioned, based on application of a convolutional neural network (CNN) to the image. The method further includes generating the caption based on application of a recurrent neural network (RNN) to the image feature vectors. The RNN is configured as a long short-term memory (LSTM) RNN. The method further includes training the LSTM RNN with training images and associated training captions. The training is based on a combination of: feature vectors of the training image; feature vectors of the associated training caption; and a multimodal compact bilinear (MCB) pooling of the training caption feature vectors and an estimated topic of the training image. The estimated topic is generated by an application of the CNN to the training image.
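The abstract names multimodal compact bilinear (MCB) pooling as the step that combines the training caption feature vectors with the estimated topic. The following is a minimal NumPy sketch of the generic MCB pooling technique, not the patented implementation: each input vector is projected with a count sketch, and the two sketches are multiplied element-wise in the frequency domain, which approximates the outer product of the inputs in a compact vector. The function names (`make_hash`, `count_sketch`, `mcb_pool`), the output dimension, and the random input vectors are assumptions chosen for illustration only.

```python
import numpy as np

def make_hash(n, d, rng):
    """Draw a random bucket index in [0, d) and a random sign for each of n inputs."""
    h = rng.integers(0, d, size=n)
    s = rng.choice([-1.0, 1.0], size=n)
    return h, s

def count_sketch(x, h, s, d):
    """Project x into d dimensions by summing signed entries into hashed buckets."""
    out = np.zeros(d)
    np.add.at(out, h, s * x)  # unbuffered add, so colliding buckets accumulate
    return out

def mcb_pool(x, y, hx, sx, hy, sy, d):
    """Circularly convolve the count sketches of x and y via the FFT."""
    fx = np.fft.rfft(count_sketch(x, hx, sx, d))
    fy = np.fft.rfft(count_sketch(y, hy, sy, d))
    return np.fft.irfft(fx * fy, n=d)

# Hypothetical stand-ins for a caption feature vector and a topic vector.
rng = np.random.default_rng(0)
caption_vec = rng.standard_normal(32)
topic_vec = rng.standard_normal(10)
d = 64  # illustrative output dimension

hx, sx = make_hash(caption_vec.shape[0], d, rng)
hy, sy = make_hash(topic_vec.shape[0], d, rng)
pooled = mcb_pool(caption_vec, topic_vec, hx, sx, hy, sy, d)
```

In a trained system the hash indices and signs would be drawn once and then held fixed, so that the same projection is applied to every training pair.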
Public/Granted literature
- US20190340469A1 TOPIC-GUIDED MODEL FOR IMAGE CAPTIONING SYSTEM Public/Granted day: 2019-11-07