Invention Grant
- Patent Title: Method and device for visual question answering, computer apparatus and medium
- Application No.: US17161466
- Application Date: 2021-01-28
- Publication No.: US11768876B2
- Publication Date: 2023-09-26
- Inventor: Xiameng Qin, Yulin Li, Qunyi Xie, Ju Huang, Junyu Han
- Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.
- Applicant Address: CN Beijing
- Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
- Current Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
- Current Assignee Address: CN Beijing
- Agency: Hamre, Schumann, Mueller & Larson, P.C.
- Priority: CN 2010616632.6 2020.06.30
- Main IPC: G06F16/9032
- IPC: G06F16/9032 ; G06F16/583 ; G06F16/532 ; G06F40/279 ; G06N3/04 ; G06N3/088 ; G06F18/213 ; G06F18/25 ; G06V10/25 ; G06V10/764 ; G06V10/80 ; G06V10/82 ; G06V10/44

Abstract:
The present disclosure provides a method for visual question answering, which relates to the fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; constructing a Visual Graph based on the input image, wherein the Visual Graph comprises a Node Feature and an Edge Feature; updating the Node Feature by using the Node Feature and the Edge Feature to obtain an updated Visual Graph; determining a question feature based on the input question; fusing the updated Visual Graph and the question feature to obtain a fused feature; and generating a predicted answer for the input image and the input question based on the fused feature. The present disclosure further provides an apparatus for visual question answering, a computer device and a non-transitory computer-readable storage medium.
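
The steps listed in the abstract can be sketched in a few lines of Python. This is a toy illustration only, not the claimed implementation: the distance-based edge weights, the single averaging update, the hashed bag-of-words question vector, the element-wise fusion, the answer weight vectors, and all function names (`build_visual_graph`, `update_nodes`, `question_feature`, `fuse`, `predict`) are assumptions made for the example. The abstract does not specify any of them.

```python
import math
import zlib

DIM = 4  # toy feature size (assumption, chosen only for the example)


def build_visual_graph(regions):
    """regions: list of (feature_vector, (cx, cy)) for detected image regions."""
    nodes = [list(feat) for feat, _ in regions]
    # Hypothetical edge feature: closeness derived from region-center distance.
    edges = [[0.0 if i == j else 1.0 / (1.0 + math.dist(ci, cj))
              for j, (_, cj) in enumerate(regions)]
             for i, (_, ci) in enumerate(regions)]
    return nodes, edges


def update_nodes(nodes, edges):
    """One message-passing step: mix each node with its edge-weighted neighbors."""
    updated = []
    for i, node in enumerate(nodes):
        total = sum(edges[i]) or 1.0  # avoid division by zero for isolated nodes
        msg = [sum(edges[i][j] * nodes[j][d] for j in range(len(nodes))) / total
               for d in range(len(node))]
        updated.append([0.5 * a + 0.5 * b for a, b in zip(node, msg)])
    return updated


def question_feature(question, dim=DIM):
    """Hashed bag-of-words vector, L2-normalized (stand-in for a text encoder)."""
    vec = [0.0] * dim
    for token in question.lower().split():
        vec[zlib.crc32(token.encode()) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def fuse(nodes, q):
    """Mean-pool the updated nodes, then multiply element-wise by the question."""
    pooled = [sum(n[d] for n in nodes) / len(nodes) for d in range(len(q))]
    return [p * x for p, x in zip(pooled, q)]


def predict(fused, answer_weights):
    """Return the candidate answer whose weight vector scores highest."""
    return max(answer_weights,
               key=lambda a: sum(w * f for w, f in zip(answer_weights[a], fused)))


# Toy usage: two regions, one question, two candidate answers (all made up).
regions = [([1.0, 0.0, 0.0, 0.0], (0.0, 0.0)),
           ([0.0, 1.0, 0.0, 0.0], (3.0, 4.0))]
nodes, edges = build_visual_graph(regions)
updated = update_nodes(nodes, edges)
q = question_feature("is there a cat")
fused = fuse(updated, q)
answer_weights = {"yes": [1.0, 1.0, 0.0, 0.0], "no": [0.0, 0.0, 1.0, 1.0]}
answer = predict(fused, answer_weights)
```

In a real system each stage would be a learned component, for example an object detector for the regions and trained networks for the update, the question encoder and the answer classifier. The sketch only fixes the order of the data flow: graph construction, node update, question encoding, fusion, prediction.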
Public/Granted literature
- US20210406468A1 METHOD AND DEVICE FOR VISUAL QUESTION ANSWERING, COMPUTER APPARATUS AND MEDIUM; Public/Granted day: 2021-12-30