Invention Grant
- Patent Title: Image captioning with weakly-supervised attention penalty
-
Application No.: US17501199Application Date: 2021-10-14
-
Publication No.: US11775838B2Publication Date: 2023-10-03
- Inventor: Jiayun Li , Mohammad K. Ebrahimpour , Azadeh Moghtaderi , Yen-Yun Yu
- Applicant: Ancestry.com Operations Inc.
- Applicant Address: US UT Lehi
- Assignee: Ancestry.com Operations Inc.
- Current Assignee: Ancestry.com Operations Inc.
- Current Assignee Address: US UT Lehi
- Agency: Keller Preece PLLC
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/084 ; G06F18/214 ; G06F18/21 ; G06N3/044 ; G06N3/045 ; G06V10/764 ; G06V10/778 ; G06V10/82 ; G06V10/44 ; G06V20/70 ; G06V20/20

Abstract:
Techniques for training a machine-learning (ML) model for captioning images are disclosed. A plurality of feature vectors and a plurality of visual attention maps are generated by a visual model of the ML model based on an input image. Each of the plurality of feature vectors correspond to different regions of the input image. A plurality of caption attention maps are generated by an attention model of the ML model based on the plurality of feature vectors. An attention penalty is calculated based on a comparison between the caption attention maps and the visual attention maps. A loss function is calculated based on the attention penalty. One or both of the visual model and the attention model are trained using the loss function.
Public/Granted literature
- US20220067438A1 IMAGE CAPTIONING WITH WEAKLY-SUPERVISED ATTENTION PENALTY Public/Granted day:2022-03-03
Information query