Coarse-to-fine multimodal gallery search system with attention-based neural network models
Abstract:
A method, computer program, and computer system is provided for multimodal content retrieval. A search query corresponding to a request for content is received. Content features corresponding to a subset of content items from among a plurality of content items are retrieved based on receiving the search query. Similarity values are calculated between the search query and the retrieved content features. Attention scores are determined for the calculated similarity values. A content item is selected from among the subset of content items of the plurality of content items. The selected content item contains a content feature corresponding to a highest attention score of the attention scores.
Information query
Patent Agency Ranking
0/0