Query response device and method
Abstract:
A query response method includes: dividing image frames, audio data and caption data included in video data of a data set on a per-shot basis based on the same single caption; extracting a shot feature vector by calculating the feature vectors of image frames, audio data and caption data included in each shot; extracting feature vectors of query data and a plurality of pieces of option data corresponding to the query data from each query-response pair included in the data set; calculating a video feature vector by inputting the shot feature vectors into a multilayer neural network, assigning an attention weight, calculated based on the feature vector of the query data, to output vectors of respective layers, and then summing the weighted output vectors; and selecting a final response from among the plurality of pieces of option data based on similarities between the video feature vector and option feature vectors.
Public/Granted literature
Information query
Patent Agency Ranking
0/0