Invention Grant
- Patent Title: Correlating videos and sentences
- Patent Title (中): 关联视频和句子
-
Application No.: US14099300Application Date: 2013-12-06
-
Publication No.: US09183466B2Publication Date: 2015-11-10
- Inventor: Jeffrey Mark Siskind , Andrei Barbu , Siddharth Narayanaswamy , Haonan Yu
- Applicant: Purdue Research Foundation
- Applicant Address: US IN West Lafayette
- Assignee: Purdue Research Foundation
- Current Assignee: Purdue Research Foundation
- Current Assignee Address: US IN West Lafayette
- Agency: Lee & Hayes, PLLC
- Agent Christopher J. White
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06K9/72 ; G06K9/00 ; G06F17/30

Abstract:
A method of testing a video against an aggregate query includes automatically receiving an aggregate query defining participant(s) and condition(s) on the participant(s). Candidate object(s) are detected in the frames of the video. A first lattice is constructed for each participant, the first-lattice nodes corresponding to the candidate object(s). A second lattice is constructed for each condition. An aggregate lattice is constructed using the respective first lattice(s) and the respective second lattice(s). Each aggregate-lattice node includes a scoring factor combining a first-lattice node factor and a second-lattice node factor. respective aggregate score(s) are determined of one or more path(s) through the aggregate lattice, each path including a respective plurality of the nodes in the aggregate lattice, to determine whether the video corresponds to the aggregate query. A method of providing a description of a video is also described and includes generating a candidate description with participant(s) and condition(s) selected from a linguistic model; constructing component lattices for the participant(s) or condition(s), producing an aggregate lattice having nodes combining component-lattice factors, and determining a score for the video with respect to the candidate description by determining an aggregate score for a path through the aggregate lattice. If the aggregate score does not satisfy a termination condition, participant(s) or condition(s) from the linguistic model are added to the condition, and the process is repeated. A method of testing a video against an aggregate query by mathematically optimizing a unified cost function is also described.
Public/Granted literature
- US20140369596A1 CORRELATING VIDEOS AND SENTENCES Public/Granted day:2014-12-18
Information query