Video action localization from proposal-attention
Abstract:
A method for processing a sequence of frames includes receiving a sequence of frames and multiple action proposals for the sequence of frames. The method also includes generating a representation of the sequence of frames and pooling the representation around each of the action proposals. The method further includes classifying the action proposals based on the pooled representations and controlling a device based on the classifying.
Public/Granted literature
Information query
Patent Agency Ranking
0/0