HUMAN ACTION RECOGNITION IN DRONE VIDEOS
    Invention application

    Publication (announcement) No.: WO2020040929A1

    Publication (announcement) date: 2020-02-27

    Application No.: PCT/US2019/043544

    Filing date: 2019-07-26

    Abstract: A method is provided for drone-video-based action recognition. The method learns (220) a transformation for each of the target video clips taken from a set of target videos, responsive to original features extracted from the target video clips. The transformation corrects differences between a target drone domain corresponding to the target video clips and a source non-drone domain corresponding to source video clips taken from a set of source videos. The method adapts (225) the target domain to the source domain by applying the transformation to the original features to obtain transformed features for the target video clips. The method converts (230) the original and transformed features of the same target video clips into a single classification feature for each of the target videos. The method classifies (240) a human action in a new target video relative to the set of source videos using the single classification feature for each of the target videos.
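The data flow of steps 225 to 240 can be illustrated with a minimal sketch. The abstract does not say what form the transformation, the feature conversion, or the classifier take, so everything concrete here is an assumption made for illustration: the transformation is passed in as an arbitrary function, step 230 is taken to be concatenation followed by averaging over clips, and step 240 is taken to be a nearest-centroid lookup. The learning step (220) is not shown, and the names `adapt`, `video_feature`, and `classify` are hypothetical.

```python
from typing import Callable, Dict, List, Sequence

Vector = List[float]


def adapt(features: Vector, transform: Callable[[Vector], Vector]) -> Vector:
    """Step 225: apply a (learned) per-clip transformation to original features."""
    return transform(features)


def video_feature(originals: Sequence[Vector], transformed: Sequence[Vector]) -> Vector:
    """Step 230, assumed form: concatenate each clip's original and transformed
    features, then average the result over all clips of the video."""
    joined = [o + t for o, t in zip(originals, transformed)]
    n = len(joined)
    return [sum(column) / n for column in zip(*joined)]


def classify(feature: Vector, centroids: Dict[str, Vector]) -> str:
    """Step 240, assumed form: pick the label whose centroid is nearest
    (squared Euclidean distance)."""
    def dist(a: Vector, b: Vector) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: dist(feature, centroids[label]))


# Toy data: two clips from one drone video, two features per clip.
clips = [[1.0, 2.0], [3.0, 4.0]]

# Hypothetical stand-in for a learned transformation: a constant shift.
shift = lambda f: [x - 1.0 for x in f]

transformed = [adapt(c, shift) for c in clips]
feature = video_feature(clips, transformed)

# Hypothetical class centroids that would come from the source (non-drone) videos.
centroids = {"walking": [2.0, 3.0, 1.0, 2.0], "waving": [0.0, 0.0, 0.0, 0.0]}
label = classify(feature, centroids)
```

Keeping the original features alongside the transformed ones, as the abstract describes, means the classifier sees both the raw drone-domain signal and its domain-corrected version in a single vector per video.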

    SELF-SUPERVISED CROSS-VIDEO TEMPORAL DIFFERENCE LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION

    Publication (announcement) No.: WO2021242520A1

    Publication (announcement) date: 2021-12-02

    Application No.: PCT/US2021/031929

    Filing date: 2021-05-12

    Abstract: A method is provided for Cross Video Temporal Difference (CVTD) learning. The method adapts (540) a source domain video to a target domain video using a CVTD loss. The source domain video is annotated, and the target domain video is unannotated. The CVTD loss is computed by quantizing (510A) clips derived from the source and target domain videos, dividing the source domain video into source domain clips and the target domain video into target domain clips. The CVTD loss is further computed by sampling (510B) two clips from each of the source domain clips and the target domain clips to obtain four sampled clips: a first source domain clip, a second source domain clip, a first target domain clip, and a second target domain clip. The CVTD loss is computed (510D) as |(second source domain clip - first source domain clip) - (second target domain clip - first target domain clip)|.
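The quantity in step 510D can be made concrete with a small sketch. It assumes that each "clip" in the formula stands for the clip's temporal index within its own video. The abstract does not state this explicitly, so treat it as one plausible reading. The function names, the fixed clip length, and the non-overlapping split are likewise hypothetical.

```python
import random
from typing import List, Tuple


def quantize(num_frames: int, clip_len: int) -> List[Tuple[int, int]]:
    """Step 510A, assumed form: split a video into non-overlapping clips,
    returned as (start_frame, end_frame) pairs; leftover frames are dropped."""
    return [(s, s + clip_len) for s in range(0, num_frames - clip_len + 1, clip_len)]


def sample_pair(num_clips: int, rng: random.Random) -> Tuple[int, int]:
    """Step 510B: sample two distinct clip indices, returned in temporal order."""
    first, second = sorted(rng.sample(range(num_clips), 2))
    return first, second


def cvtd_value(s1: int, s2: int, t1: int, t2: int) -> int:
    """Step 510D: |(s2 - s1) - (t2 - t1)| on clip indices (assumed reading)."""
    return abs((s2 - s1) - (t2 - t1))


rng = random.Random(0)
source_clips = quantize(num_frames=10, clip_len=3)   # annotated source video
target_clips = quantize(num_frames=16, clip_len=3)   # unannotated target video

s1, s2 = sample_pair(len(source_clips), rng)
t1, t2 = sample_pair(len(target_clips), rng)
value = cvtd_value(s1, s2, t1, t2)
```

Because the value depends only on clip positions, it can be computed for the unannotated target video as well, which is what makes the objective self-supervised.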
