Invention Grant
- Patent Title: Transformer-based temporal detection in video
- Application No.: US17572624
- Application Date: 2022-01-10
- Publication No.: US12148214B2
- Publication Date: 2024-11-19
- Inventor: Zhiyu Cheng, Le Kang, Xin Zhou, Hao Tian, Xing Li, Bo He, Jingyu Xin
- Applicant: Baidu USA, LLC
- Applicant Address: US CA Sunnyvale
- Assignee: Baidu USA, LLC
- Current Assignee: Baidu USA, LLC
- Current Assignee Address: US CA Sunnyvale
- Agency: Oppedahl Patent Law Firm LLC
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/09 ; G06V10/42 ; G06V20/40

Abstract:
With rapidly evolving technologies and emerging tools, the volume of sports-related video generated online is growing quickly. To automate the sports video editing/highlight generation process, a key task is to precisely recognize and locate events-of-interest in videos. Embodiments herein comprise a two-stage paradigm to detect categories of events and when these events happen in videos. In one or more embodiments, multiple action recognition models extract high-level semantic features, and a transformer-based temporal detection module locates target events. These novel approaches achieved state-of-the-art performance in both action spotting and replay grounding. While presented in the context of sports, it shall be noted that the systems and methods herein may be used for videos comprising other content and events.
Public/Granted literature
- US20230055636A1 TRANSFORMER-BASED TEMPORAL DETECTION IN VIDEO Public/Granted day: 2023-02-23