Hierarchical generated audio detection system

Invention Grant

US11763836B2 Hierarchical generated audio detection system 有权

Please log in to see more content

Patent Title: Hierarchical generated audio detection system
Application No.: US17674086

Application Date: 2022-02-17
Publication No.: US11763836B2

Publication Date: 2023-09-19
Inventor: Jianhua Tao , Zhengkun Tian , Jiangyan Yi
Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
Applicant Address: CN Beijing
Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
Current Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
Current Assignee Address: CN Beijing
Agency: Westbridge IP LLC
Priority: CN 2110827718.8 2021.07.21
Main IPC: G10L25/24
IPC: G10L25/24 ; G10L25/30

Abstract:

Disclosed is a hierarchical generated audio detection system, comprising an audio preprocessing module, a CQCC feature extraction module, a LFCC feature extraction module, a first-stage lightweight coarse-level detection model and a second-stage fine-level deep identification model; the audio preprocessing module preprocesses collected audio or video data to obtain an audio clip with a length not exceeding the limit; inputting the audio clip into CQCC feature extraction module and LFCC feature extraction module respectively to obtain CQCC feature and LFCC feature; inputting CQCC feature or LFCC feature into the first-stage lightweight coarse-level detection model for first-stage screening to screen out the first-stage real audio and the first-stage generated audio; inputting the CQCC feature or LFCC feature of the first-stage generated audio into the second-stage fine-level deep identification model to identify the second-stage real audio and the second-stage generated audio, and the second-stage generated audio is identified as generated audio.

Public/Granted literature

US20230027645A1 HIERARCHICAL GENERATED AUDIO DETECTION SYSTEM Public/Granted day:2023-01-26

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/03	.以提取参数类型为特征的
G10L25/24	..提取参数的倒谱