Voice activity detection method and apparatus

Invention Grant

US10937448B2 Voice activity detection method and apparatus 有权

Please log in to see more content

Patent Title: Voice activity detection method and apparatus
Application No.: US16234423

Application Date: 2018-12-27
Publication No.: US10937448B2

Publication Date: 2021-03-02
Inventor: Chao Li , Weixin Zhu
Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Applicant Address: CN Beijing
Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Current Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Current Assignee Address: CN Beijing
Agency: J.C. Patents
Priority: CN201810606354.9 20180613
Main IPC: G10L25/78
IPC: G10L25/78 ; G10L25/84 ; G10L15/02 ; G10L15/06 ; G10L15/16 ; G10L15/22 ; G10L25/87

Voice activity detection method and apparatus

Abstract:

A voice activity detection method and an apparatus are provided by embodiments of the present application. The method includes: performing framing processing on a voice to be detected to obtain a plurality of audio frames to be detected; obtaining an acoustic feature of each of the audio frames to be detected, and sequentially inputting the acoustic feature of the each of the audio frames to be detected to a VAD model, wherein the VAD model is configured to classify a first N voice frame in the voice to be detected as a noise frame, classify frames from an (N+1)-th voice frame to a last voice frame as voice frames, and classify a M noise frame after the last voice frame as a voice frame, where N and M are integers; and determining, according to a classification result output by the VAD model.

Public/Granted literature

US20190385636A1 VOICE ACTIVITY DETECTION METHOD AND APPARATUS Public/Granted day:2019-12-19

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/78	.语音信号存在或不存在的检测（在双向扩音电话系统中通过语音频率切换传输的方向入H04M9/10）