Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation

Invention Grant

US12159627B2 Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation 有权

Please log in to see more content

Patent Title: Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation
Application No.: US17626629

Application Date: 2020-06-12
Publication No.: US12159627B2

Publication Date: 2024-12-03
Inventor: Yao Tian , Yuija Xiao , Edward Lin , Lei He , Hui Zhu
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Schwegman Lundberg & Woessner, P.A.
Priority: CN201910783303.8 20190823
International Application: PCT/US2020/037339 WO 20200612
International Announcement: WO2021/040842 WO 20210304
Main IPC: G10L15/16
IPC: G10L15/16 ; G06F40/166 ; G06F40/279 ; G06F40/30 ; G10L13/02 ; G10L15/18 ; G10L15/22 ; G10L15/08

Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation

Abstract:

The present disclosure provides methods and apparatus for optimizing a keyword spotting system. A set of utterance texts including a given keyword may be generated. A set of speech signals corresponding to the set of utterance texts may be synthesized. An acoustic model in the keyword spotting system may be optimized with at least a part of speech signals in the set of speech signals and utterance texts in the set of utterance texts corresponding to the at least a part of speech signals.

Public/Granted literature

US20220262352A1 Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation Public/Granted day:2022-08-18

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/16	..利用人工神经网络