Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation
Abstract:
The present disclosure provides methods and apparatus for optimizing a keyword spotting system. A set of utterance texts including a given keyword may be generated. A set of speech signals corresponding to the set of utterance texts may be synthesized. An acoustic model in the keyword spotting system may be optimized with at least a part of speech signals in the set of speech signals and utterance texts in the set of utterance texts corresponding to the at least a part of speech signals.
Information query
Patent Agency Ranking
0/0