Inaudible watermark enabled text-to-speech framework

Invention Grant

US11138964B2 Inaudible watermark enabled text-to-speech framework 有权

Please log in to see more content

Patent Title: Inaudible watermark enabled text-to-speech framework
Application No.: US16659550

Application Date: 2019-10-21
Publication No.: US11138964B2

Publication Date: 2021-10-05
Inventor: Wei Ping , Zhenyu Zhong , Yueqiang Cheng , Xing Li , Tao Wei
Applicant: Baidu USA LLC
Applicant Address: US CA Sunnyvale
Assignee: Baidu USA LLC
Current Assignee: Baidu USA LLC
Current Assignee Address: US CA Sunnyvale
Agency: Womble Bond Dickinson (US) LLP
Main IPC: G10L13/047
IPC: G10L13/047 ; G10L25/30 ; G10L19/018

Inaudible watermark enabled text-to-speech framework

Abstract:

According to various embodiments, an end-to-end TTS framework can integrate a watermarking process into the training of the TTS framework, which enables watermarks to be imperceptible within a synthesized/cloned audio segment generated by the TTS framework. The watermarks added in such a matter are statistically undetectable to prevent authorized removal. According to an exemplary method of training the TTS framework, a TTS neural network model and a watermarking neural network mode in the TTS framework are trained in an end to end manner, with the watermarking being part of the optimization process of the TTS framework. During the training, neuron values of the TTS neural network model are adjusted based on training data to prepare one or more spaces for adding a watermark in a synthesized audio segment to be generated by the TTS framework.

Public/Granted literature

US20210118423A1 INAUDIBLE WATERMARK ENABLED TEXT-TO-SPEECH FRAMEWORK Public/Granted day:2021-04-22

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备
G10L13/04	..语音合成系统的零部件，例如合成设备结构或存储器管理
G10L13/047	...语音合成设备的体系结构