Invention Grant
- Patent Title: Inaudible watermark enabled text-to-speech framework
-
Application No.: US16659550Application Date: 2019-10-21
-
Publication No.: US11138964B2Publication Date: 2021-10-05
- Inventor: Wei Ping , Zhenyu Zhong , Yueqiang Cheng , Xing Li , Tao Wei
- Applicant: Baidu USA LLC
- Applicant Address: US CA Sunnyvale
- Assignee: Baidu USA LLC
- Current Assignee: Baidu USA LLC
- Current Assignee Address: US CA Sunnyvale
- Agency: Womble Bond Dickinson (US) LLP
- Main IPC: G10L13/047
- IPC: G10L13/047 ; G10L25/30 ; G10L19/018

Abstract:
According to various embodiments, an end-to-end TTS framework can integrate a watermarking process into the training of the TTS framework, which enables watermarks to be imperceptible within a synthesized/cloned audio segment generated by the TTS framework. The watermarks added in such a matter are statistically undetectable to prevent authorized removal. According to an exemplary method of training the TTS framework, a TTS neural network model and a watermarking neural network mode in the TTS framework are trained in an end to end manner, with the watermarking being part of the optimization process of the TTS framework. During the training, neuron values of the TTS neural network model are adjusted based on training data to prepare one or more spaces for adding a watermark in a synthesized audio segment to be generated by the TTS framework.
Public/Granted literature
- US20210118423A1 INAUDIBLE WATERMARK ENABLED TEXT-TO-SPEECH FRAMEWORK Public/Granted day:2021-04-22
Information query