Invention Grant
- Patent Title: Voice synthesis method, model training method, device and computer device
-
Application No.: US16999989Application Date: 2020-08-21
-
Publication No.: US12014720B2Publication Date: 2024-06-18
- Inventor: Xixin Wu , Mu Wang , Shiyin Kang , Dan Su , Dong Yu
- Applicant: Tencent Technology (Shenzhen) Company Limited
- Applicant Address: CN Shenzhen
- Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Current Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Current Assignee Address: CN Shenzhen
- Agency: Morgan, Lewis & Bockius LLP
- Priority: CN 1810828220.1 2018.07.25
- Main IPC: G10L13/00
- IPC: G10L13/00 ; G10L19/02

Abstract:
This application relates to a speech synthesis method and apparatus, a model training method and apparatus, and a computer device. The method includes: obtaining to-be-processed linguistic data; encoding the linguistic data, to obtain encoded linguistic data; obtaining an embedded vector for speech feature conversion, the embedded vector being generated according to a residual between synthesized reference speech data and reference speech data that correspond to the same reference linguistic data; and decoding the encoded linguistic data according to the embedded vector, to obtain target synthesized speech data on which the speech feature conversion is performed. The solution provided in this application can prevent quality of a synthesized speech from being affected by a semantic feature in a mel-frequency cepstrum.
Public/Granted literature
- US20200380949A1 VOICE SYNTHESIS METHOD, MODEL TRAINING METHOD, DEVICE AND COMPUTER DEVICE Public/Granted day:2020-12-03
Information query