Voice synthesis method, model training method, device and computer device

Invention Grant

US12014720B2 Voice synthesis method, model training method, device and computer device 有权

Please log in to see more content

Patent Title: Voice synthesis method, model training method, device and computer device
Application No.: US16999989

Application Date: 2020-08-21
Publication No.: US12014720B2

Publication Date: 2024-06-18
Inventor: Xixin Wu , Mu Wang , Shiyin Kang , Dan Su , Dong Yu
Applicant: Tencent Technology (Shenzhen) Company Limited
Applicant Address: CN Shenzhen
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Current Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Current Assignee Address: CN Shenzhen
Agency: Morgan, Lewis & Bockius LLP
Priority: CN 1810828220.1 2018.07.25
Main IPC: G10L13/00
IPC: G10L13/00 ; G10L19/02

Voice synthesis method, model training method, device and computer device

Abstract:

This application relates to a speech synthesis method and apparatus, a model training method and apparatus, and a computer device. The method includes: obtaining to-be-processed linguistic data; encoding the linguistic data, to obtain encoded linguistic data; obtaining an embedded vector for speech feature conversion, the embedded vector being generated according to a residual between synthesized reference speech data and reference speech data that correspond to the same reference linguistic data; and decoding the encoded linguistic data according to the embedded vector, to obtain target synthesized speech data on which the speech feature conversion is performed. The solution provided in this application can prevent quality of a synthesized speech from being affected by a semantic feature in a mel-frequency cepstrum.

Public/Granted literature

US20200380949A1 VOICE SYNTHESIS METHOD, MODEL TRAINING METHOD, DEVICE AND COMPUTER DEVICE Public/Granted day:2020-12-03

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统