Speech synthesis method and apparatus, and readable storage medium

Invention Grant

US12033612B2 Speech synthesis method and apparatus, and readable storage medium (Active)

Patent Title: Speech synthesis method and apparatus, and readable storage medium
Application No.: US17984437

Application Date: 2022-11-10
Publication No.: US12033612B2

Publication Date: 2024-07-09
Inventor: Yibin Zheng, Xinhui Li, Li Lu
Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Applicant Address: CN Shenzhen
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Current Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Current Assignee Address: CN Shenzhen
Agency: ANOVA LAW GROUP, PLLC
Priority: CN 2110267221.5 2021-03-11
Main IPC: G10L13/02
IPC: G10L13/02; G10L19/04; G10L21/043

Abstract:

A speech synthesis method includes: converting a text input sequence into a text feature representation sequence; inputting the text feature representation sequence into an encoder including N encoding layers; the N encoding layers including an encoding layer E_i and an encoding layer E_{i+1}; the encoding layer E_{i+1} including a first multi-head self-attention network; acquiring a first attention matrix and a historical text encoded sequence outputted by the encoding layer E_i, and generating a second attention matrix of the encoding layer E_{i+1} according to residual connection between the first attention matrix and the first multi-head self-attention network and the historical text encoded sequence; and generating a target text encoded sequence of the encoding layer E_{i+1} according to the second attention matrix and the historical text encoded sequence, and generating synthesized speech data matched with the text input sequence based on the target text encoded sequence.
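
The layer-to-layer flow described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: it assumes a single attention head instead of a multi-head network, assumes the "residual connection" means adding the previous layer's pre-softmax attention scores to the current layer's scores, and omits layer normalization, feed-forward sublayers, and the speech decoder. The function name `residual_attention_layer` and the weight arguments `w_q`, `w_k`, `w_v` are hypothetical names chosen for illustration.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def softmax_rows(m):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    out = []
    for row in m:
        peak = max(row)
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def residual_attention_layer(history, prev_scores, w_q, w_k, w_v):
    """Simplified single-head stand-in for one encoding layer (assumption).

    history:     T x d sequence produced by the previous layer
                 (the "historical text encoded sequence").
    prev_scores: T x T attention scores of the previous layer
                 (the "first attention matrix"), or None for the first layer.
    Returns (encoded, scores): the new T x d sequence and the T x T
    scores ("second attention matrix") to hand to the next layer.
    """
    q = matmul(history, w_q)
    k = matmul(history, w_k)
    v = matmul(history, w_v)
    scale = math.sqrt(len(q[0]))
    scores = [[s / scale for s in row] for row in matmul(q, transpose(k))]
    if prev_scores is not None:
        # Assumed form of the residual connection: add the previous
        # layer's scores to the current scores before the softmax.
        scores = [[s + p for s, p in zip(row, prev_row)]
                  for row, prev_row in zip(scores, prev_scores)]
    encoded = matmul(softmax_rows(scores), v)
    return encoded, scores


# Usage: chain two layers over a toy two-token sequence with identity weights.
identity = [[1.0, 0.0], [0.0, 1.0]]
tokens = [[1.0, 0.0], [0.0, 1.0]]
hidden_1, scores_1 = residual_attention_layer(tokens, None, identity, identity, identity)
hidden_2, scores_2 = residual_attention_layer(hidden_1, scores_1, identity, identity, identity)
```

Passing the scores forward alongside the encoded sequence means each layer starts from the attention pattern of the layer below it rather than recomputing one from scratch, which is the role the abstract assigns to the first attention matrix.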

Public/Granted literature

US20230075891A1 Speech synthesis method and apparatus, and readable storage medium. Publication date: 2023-03-09

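IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	. Methods for producing synthetic speech; speech synthesisers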