Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style in heterogeneous label

Invention Grant

US11056096B2 Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style in heterogeneous label 有权

Please log in to see more content

Patent Title: Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style in heterogeneous label
Application No.: US16566265

Application Date: 2019-09-10
Publication No.: US11056096B2

Publication Date: 2021-07-06
Inventor: Jonghoon Chae
Applicant: LG ELECTRONICS INC.
Applicant Address: KR Seoul
Assignee: LG ELECTRONICS INC.
Current Assignee: LG ELECTRONICS INC.
Current Assignee Address: KR Seoul
Agency: Birch, Stewart, Kolasch & Birch, LLP
Priority: KR10-2019-0093560 20190731
Main IPC: G10L13/00
IPC: G10L13/00 ; G10L13/10 ; G10L13/033 ; G10L13/047

Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style in heterogeneous label

Abstract:

Disclosed is an artificial intelligence (AI)-based voice sampling apparatus for providing a speech style in a heterogeneous label, including a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample, a text encoder configured to receive text for reflecting the vocal feature, a processor configured to classify the voice sample input to the rhythm encoder into a label according to the vocal feature, provide a weight by measuring a distance between a voice sample corresponding to the label and a voice sample corresponding to a heterogeneous label as a label other than the label and provide a weight by measuring similarity between the label and the heterogeneous label, extract an embedding vector representing the vocal feature, generate a speech style from the embedding vector, and apply the generated speech style to the text, and a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.

Public/Granted literature

US20200005764A1 ARTIFICIAL INTELLIGENCE (AI)-BASED VOICE SAMPLING APPARATUS AND METHOD FOR PROVIDING SPEECH STYLE IN HETEROGENEOUS LABEL Public/Granted day:2020-01-02

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统