Invention Grant
- Patent Title: Duration informed attention network (DURIAN) for audio-visual synthesis
-
Application No.: US16549068Application Date: 2019-08-23
-
Publication No.: US11151979B2Publication Date: 2021-10-19
- Inventor: Heng Lu , Chengzhu Yu , Dong Yu
- Applicant: TENCENT AMERICA LLC
- Applicant Address: US CA Palo Alto
- Assignee: TENCENT AMERICA LLC
- Current Assignee: TENCENT AMERICA LLC
- Current Assignee Address: US CA Palo Alto
- Agency: Sughrue Mion, PLLC
- Main IPC: G10L13/08
- IPC: G10L13/08 ; G10L13/027 ; G10L13/02 ; G10L13/033 ; G10L13/10 ; G10L19/03 ; G06T13/40 ; G10L19/00 ; G10L13/00

Abstract:
A method and apparatus include receiving a text input that includes a sequence of text components. Respective temporal durations of the text components are determined using a duration model. A spectrogram frame is generated based on the duration model. An audio waveform is generated based on the spectrogram frame. Video information is generated based on the audio waveform. The audio waveform is provided as an output along with a corresponding video.
Public/Granted literature
- US20210056949A1 DURATION INFORMED ATTENTION NETWORK (DURIAN) FOR AUDIO-VISUAL SYNTHESIS Public/Granted day:2021-02-25
Information query