Invention Grant
- Patent Title: Automatic synthesis of translated speech using speaker-specific phonemes
-
Application No.: US17131043Application Date: 2020-12-22
-
Publication No.: US11594226B2Publication Date: 2023-02-28
- Inventor: Su Liu , Yang Liang , Debbie Anglin , Fan Yang
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Garg Law Firm, PLLC
- Agent Rakesh Garg; Nathan Rau
- Main IPC: G10L15/26
- IPC: G10L15/26 ; G10L15/02 ; G10L13/02 ; G06F40/279 ; G06F40/58 ; G10L25/54 ; G06F16/683

Abstract:
An embodiment includes converting an original audio signal to an original text string, the original audio signal being from a recording of the original text string spoken by a specific person in a source language. The embodiment generates a translated text string by translating the original text string from the source language to a target language, including translation of a word from the source language to a target language. The embodiment assembles a standard phoneme sequence from a set of standard phonemes, where the standard phoneme sequence includes a standard pronunciation of the translated word. The embodiment also associates a custom phoneme with a standard phoneme of the standard phoneme sequence, where the custom phoneme includes the specific person's pronunciation of a sound in the translated word. The embodiment synthesizes the translated text string to a translated audio signal including the translated word pronounced using the custom phoneme.
Public/Granted literature
- US20220199086A1 AUTOMATIC SYNTHESIS OF TRANSLATED SPEECH USING SPEAKER-SPECIFIC PHONEMES Public/Granted day:2022-06-23
Information query