- Patent Title: Voice conversion apparatus, voice conversion learning apparatus, image generation apparatus, image generation learning apparatus, voice conversion method, voice conversion learning method, image generation method, image generation learning method, and computer program
- Application No.: US17640221
- Application Date: 2020-09-04
- Publication No.: US12217755B2
- Publication Date: 2025-02-04
- Inventor: Hirokazu Kameoka, Ko Tanaka, Yasunori Oishi, Takuhiro Kaneko, Aaron Valero Puche
- Applicant: Nippon Telegraph and Telephone Corporation
- Applicant Address: JP Tokyo
- Assignee: Nippon Telegraph and Telephone Corporation
- Current Assignee: Nippon Telegraph and Telephone Corporation
- Current Assignee Address: JP Tokyo
- Agency: Fish & Richardson P.C.
- Priority: JP2019-163418 20190906
- International Application: PCT/JP2020/033607 WO 20200904
- International Announcement: WO2021/045194 WO 20210311
- Main IPC: G10L15/25
- IPC: G10L15/25; G06V40/16; G10L15/02; G10L15/06

Abstract:
A voice conversion device includes a linguistic information extraction unit that extracts linguistic information corresponding to utterance content from a conversion-source voice signal, an appearance feature extraction unit that extracts, from a captured image of a person, appearance features representing the look of the person's face, and a converted voice generation unit that generates a converted voice on the basis of the linguistic information and the appearance features.
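
The data flow described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the class name `VoiceConverter`, its field names, and the three `toy_*` functions are hypothetical placeholders chosen for illustration, and the arithmetic inside them is arbitrary. The sketch only shows how the three units could be wired together, with linguistic information taken from the source voice and appearance features taken from the face image.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Vector = List[float]


@dataclass
class VoiceConverter:
    """Wires together the three units named in the abstract (all hypothetical callables)."""
    extract_linguistic: Callable[[Sequence[float]], List[Vector]]      # voice signal -> per-frame linguistic info
    extract_appearance: Callable[[Sequence[Sequence[float]]], Vector]  # face image -> appearance features
    generate_voice: Callable[[List[Vector], Vector], List[float]]      # both inputs -> converted voice

    def convert(self, source_voice: Sequence[float],
                face_image: Sequence[Sequence[float]]) -> List[float]:
        linguistic = self.extract_linguistic(source_voice)   # depends only on the source voice
        appearance = self.extract_appearance(face_image)     # depends only on the face image
        return self.generate_voice(linguistic, appearance)   # conditioned on both


# Toy stand-ins so the sketch runs; a real system would use learned models here.
def toy_linguistic(signal: Sequence[float], frame: int = 2) -> List[Vector]:
    # Placeholder: split the signal into fixed-size frames.
    return [list(signal[i:i + frame]) for i in range(0, len(signal), frame)]


def toy_appearance(image: Sequence[Sequence[float]]) -> Vector:
    # Placeholder: use the mean pixel intensity as a one-dimensional feature.
    pixels = [p for row in image for p in row]
    return [sum(pixels) / len(pixels)]


def toy_generator(linguistic: List[Vector], appearance: Vector) -> List[float]:
    # Placeholder: scale every sample by the appearance feature.
    return [sample * appearance[0] for frame in linguistic for sample in frame]


converter = VoiceConverter(toy_linguistic, toy_appearance, toy_generator)
converted = converter.convert([1.0, 2.0, 3.0, 4.0], [[0.5, 0.5], [0.5, 0.5]])
```

Keeping the two extractors separate mirrors the abstract: the utterance content comes from the voice signal alone, while the face image influences only how the generated voice sounds.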