Publication No.: US20250131934A1
Publication Date: 2025-04-24
Application No.: US18911317
Filing Date: 2024-10-10
Applicant: Sony Interactive Entertainment Inc.
Inventor: Pierluigi Vito Amadori, Maria Pilataki Manika
IPC: G10L21/013, G10L15/18, G10L25/18
Abstract: An audio generation system for generating output audio comprising speech, the system comprising an input unit configured to receive a first input defining the semantic content of the output audio, and a second input defining one or more desired characteristics of the output audio, a parameter identification unit configured to identify, from one or more latent spaces each associated with one or more possible characteristics of the output audio, one or more parameters for use in generating the output audio in dependence upon the second input, and an output generating unit configured to generate output audio in dependence upon the first input and the identified one or more parameters.
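
The three units named in the abstract can be pictured with a minimal Python sketch. Everything in it is an illustrative assumption rather than the applicants' implementation: the `LATENT_SPACES` table, the characteristic labels, the vector values, and the names `SpeechRequest`, `identify_parameters`, and `generate_output_audio` are all hypothetical, and the "output" is a plain dictionary standing in for synthesized audio.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical latent spaces: each one is associated with a kind of
# characteristic and maps a label to a point (parameter vector).
# Both the labels and the numbers are invented for illustration.
LATENT_SPACES: dict[str, dict[str, tuple[float, ...]]] = {
    "emotion": {"calm": (0.1, 0.2), "angry": (0.9, 0.8)},
    "pace": {"slow": (0.3,), "fast": (0.8,)},
}


@dataclass
class SpeechRequest:
    """Input unit: holds the two inputs described in the abstract."""
    text: str                        # first input: semantic content
    characteristics: dict[str, str]  # second input: desired characteristics


def identify_parameters(characteristics: dict[str, str]) -> dict[str, tuple[float, ...]]:
    """Parameter identification unit (assumed here to be a table lookup)."""
    params = {}
    for space_name, label in characteristics.items():
        # Raises KeyError if the space or label is unknown.
        params[space_name] = LATENT_SPACES[space_name][label]
    return params


def generate_output_audio(text: str, params: dict[str, tuple[float, ...]]) -> dict:
    """Output generating unit: a placeholder that returns a description
    of what would be synthesized instead of real audio samples."""
    # Flatten the parameters in a stable (alphabetical) order of space names.
    conditioning = [v for name in sorted(params) for v in params[name]]
    return {"text": text, "conditioning": conditioning}


request = SpeechRequest("Hello there", {"emotion": "angry", "pace": "slow"})
parameters = identify_parameters(request.characteristics)
result = generate_output_audio(request.text, parameters)
```

In a real system the lookup would presumably be replaced by a learned mapping into the latent space, and the placeholder by a speech synthesis model conditioned on the identified parameters; the abstract does not specify either.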