System and method for synthetic audio generation

Invention Grant

US11893305B2 System and method for synthetic audio generation (in force)

Patent Title: System and method for synthetic audio generation
Application No.: US17661889

Application Date: 2022-05-03
Publication No.: US11893305B2

Publication Date: 2024-02-06
Inventor: Robin Tommy Kulangara Muriyil, Reshmi Ravindranathan, Aswathy Sreelekha Krishna, Rinu Michael
Applicant: Tata Consultancy Services Limited
Applicant Address: IN Mumbai
Assignee: TATA CONSULTANCY SERVICES LIMITED
Current Assignee: TATA CONSULTANCY SERVICES LIMITED
Current Assignee Address: IN Mumbai
Agency: FINNEGAN, HENDERSON, FARABOW, GARRETT & DUNNER LLP
Priority: IN 2121036836 2021.08.13
Main IPC: G06F3/16
IPC: G06F3/16 ; G06F40/279 ; G10K15/02

Abstract:

Embodiments provide a method and system for generating audio from contextual text input. The disclosure gives due importance to the granularity of the content. The system allows the user to specify the properties of the audio to be generated. Context is used to identify the importance of a particular sound over the others, so that the audio output is adjusted automatically to give a more realistic feel. The system generates datasets for training audio models. The user can give an input query in natural language, and the requested audio is generated for training and developing classification or other audio models. The system provides automated fine-tuning of the model parameters to suit the new, automatically collected training data. Furthermore, the system provides a pre-trained, built-in model repository with audio models belonging to the main categories of noises.
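
The idea of letting context decide how prominent each sound is can be illustrated with a minimal sketch. This is not the patented method: the function names, the sine-tone placeholders, and the importance scores are all hypothetical, and the scores are simply assumed to come from some earlier context-analysis step that the abstract does not detail. The sketch only shows one plausible way to turn such scores into gains for a weighted mix.

```python
import math

SAMPLE_RATE = 8000  # samples per second; arbitrary choice for this sketch


def sine_tone(freq_hz, duration_s):
    """Placeholder for a generated sound: a pure sine tone as a list of floats."""
    n = int(duration_s * SAMPLE_RATE)
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def normalize_importance(importance):
    """Scale non-negative importance scores so that they sum to 1."""
    total = sum(importance.values())
    if total <= 0:
        raise ValueError("at least one importance score must be positive")
    return {name: score / total for name, score in importance.items()}


def mix_by_importance(tracks, importance):
    """Weighted sum of the tracks; sounds with higher importance are louder."""
    gains = normalize_importance(importance)
    # Truncate to the shortest track so every index is valid for all tracks.
    length = min(len(samples) for samples in tracks.values())
    return [
        sum(gains[name] * tracks[name][i] for name in tracks)
        for i in range(length)
    ]


# Hypothetical scene: "heavy rain with a distant dog bark".
tracks = {"rain": sine_tone(220.0, 0.5), "dog_bark": sine_tone(880.0, 0.5)}
importance = {"rain": 0.8, "dog_bark": 0.2}  # assumed output of a context step
mixed = mix_by_importance(tracks, importance)
```

Because the gains are normalized to sum to 1, the mix stays within the amplitude range of its inputs, so the dominant sound is emphasized without clipping.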

Public/Granted literature

US20230083346A1 SYSTEM AND METHOD FOR SYNTHETIC AUDIO GENERATION (Public/Granted day: 2023-03-16)

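IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F3/00	Input arrangements for transferring data to be processed into a form capable of being handled by the computer; output arrangements for transferring data from the processing unit to an output unit, e.g. interface arrangements
G06F3/16	. Sound input; sound output (speech processing: see G10L)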