System and method for synthetic audio generation
Abstract:
Embodiments provide a method and system for audio generation from contextual text input is provided. The disclosure gives due importance to the granularity of the content. The system allows the user to specify the properties of the audio to be generated. Here, context is used to identify the importance of a particular sound over the others and thus automatic adjustments of the audio output to give a more realistic feel. The system generates dataset for training audio models. The user can give input query in natural language and the audio requested will be generated for training and developing the necessary classification or other necessary audio models. The system provides a feature of automated fine-tuning of the model parameters to suit the new automatically collected training data. Furthermore, the system provides a pre-trained inbuilt model repository with audio models belonging to the main categories of noises.
Public/Granted literature
Information query
Patent Agency Ranking
0/0