一种基于端到端的水场景音频的生成方法

Invention Publication

Please log in to see more content

Patent Title: 一种基于端到端的水场景音频的生成方法
Patent Title (English): A water scene audio generation method based on end-to-end
Application No.: CN201910091367.1

Application Date: 2019-01-30
Publication No.: CN109936766A

Publication Date: 2019-06-25
Inventor: 刘世光 , 程皓楠 , 王凯
Applicant: 天津大学
Applicant Address: 天津市南开区卫津路92号
Assignee: 天津大学
Current Assignee: 天津大学
Current Assignee Address: 天津市南开区卫津路92号
Agency: 天津市北洋有限责任专利代理事务所
Agent 潘俊达
Main IPC: H04N21/439
IPC: H04N21/439 ; G10L21/003

Abstract:

本发明属于音频处理的技术领域，具体涉及一种基于端到端的水场景音频的生成方法，包括如下步骤：步骤一，选取各类水场景视频，并进行预处理；步骤二，根据预处理后的数据，通过训练获得生成器模型；步骤三，将无声视频进行预处理，加载到训练好的生成器模型，输出与无声视频对应的音频；步骤四，根据音频的序列生成包络，并加载到训练好的音色增强器模型，输出音色增强后的音频。本发明能够实现端到端的户外水场景声音的自动生成，解决为场景配音费时和费力的问题，同时，利用训练所得的模型来生成水场景音频，能够提高生成速度和同步度，从而提高工作效率。

Abstract(English):

The invention belongs to the technical field of audio processing, and particularly relates to a water scene audio generation method based on end-to-end, which comprises the following steps of: 1, selecting various water scene videos, and preprocessing the water scene videos; Step 2, obtaining a generator model through training according to the preprocessed data; Step 3, preprocessing the silent video, loading the silent video to the trained generator model, and outputting an audio corresponding to the silent video; And step 4, generating an envelope according to the sequence of the audios, loading the envelope to the trained tone intensifier model, and outputting the audios with enhanced tone. According to the invention, automatic generation of end-to-end outdoor water scene sound can be realized, the problem that time and labor are wasted for scene dubbing is solved, and meanwhile, the water scene audio is generated by using the trained model, so that the generation speed and the synchronization degree can be improved, and the working efficiency is improved.

Public/Granted literature

CN109936766B 一种基于端到端的水场景音频的生成方法 Public/Granted day:2021-04-13

Information query

Chinese Patent Announcement Global Dossier Espacenet

IPC分类:

H	电学
H04	电通信技术
H04N	图像通信，如电视
H04N21/00	可选的内容分发，例如交互式电视,或视频点播[VOD]（运动视频数据的实时双向传输入H04N7/14）
H04N21/40	.专门适用于接收内容或者与内容交互的客户端设备，如STB〔机顶盒〕；相关操作
H04N21/43	..内容或者附加数据的处理，例如解复用来自数字视频流的附加数据；基本客户端操作，例如：本地网络的监控或者译码器时钟的同步；客户端中间件
H04N21/439	...音频基本流的处理