Abstract:
PROBLEM TO BE SOLVED: To provide a system and a method for presenting a stereo image in a manner well suited to combination with an echo canceller, using the Haas effect to simulate a sound image that gives a better subjective impression of stereo sound than an objective stereo image, without compromising the subjective perception of stereophony. SOLUTION: A far-end audio presenter unit includes a first adder configured to add L and R to create a non-delayed mono signal, a delay unit configured to delay the non-delayed mono signal by a predetermined time to create a delayed mono signal that is fed to one or more main loudspeakers, and one or more left and one or more right loudspeakers, to which first and second derived L and/or R signals are fed. COPYRIGHT: (C)2010,JPO&INPIT
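The signal path in this abstract can be illustrated with a minimal sketch. The function name `haas_presenter`, the sample-based delay, and the choice to pass L and R through unchanged as the "derived" side signals are all assumptions made for illustration; the abstract does not say how the derived signals are computed.

```python
def haas_presenter(left, right, delay_samples):
    """Return (main, side_left, side_right) loudspeaker feeds.

    Hypothetical sketch: the main feed is the delayed L+R mono sum, and the
    side feeds are the unmodified L and R channels (an assumption).
    """
    if len(left) != len(right):
        raise ValueError("left and right must have the same length")
    if delay_samples < 0:
        raise ValueError("delay_samples must be non-negative")

    # First adder: non-delayed mono signal.
    mono = [l + r for l, r in zip(left, right)]

    # Delay unit: shift the mono signal by prepending zeros, keeping the length.
    delayed_mono = ([0.0] * delay_samples + mono)[:len(mono)]

    return delayed_mono, list(left), list(right)


main, side_l, side_r = haas_presenter([1.0, 2.0, 3.0], [0.5, 0.5, 0.5], 1)
```

Because the side loudspeakers emit first, the Haas (precedence) effect lets them set the perceived direction while the delayed main loudspeakers carry the mono sum.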
Abstract:
A method detects blocks that are to be indicated as skipped at an earlier stage of the encoding process than would be the case with other implementations of the ITU H.263 and H.264 standards. The method includes transforming 4×4 blocks in the macro blocks having a skip vector of zero with a binary-transform function. Blocks for which the values of the four upper-left binary-transform coefficients are less than a predefined threshold are defined as skipped, thus minimizing the need for computationally demanding block transformation or quantization.
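The early skip test can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the "binary transform" is taken to be a sequency-ordered Hadamard transform (entries of +1 and -1 only), the input is assumed to be a 4×4 block of residual values, and the names `low_coeffs` and `is_skipped` are hypothetical.

```python
# First two rows of a sequency-ordered 4-point Hadamard matrix (assumed
# transform). Only these rows are needed for the four upper-left coefficients.
H_ROWS = [
    [1, 1, 1, 1],
    [1, 1, -1, -1],
]


def low_coeffs(block):
    """Return the four upper-left coefficients (0,0), (0,1), (1,0), (1,1)."""
    coeffs = []
    for u in range(2):
        for v in range(2):
            c = sum(
                H_ROWS[u][i] * block[i][j] * H_ROWS[v][j]
                for i in range(4)
                for j in range(4)
            )
            coeffs.append(c)
    return coeffs


def is_skipped(block, threshold):
    """A block counts as skipped when all four coefficients are below threshold."""
    return all(abs(c) < threshold for c in low_coeffs(block))


flat = [[1] * 4 for _ in range(4)]
print(low_coeffs(flat), is_skipped(flat, 8), is_skipped(flat, 32))
```

Since the transform uses only additions and subtractions, the check is far cheaper than a full transform and quantization pass.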
Abstract:
An echo canceller that processes echo, noise and near-end talk in a narrower, but still intelligible, frequency band to reduce the required processing power and complexity. In a preferred embodiment of the present invention, an input audio signal of captured sound in an audio communication system is decimated and then divided into a number of sub-bands by an analysis filter. Each sub-band is processed as in background-art audio echo cancelling, by subtracting from the signal an echo estimate produced by a model of the acoustic signal in the respective sub-band, except that the signal is also bypassed, adjusted by a filter and subtracted from the processed signal. The resulting signals are then recombined by a synthesis filter and interpolated to the original sampling rate and bandwidth. Finally, the output from the synthesis filter is added to the input audio signal, which has been delayed and adjusted by a filter. The filters are controlled by a control algorithm that detects the presence of near-end sound, far-end sound and noise, so that the filters, and consequently the high-pass filter of the echo canceller, pass high frequencies (above the low-pass frequencies) only when near-end sound alone is detected.
Abstract:
A method for predicting the motion vector of a pixel block in a video picture that is to be coded. The actual motion vectors of two adjacent blocks close to the upper-left corner of the block are selected as candidates for the prediction. One additional block, also adjacent to the block, is selected to decide which of the candidate motion vectors is to be used as the prediction. The vector difference to the motion vector of the decision block is decisive for the final selection.
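A minimal sketch of the selection step follows. The abstract says only that the vector difference to the decision block's motion vector is decisive; it does not say how. This sketch assumes that the candidate closest to the decision vector wins, measures closeness by the sum of absolute component differences, and breaks ties in favour of the first candidate. The function name is hypothetical.

```python
def predict_motion_vector(cand_a, cand_b, decision):
    """Pick one of two candidate motion vectors as the prediction.

    Assumption: the candidate with the smaller vector difference to the
    decision block's motion vector is chosen (L1 distance, ties go to cand_a).
    """
    def distance(v, w):
        return abs(v[0] - w[0]) + abs(v[1] - w[1])

    if distance(cand_a, decision) <= distance(cand_b, decision):
        return cand_a
    return cand_b


# Candidate (4, 0) is closer to the decision vector (3, 1) than (0, 3) is.
prediction = predict_motion_vector((4, 0), (0, 3), (3, 1))
```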
Abstract:
A system, apparatus, method, and computer program product for detecting and correcting packet losses that adversely affect video quality. This packet loss feature may be employed as part of the secure video conference system, method and computer program product, or be employed in a non-secure system, method and computer program product.
Abstract:
A system, apparatus, method, and computer program product for establishing secure videoconferences between multiple nodes communicating via two or more transport protocols in which at least one encryption protocol is used.
Abstract:
A unified solution to coding/decoding of different video formats such as 4:2:0, 4:2:2 and 4:4:4 is provided. A method of video coding includes transforming a first m×n macro block of residual chrominance pixel values of moving pictures by a first integer-transform function generating a corresponding second m×n macro block of integer-transform coefficients, further transforming DC values of the integer-transform coefficients by a second integer-transform function to generate a third block of integer-transformed DC coefficients. The method further includes generating the second m×n macro block of integer-transform coefficients by utilizing a k×k integer-transform function on each k×k sub-block of the first m×n macro block, wherein n and m are each a multiple of k, and generating the third block of coefficients by utilizing a second i×j integer-transform function on the DC values resulting in a (m/k)×(n/k) third block of integer-transformed DC coefficients.
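The two-stage structure can be sketched as below. The abstract does not name specific transforms, so this illustration assumes k = 4 with the H.264 4×4 forward core transform matrix for the first stage and Hadamard matrices of size 2 or 4 for the DC stage. Scaling and quantization are omitted, and all function names are hypothetical.

```python
K = 4
# Assumed first-stage transform: H.264 4x4 forward core transform matrix.
CORE_4 = [[1, 1, 1, 1], [2, 1, -1, -2], [1, -1, -1, 1], [1, -2, 2, -1]]
# Assumed second-stage transforms for the DC block, indexed by size.
HADAMARD = {
    2: [[1, 1], [1, -1]],
    4: [[1, 1, 1, 1], [1, 1, -1, -1], [1, -1, -1, 1], [1, -1, 1, -1]],
}


def matmul(a, b):
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transform_2d(row_t, block, col_t):
    """Compute row_t * block * col_t^T."""
    col_t_transposed = [list(r) for r in zip(*col_t)]
    return matmul(matmul(row_t, block), col_t_transposed)


def two_stage_transform(block):
    """Return (coefficients, transformed DC block) for an m x n block."""
    m, n = len(block), len(block[0])
    if m % K or n % K:
        raise ValueError("m and n must be multiples of k")
    coeffs = [[0] * n for _ in range(m)]
    dc = [[0] * (n // K) for _ in range(m // K)]
    for bi in range(m // K):
        for bj in range(n // K):
            sub = [row[bj * K:(bj + 1) * K] for row in block[bi * K:(bi + 1) * K]]
            t = transform_2d(CORE_4, sub, CORE_4)
            for i in range(K):
                coeffs[bi * K + i][bj * K:(bj + 1) * K] = t[i]
            dc[bi][bj] = t[0][0]  # DC value of this k x k sub-block
    # Second stage: (m/k) x (n/k) transform of the collected DC values.
    dc_t = transform_2d(HADAMARD[m // K], dc, HADAMARD[n // K])
    return coeffs, dc_t


coeffs, dc_t = two_stage_transform([[1] * 8 for _ in range(8)])
```

The same code path handles an 8×8, 8×16 or 16×16 input, yielding a 2×2, 2×4 or 4×4 DC block respectively, which mirrors the unified treatment of formats that the abstract describes.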
Abstract:
A video teleconferencing system uses digital transcoding to obtain algorithm transcoding, transmission rate matching, and spatial mixing. The video teleconferencing system comprises a multipoint control unit (MCU) for allowing multiple audiovisual terminals, which send and receive compressed digital data signals, to communicate with each other in a conference. The MCU has a video processing unit (VPU) that performs algorithm transcoding, rate matching, and spatial mixing among the terminals within a conference. The VPU includes a time division multiplex pixel bus and a plurality of processors. Each processor is assignable to an audiovisual terminal in the conference and is coupled to the pixel bus. In a receive mode, each processor receives and decodes compressed video signals from its assigned terminal and puts the decoded signal onto the pixel bus. In a transmit mode, each processor receives from the pixel bus uncompressed video signals from any terminal in the conference. The uncompressed video signals are processed and encoded for transmission to the respective assigned terminal. Video encoding time due to motion displacement search is reduced by passing displacement information from the compressed video signals to the encoder to be used directly or as a seed for further refinements of the motion displacement field.
Abstract:
A method for processing an audio signal executed by an audio echo suppression apparatus, the method including: receiving, at the audio echo suppression apparatus, the audio signal; generating a subband signal from the audio signal; delaying, at the audio echo suppression apparatus, the subband signal with a plurality of different delay values to form a plurality of time lag signals; multiplying, at the audio echo suppression apparatus, the plurality of time lag signals with first respective filter coefficients to generate a first signal; calculating, at the audio echo suppression apparatus, a complex product between pairs of the plurality of time lag signals to generate complex product signals; multiplying, at the audio echo suppression apparatus, each of a real part and imaginary part of the complex product signals with second respective filter coefficients, and taking a sum thereof, to generate a second signal; and estimating an echo subband signal from the first signal and the second signal.
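The estimation steps for a single subband sample can be sketched as follows. Several details are assumptions, because the abstract leaves them open: the product between a pair of time lag signals is taken as a plain complex product (no conjugation), the echo estimate is taken to be the sum of the first and second signals, and the function and parameter names are hypothetical.

```python
def estimate_echo_sample(history, delays, linear_coeffs,
                         pairs, real_coeffs, imag_coeffs):
    """Estimate one echo subband sample.

    history: complex subband samples, oldest first, newest last.
    delays: delay values in samples used to form the time lag signals.
    pairs: index pairs into the list of time lag signals.
    """
    # Time lag signals: the subband signal delayed by each delay value.
    lags = [history[-1 - d] for d in delays]

    # First signal: time lag signals weighted by the first filter coefficients.
    first = sum(c * x for c, x in zip(linear_coeffs, lags))

    # Complex products between pairs of time lag signals (assumed unconjugated).
    products = [lags[a] * lags[b] for a, b in pairs]

    # Second signal: real and imaginary parts weighted separately, then summed.
    second = sum(gr * p.real + gi * p.imag
                 for gr, gi, p in zip(real_coeffs, imag_coeffs, products))

    # Assumption: the echo estimate is the sum of the two signals.
    return first + second


echo = estimate_echo_sample(
    history=[1 + 1j, 2 + 0j],
    delays=[0, 1],
    linear_coeffs=[0.5, 1.0],
    pairs=[(0, 1)],
    real_coeffs=[0.5],
    imag_coeffs=[0.25],
)
```

The first signal is a conventional linear filter output; the second adds terms that are second order in the input, which lets the estimate capture behaviour a purely linear model cannot.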
Abstract:
A method and system for determining a direction between a detection point, e.g. at a camera in video conference equipment, and an acoustic source, e.g. an active speaker participating in a video conference. The method comprises receiving acoustic signals originating from the acoustic source at a first and a second pair of microphone elements, arranged symmetrically about the detection point; calculating a first cross correlation signal from the first pair of microphone elements; and calculating a second cross correlation signal from the second pair of microphone elements. The direction is then calculated based on both the first and second cross correlation signals, e.g. by convolution. Further symmetrically arranged pairs of microphone elements may also be used.
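The combination of two cross correlations by convolution can be sketched as below. The angle formula is an assumption not found in the abstract: it presumes a far-field source and a linear array, so that a pair with spacing D sees a delay of D·sin(θ)/c and the convolved peak sits at the sum of the two pair delays. All names, and the default speed of sound of 343 m/s, are illustrative.

```python
import math


def cross_correlation(a, b, max_lag):
    """r[lag + max_lag] = sum over n of a[n + lag] * b[n]."""
    out = []
    for lag in range(-max_lag, max_lag + 1):
        total = 0.0
        for n in range(len(b)):
            m = n + lag
            if 0 <= m < len(a):
                total += a[m] * b[n]
        out.append(total)
    return out


def convolve(x, y):
    out = [0.0] * (len(x) + len(y) - 1)
    for i, xv in enumerate(x):
        for j, yv in enumerate(y):
            out[i + j] += xv * yv
    return out


def combined_lag(inner_pair, outer_pair, max_lag):
    """Peak lag of the convolution of both pairs' cross correlations."""
    r1 = cross_correlation(inner_pair[0], inner_pair[1], max_lag)
    r2 = cross_correlation(outer_pair[0], outer_pair[1], max_lag)
    combined = convolve(r1, r2)
    peak = max(range(len(combined)), key=combined.__getitem__)
    return peak - 2 * max_lag  # index 2 * max_lag corresponds to zero lag


def direction_from_lag(lag, sample_rate, inner_spacing, outer_spacing,
                       speed_of_sound=343.0):
    """Far-field angle in radians (assumed geometry, see lead-in)."""
    s = lag * speed_of_sound / (sample_rate * (inner_spacing + outer_spacing))
    return math.asin(max(-1.0, min(1.0, s)))


def impulse(at, length=16):
    return [1.0 if n == at else 0.0 for n in range(length)]


# Inner pair differs by 1 sample, outer pair by 3 samples.
lag = combined_lag((impulse(6), impulse(5)), (impulse(8), impulse(5)), max_lag=4)
angle = direction_from_lag(lag, 48000, 0.1, 0.3)
```

Convolving the two correlation functions reinforces a peak on which both pairs agree, which is one plausible reason to combine them rather than rely on a single pair.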