Abstract:
PROBLEM TO BE SOLVED: To reduce the coding bit rate of wideband voice signal, without sacrificing the high quality associated with the increased bandwidth. SOLUTION: A bandwidth-adaptive quantization method and apparatus is given for determining the type of acoustic signal and the type of frequency spectrum exhibited by the acoustic signal, in order to selectively delete parameter information before vector quantization. The bits that would otherwise be allocated to the deleted parameters can then be reallocated to the quantization of the remaining parameters, which results in an improvement of the perceptual quality of the synthesized acoustic signal. Alternatively, the bits that would have been allocated to the deleted parameters are dropped which results in an overall bit-rate reduction. COPYRIGHT: (C)2011,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide a monoscopic low-power mobile device that employs a low-complexity approach to detect and estimate depth information for real-time capturing and generation of stereo video.SOLUTION: A monoscopic low-power mobile device is capable of creating real-time stereo images and videos from a single captured view. The device uses statistics from an autofocusing process to create a block depth map of a single capture view. Artifacts in the block depth map are reduced and an image depth map is created. Stereo three-dimensional (3D) left and right views are created from the image depth map using a created stereo Z-buffer based 3D surface recover process and a disparity map which is a function of the geometry of binocular vision.
Abstract:
PROBLEM TO BE SOLVED: To provide a method and system for generating and encoding line spectrum square roots. SOLUTION: The method of encoding linear prediction coefficient data is taught. The linear prediction coefficient data is converted to line spectrum cosine data (103). The line spectrum cosine data is used for generating two recursively defined vectors (104). The recursively defined vectors are used to calculate one set of sensitivity self-correlation values (106a-106N) and one set of sensitivity crosscorreations (107a-107N). The line spectrum cosine values are used for calculating one set of the line spectrum square roots. COPYRIGHT: (C)2003,JPO
Abstract:
PROBLEM TO BE SOLVED: To provide an apparatus for generating a stereo image for stereo image display, and a configuration of the same.SOLUTION: An apparatus for generating a stereo image for stereo image display is configured by including: a first image sensor; a second image sensor separated from the first image sensor; a diversity combine module to combine image data from the first and second image sensors; and an image processing module configured to process combined image data from the diversity combine module.
Abstract:
PROBLEM TO BE SOLVED: To provide a speech coder for analyzing and reconstructing a prototype of a frame by using intelligent subsampling of phase spectrum information of the prototype. SOLUTION: The speech coder produces phase parameters of a reference prototype, generates phase parameters of a current prototype, and correlates the phase parameters of the current prototype with the phase parameters of the reference prototype in multiple frequency bands to analyze the prototype. To reconstruct the prototype using linear phase shift values, the speech coder produces phase parameters of the reference prototype, generates a set of linear phase shift values associated with the prototype, and composes a phase vector from the phase parameters and the linear phase shift values across multiple frequency bands. COPYRIGHT: (C)2008,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide an error concealment mode determination method for determining which of spatial error concealment (SEC) and temporal error concealment (TEC) should be used.SOLUTION: If a video frame is an intra-frame, a similarity metric is used to determine whether the intra-frame represents a scene change or not, and a complex polynomial is used to determine which of SEC and TEC should be used. A spatial error concealment technique divides a corrupt macroblock into four different regions, namely, a corner region, a column region adjacent to the corner region, a row region adjacent to the corner region, and a remainder region. Those regions are reconstructed in the order above and information from earlier reconstructed regions may be used in later reconstructed regions. An inter-macroblock may be refreshed using spatial error concealment if there has been damage that may cause the inter-block to propagate the errors.
Abstract:
PROBLEM TO BE SOLVED: To provide a method of rendering 3D video images on a stereo-enabled display (e.g., stereoscopic or autostereoscopic display).SOLUTION: The process includes removing facets facing away from a viewer, defining foreground facets for left and right views and common background facets, determining lighting for these facets, and performing screen mapping and scene rendering for one view (e.g., the right view) using computational results for facets of the other view (i.e., the left view). In one embodiment, visualization of images is provided on the stereo-enabled display of a low-power device, such as a mobile phone, a computer, a video game platform, or a personal digital assistant (PDA) device.
Abstract:
PROBLEM TO BE SOLVED: To attain a low bit rate in a method and apparatus for variable rate coding of a speech signal. SOLUTION: An input speech signal is classified and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by only employing high fidelity modes during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. The input speech signal is classified into active and inactive regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode. The apparatus dynamically switches between these modes as the properties of the speech signal vary with time. COPYRIGHT: (C)2011,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To perform variable bit rate coding of speech signals.SOLUTION: An input speech signal is classified and a coding mode for achieving the lowest bit rate with allowable speech reproduction quality is selected on the basis of the classification. Low average bit rates are achieved by only employing a high fidelity mode (that is, a high bit rate widely applicable to different types of speech) during a period of portions of the speech where it is required for acceptable output. The input speech signal is classified into active and inactive regions. The active region is further classified into a voiced region, an unvoiced region, and a transient region. Various coding modes are applied according to the required level of fidelity. The modes are dynamically switched among the modes as properties of speech signals vary with time. And where appropriate, regions of the speech are modeled as pseudo random noise, resulting in a significantly low bit rate.
Abstract:
PROBLEM TO BE SOLVED: To use cycle-aligned fragments (CAFs) to facilitate access to encoded FGS video data.SOLUTION: An embodiment performs cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulates cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, coding cycles can be readily accessed via individual fragments. The CAFs may be entropy-coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding permits parallel decoding and simultaneous processing of fragments.