Bandwidth-adaptive quantization method and apparatus
    1.
    发明专利
    Bandwidth-adaptive quantization method and apparatus 有权
    带宽自适应量化方法和装置

    公开(公告)号:JP2011188510A

    公开(公告)日:2011-09-22

    申请号:JP2011094733

    申请日:2011-04-21

    CPC classification number: G10L19/002 G10L19/0208 G10L19/12

    Abstract: PROBLEM TO BE SOLVED: To reduce the coding bit rate of wideband voice signal, without sacrificing the high quality associated with the increased bandwidth.
    SOLUTION: A bandwidth-adaptive quantization method and apparatus is given for determining the type of acoustic signal and the type of frequency spectrum exhibited by the acoustic signal, in order to selectively delete parameter information before vector quantization. The bits that would otherwise be allocated to the deleted parameters can then be reallocated to the quantization of the remaining parameters, which results in an improvement of the perceptual quality of the synthesized acoustic signal. Alternatively, the bits that would have been allocated to the deleted parameters are dropped which results in an overall bit-rate reduction.
    COPYRIGHT: (C)2011,JPO&INPIT

    Abstract translation: 要解决的问题:为了降低宽带语音信号的编码比特率,而不牺牲与增加的带宽相关联的高质量。 解决方案:为了选择性地删除矢量量化之前的参数信息,给出了用于确定声信号的类型和由声信号展现的频谱的类型的带宽自适应量化方法和装置。 否则将分配给删除的参数的位可以被重新分配到剩余参数的量化,这导致合成声信号的感知质量的改善。 或者,将丢弃将被分配给已删除参数的位,这导致整体比特率降低。 版权所有(C)2011,JPO&INPIT

    Real-time capture and generation of stereo images and videos with monoscopic low power mobile device
    2.
    发明专利
    Real-time capture and generation of stereo images and videos with monoscopic low power mobile device 有权
    立体图像和单声道低功率移动设备的实时捕获和生成

    公开(公告)号:JP2012231508A

    公开(公告)日:2012-11-22

    申请号:JP2012140053

    申请日:2012-06-21

    Abstract: PROBLEM TO BE SOLVED: To provide a monoscopic low-power mobile device that employs a low-complexity approach to detect and estimate depth information for real-time capturing and generation of stereo video.SOLUTION: A monoscopic low-power mobile device is capable of creating real-time stereo images and videos from a single captured view. The device uses statistics from an autofocusing process to create a block depth map of a single capture view. Artifacts in the block depth map are reduced and an image depth map is created. Stereo three-dimensional (3D) left and right views are created from the image depth map using a created stereo Z-buffer based 3D surface recover process and a disparity map which is a function of the geometry of binocular vision.

    Abstract translation: 要解决的问题:提供一种采用低复杂度方法来检测和估计深度信息以实时捕获和生成立体视频的单视低功率移动设备。

    解决方案:单视低功耗移动设备能够从单个捕获视图创建实时立体图像和视频。 该设备使用自动对焦过程的统计信息来创建单个捕获视图的块深度图。 减少块深度图中的人工制品,并创建图像深度图。 使用创建的基于立体声Z缓冲器的3D表面恢复过程和作为双目视觉几何的函数的视差图从图像深度图创建立体三维(3D)左视图和右视图。 版权所有(C)2013,JPO&INPIT

    Method and system for generating and encoding line spectrum square root
    3.
    发明专利
    Method and system for generating and encoding line spectrum square root 审中-公开
    用于生成和编码线光谱方法的方法和系统

    公开(公告)号:JP2003050600A

    公开(公告)日:2003-02-21

    申请号:JP2002140337

    申请日:2002-05-15

    CPC classification number: G10L19/07

    Abstract: PROBLEM TO BE SOLVED: To provide a method and system for generating and encoding line spectrum square roots.
    SOLUTION: The method of encoding linear prediction coefficient data is taught. The linear prediction coefficient data is converted to line spectrum cosine data (103). The line spectrum cosine data is used for generating two recursively defined vectors (104). The recursively defined vectors are used to calculate one set of sensitivity self-correlation values (106a-106N) and one set of sensitivity crosscorreations (107a-107N). The line spectrum cosine values are used for calculating one set of the line spectrum square roots.
    COPYRIGHT: (C)2003,JPO

    Abstract translation: 要解决的问题:提供用于生成和编码线谱平方根的方法和系统。 解决方案:教导编码线性预测系数数据的方法。 线性预测系数数据被转换为线谱余弦数据(103)。 线谱余弦数据用于产生两个递归定义的矢量(104)。 递归定义的向量用于计算一组灵敏度自相关值(106a-106N)和一组灵敏度交叉变量(107a-107N)。 线谱余弦值用于计算一组线谱平方根。

    Method and apparatus for subsampling phase spectrum information
    5.
    发明专利
    Method and apparatus for subsampling phase spectrum information 有权
    用于替代相位谱信息的方法和装置

    公开(公告)号:JP2008040509A

    公开(公告)日:2008-02-21

    申请号:JP2007213061

    申请日:2007-08-17

    CPC classification number: G10L19/097 G10L19/02 G10L25/27

    Abstract: PROBLEM TO BE SOLVED: To provide a speech coder for analyzing and reconstructing a prototype of a frame by using intelligent subsampling of phase spectrum information of the prototype. SOLUTION: The speech coder produces phase parameters of a reference prototype, generates phase parameters of a current prototype, and correlates the phase parameters of the current prototype with the phase parameters of the reference prototype in multiple frequency bands to analyze the prototype. To reconstruct the prototype using linear phase shift values, the speech coder produces phase parameters of the reference prototype, generates a set of linear phase shift values associated with the prototype, and composes a phase vector from the phase parameters and the linear phase shift values across multiple frequency bands. COPYRIGHT: (C)2008,JPO&INPIT

    Abstract translation: 要解决的问题:提供一种语音编码器,用于通过使用原型的相位谱信息的智能子采样来分析和重建帧的原型。 解决方案:语音编码器产生参考原型的相位参数,生成当前原型的相位参数,并将当前原型的相位参数与多个频带中的参考原型的相位参数相关联,以分析原型。 为了使用线性相移值重建原型,语音编码器产生参考原型的相位参数,产生与原型相关联的一组线性相移值,并且从相位参数和线性相移值组成相位矢量 多个频带。 版权所有(C)2008,JPO&INPIT

    Methods of performing error concealment for digital video
    6.
    发明专利
    Methods of performing error concealment for digital video 审中-公开
    对数字视频执行错误隐藏的方法

    公开(公告)号:JP2013048447A

    公开(公告)日:2013-03-07

    申请号:JP2012214297

    申请日:2012-09-27

    Abstract: PROBLEM TO BE SOLVED: To provide an error concealment mode determination method for determining which of spatial error concealment (SEC) and temporal error concealment (TEC) should be used.SOLUTION: If a video frame is an intra-frame, a similarity metric is used to determine whether the intra-frame represents a scene change or not, and a complex polynomial is used to determine which of SEC and TEC should be used. A spatial error concealment technique divides a corrupt macroblock into four different regions, namely, a corner region, a column region adjacent to the corner region, a row region adjacent to the corner region, and a remainder region. Those regions are reconstructed in the order above and information from earlier reconstructed regions may be used in later reconstructed regions. An inter-macroblock may be refreshed using spatial error concealment if there has been damage that may cause the inter-block to propagate the errors.

    Abstract translation: 要解决的问题:提供一种用于确定应该使用空间错误隐藏(SEC)和时间错误隐藏(TEC)的错误隐藏模式确定方法。 解决方案:如果视频帧是帧内帧,则使用相似性度量来确定帧内是否表示场景变化,并且使用复数多项式来确定应当使用哪个SEC和TEC 。 空间误差隐藏技术将腐败宏块分为四个不同的区域,即角区域,与拐角区域相邻的列区域,与拐角区域相邻的行区域以及其余区域。 这些区域按照上述顺序重建,并且来自较早重建区域的信息可以用于后面的重建区域。 如果存在可能导致块间传播错误的损坏,则可以使用空间错误隐藏来刷新宏块间宏块。 版权所有(C)2013,JPO&INPIT

    Rendering 3d video images on stereo-enabled display
    7.
    发明专利
    Rendering 3d video images on stereo-enabled display 有权
    在立体声显示器上渲染3D视频图像

    公开(公告)号:JP2012104144A

    公开(公告)日:2012-05-31

    申请号:JP2012004619

    申请日:2012-01-13

    CPC classification number: H04N13/0275 G06T15/10

    Abstract: PROBLEM TO BE SOLVED: To provide a method of rendering 3D video images on a stereo-enabled display (e.g., stereoscopic or autostereoscopic display).SOLUTION: The process includes removing facets facing away from a viewer, defining foreground facets for left and right views and common background facets, determining lighting for these facets, and performing screen mapping and scene rendering for one view (e.g., the right view) using computational results for facets of the other view (i.e., the left view). In one embodiment, visualization of images is provided on the stereo-enabled display of a low-power device, such as a mobile phone, a computer, a video game platform, or a personal digital assistant (PDA) device.

    Abstract translation: 要解决的问题:提供一种在支持立体声的显示器(例如,立体显示或自动立体显示)上渲染3D视频图像的方法。

    解决方案:该过程包括移除面向观察者的面,定义左右视图和常见背景面的前景面,确定这些面的照明,以及执行一个视图的屏幕映射和场景渲染(例如,右侧 视图)使用另一视图(即左视图)的方面的计算结果。 在一个实施例中,在诸如移动电话,计算机,视频游戏平台或个人数字助理(PDA)设备的低功率设备的支持立体声的显示器上提供图像的可视化。 版权所有(C)2012,JPO&INPIT

    Variable rate speech coding
    8.
    发明专利
    Variable rate speech coding 审中-公开
    可变速率语音编码

    公开(公告)号:JP2011123506A

    公开(公告)日:2011-06-23

    申请号:JP2011002269

    申请日:2011-01-07

    CPC classification number: G10L19/24 G10L19/20 G10L2025/783 G10L2025/935

    Abstract: PROBLEM TO BE SOLVED: To attain a low bit rate in a method and apparatus for variable rate coding of a speech signal. SOLUTION: An input speech signal is classified and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by only employing high fidelity modes during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. The input speech signal is classified into active and inactive regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode. The apparatus dynamically switches between these modes as the properties of the speech signal vary with time. COPYRIGHT: (C)2011,JPO&INPIT

    Abstract translation: 要解决的问题:为了在语音信号的可变速率编码的方法和装置中获得低比特率。

    解决方案:输入语音信号被分类,并且基于该分类选择适当的编码模式。 对于每个分类,选择实现具有可接受的语音再现质量的最低比特率的编码模式。 低平均比特率通过在语音的部分期间仅采用高保真模式来实现,其中该可靠输出需要该保真度。 在这些模式产生可接受输出的语音部分期间使用较低比特率模式。 输入语音信号分为主动和非活动区域。 取决于所需的保真级别,各种编码模式被应用于活动语音。 可以根据每个特定模式的优点和缺点来利用编码模式。 当语音信号的属性随时间变化时,该装置动态地在这些模式之间切换。 版权所有(C)2011,JPO&INPIT

    Variable rate speech coding
    9.
    发明专利
    Variable rate speech coding 有权
    可变速率语音编码

    公开(公告)号:JP2013178545A

    公开(公告)日:2013-09-09

    申请号:JP2013087419

    申请日:2013-04-18

    CPC classification number: G10L19/24 G10L19/20 G10L2025/783 G10L2025/935

    Abstract: PROBLEM TO BE SOLVED: To perform variable bit rate coding of speech signals.SOLUTION: An input speech signal is classified and a coding mode for achieving the lowest bit rate with allowable speech reproduction quality is selected on the basis of the classification. Low average bit rates are achieved by only employing a high fidelity mode (that is, a high bit rate widely applicable to different types of speech) during a period of portions of the speech where it is required for acceptable output. The input speech signal is classified into active and inactive regions. The active region is further classified into a voiced region, an unvoiced region, and a transient region. Various coding modes are applied according to the required level of fidelity. The modes are dynamically switched among the modes as properties of speech signals vary with time. And where appropriate, regions of the speech are modeled as pseudo random noise, resulting in a significantly low bit rate.

    Abstract translation: 要解决的问题:执行语音信号的可变比特率编码。解码:输入语音信号被分类,并且基于分类来选择用于实现具有允许语音再现质量的最低比特率的编码模式。 在需要可接受的输出的语音部分的期间,通过仅采用高保真模式(即广泛适用于不同类型的语音的高比特率)来实现低平均比特率。 输入语音信号分为主动和非活动区域。 有源区进一步分为有声区,无声区和瞬态区。 根据所需的保真级别应用各种编码模式。 这些模式在模式之间动态切换,因为语音信号的属性随时间而变化。 并且在适当的地方,语音区域被建模为伪随机噪声,导致显着低的比特率。

    Video coding with fine granularity scalability using cycle-aligned fragments
    10.
    发明专利
    Video coding with fine granularity scalability using cycle-aligned fragments 审中-公开
    使用周期对齐的片段,具有精细粒度可扩展性的视频编码

    公开(公告)号:JP2013081226A

    公开(公告)日:2013-05-02

    申请号:JP2012268191

    申请日:2012-12-07

    CPC classification number: H04N19/34

    Abstract: PROBLEM TO BE SOLVED: To use cycle-aligned fragments (CAFs) to facilitate access to encoded FGS video data.SOLUTION: An embodiment performs cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulates cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, coding cycles can be readily accessed via individual fragments. The CAFs may be entropy-coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding permits parallel decoding and simultaneous processing of fragments.

    Abstract translation: 要解决的问题:使用循环对齐的片段(CAF)以便于访问编码的FGS视频数据。 解决方案:一个实施例执行FGS视频数据块系数和语法元素的基于周期的编码,并且将循环封装成用于传输的片段。 片段可以是循环对齐的,使得每个片段的有效载荷的开始基本上与循环中的一个的开始重合。 以这种方式,编码周期可以通过各个片段容易地访问。 CAF可以彼此独立地进行熵编码,使得每个片段可以容易地被访问和解码,而不等待其他片段的解码。 独立熵编码允许并行解码和片段的同时处理。 版权所有(C)2013,JPO&INPIT

Patent Agency Ranking