Abstract:
데이터가 하나의 디바이스에 의해 제공되는 레이트 및 데이터가 다른 디바이스에 의해 프로세싱되는 레이트는 상이할 수 있다. 예를 들어, 송신 디바이스는 송신 클럭에 따라 데이터를 송신할 수 있는 반면, 송신된 데이터를 수신하는 수신 디바이스는 수신 클럭에 따라 데이터를 프로세싱할 수 있다. 송신 및 수신 클럭들 사이의 타이밍 미스매치가 존재하는 경우, 수신 디바이스는 그것이 데이터를 프로세싱하는 것보다 더 빠르게 또는 더 느리게 데이터를 수신할 수 있다. 이러한 경우, 수신된 데이터의 프로세싱과 관련된 에러들이 존재할 수 있다. 이와 같은 타이밍 미스매치들을 처리하기 위해서, 수신 디바이스는 수신된 데이터로부터 데이터를 삭제하거나, 또는 수신된 데이터에 데이터를 삽입할 수 있다. 이러한 연산들과 관련하여, 수신 디바이스는 삽입 또는 삭제가 결과적인 출력 신호 상에서 미칠 수 있는 임의의 역효과를 완화시키는 방식으로 삽입 지점 또는 삭제 지점에서 또는 그 부근에서 수신된 데이터를 수정할 수 있다.
Abstract:
데이터가 하나의 디바이스에 의해 제공되는 레이트 및 데이터가 다른 디바이스에 의해 프로세싱되는 레이트는 상이할 수 있다. 예를 들어, 송신 디바이스는 송신 클럭에 따라 데이터를 송신할 수 있는 반면, 송신된 데이터를 수신하는 수신 디바이스는 수신 클럭에 따라 데이터를 프로세싱할 수 있다. 송신 및 수신 클럭들 사이의 타이밍 미스매치가 존재하는 경우, 수신 디바이스는 그것이 데이터를 프로세싱하는 것보다 더 빠르게 또는 더 느리게 데이터를 수신할 수 있다. 이러한 경우, 수신된 데이터의 프로세싱과 관련된 에러들이 존재할 수 있다. 이와 같은 타이밍 미스매치들을 처리하기 위해서, 수신 디바이스는 수신된 데이터로부터 데이터를 삭제하거나, 또는 수신된 데이터에 데이터를 삽입할 수 있다. 이러한 연산들과 관련하여, 수신 디바이스는 삽입 또는 삭제가 결과적인 출력 신호 상에서 미칠 수 있는 임의의 역효과를 완화시키는 방식으로 삽입 지점 또는 삭제 지점에서 또는 그 부근에서 수신된 데이터를 수정할 수 있다.
Abstract:
PROBLEM TO BE SOLVED: To efficiently perform transforms such as DCT and IDCT on a large quantity of data of various applications.SOLUTION: A full transform is a transform that implements the complete mathematical description of the transform. A scaled transform is a transform that operates on or provides scaled transform coefficients, which are scaled versions of full transform coefficients. The scaled transform may have lower computational complexity whereas the full transform may be simpler to use by applications. The full and scaled transforms may be for a 2D IDCT, which may be implemented in a separable manner with 1D IDCTs. The full and scaled transforms may also be for a 2D DCT, which may be implemented in a separable manner with ID DCTs. The 1D IDCTs and 1D DCTs may be implemented in a computationally efficient manner.
Abstract:
PROBLEM TO BE SOLVED: To provide a voice recognition method that excellently functions even before and during generation of a speaker dependent (SD) acoustic model to which unsupervised training is applied. SOLUTION: A voice recognition (VR) system is disclosed that utilizes at least one speaker independent (SI) acoustic model 230 or 232 in combination with at least one speaker dependent (SD) acoustic model 234 to provide a level of speech recognition performance that at least equals that of the purely SI acoustic model 230 or 232. The disclosed VR system continually uses unsupervised training to update the acoustic templates in the one or more acoustic models. The VR system then uses the updated SD acoustic model 234 in combination with the at least one SI acoustic model 230 or 232. Consequently, improved VR performance is obtained even during VR testing. COPYRIGHT: (C)2008,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide techniques that can facilitate multimedia telephony.SOLUTION: A method for communication of multimedia data comprises: determining a first level of throughput associated with multimedia data communication from a first access terminal to a network; determining a second level of throughput associated with multimedia data communication from the network to a second access terminal based on feedback from the second access terminal to the first access terminal via the network; determining a budget associated with communication of a video unit of the multimedia data; and coding the video unit of the multimedia data on the basis of the budget and the first and second levels of throughput.
Abstract:
PROBLEM TO BE SOLVED: To provide encoding of an audio video stream that is transmitted over a network, for example, a wireless or IP network such that an entire frame of audio and an entire frame of video are transmitted simultaneously within a period required to render the audio video stream frames by an application in a receiver. SOLUTION: This encoding includes receiving audio and video RTP streams and assigning an entire frame of RTP video data to communication channel packets that occupy the same period, or less, as the video frame rate. An entire frame of RTP audio data is assigned to communication channel packets that occupy the same period, or less, as the audio frame rate. The video and audio communication channel packets are transmitted simultaneously. Receiving and assigning RTP streams can be performed in a remote station, or a base station. COPYRIGHT: (C)2011,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To attain a speech recognition method for successfully functioning even before generation and during the generation of an SD acoustic model depending on a speaker to whom unmanaged learning is applied.SOLUTION: In order to give a speech recognition level at least equal to pure SI acoustic models 230, 232 which do not depend on a speaker, at least one SI acoustic model 230, 232 and at least one SD acoustic model 234 which depends on the speaker are combined and used. Then, acoustic templates in at least one or more acoustic models are updated by continuously performing unmanaged learning. After that, the updated SD acoustic model 234 is combined with at least one SI acoustic model 230, 232. Thus, high speech recognition performance is obtained even while a speech recognition test is performed.
Abstract:
PROBLEM TO BE SOLVED: To provide a method and apparatus for audio signal processing by applying log companding on spectral domain or time domain representations of the audio signals to provide an encoded audio signal.SOLUTION: The encoded audio signal is decoded upon receipt. A frequency domain representation or time domain representation of the audio signal is computed by separating the audio signal into specific frequency bands, each having a coefficient. Log companding with different compression ratios is performed on each coefficient to provide an encoded signal. Upon receipt of the encoded signal, inverse log companding and time frequency or time scale reconstruction are performed to provide the audio signal.
Abstract:
PROBLEM TO BE SOLVED: To provide a voice recognition method which excellently functions before and during generation of a speaker dependent (SD) acoustic model through unsupervised training. SOLUTION: At least one speaker-independent (SI) acoustic model 230 or 232 is used in combination with at least one speaker-dependent (SD) acoustic model 234 to provide a level of voice recognition performance that at least equals that of the purely SI acoustic model 230 or 232. The disclosed system continually uses unsupervised training to update the acoustic templates in the one or more SD acoustic models. The system then uses the updated SD acoustic models 234 in combination with the at least one SI acoustic model 230 or 232 to provide improved voice recognition performance even during voice recognition testing. COPYRIGHT: (C)2008,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide a method, device and system for providing the distributed source coding technique that improves the data coding performance such as video data coding when channel errors or losses occur. SOLUTION: Errors in the reconstruction of the data is eliminated or reduced by sending extra information. Correlation between a predicted sequence and an original sequence can be used to design codebooks and find the cosets required to represent the original image. This information can be sent over another channel, or a secondary channel. COPYRIGHT: (C)2011,JPO&INPIT