-
Publication No.: WO2022150235A1
Publication date: 2022-07-14
Application No.: PCT/US2021/072788
Filing date: 2021-12-07
Applicant: QUALCOMM INCORPORATED
Inventor: THAGADUR SHIVAPPA, Shankar , WESTBURG, Reid , OLIVIERI, Ferdinando
Abstract: A device includes one or more processors configured to, during a call, receive a sequence of audio frames from a first device. The one or more processors are configured to, in response to determining that no audio frame of the sequence has been received for a threshold duration since a last received audio frame of the sequence, initiate transmission of a frame loss indication to the first device. The one or more processors are also configured to, responsive to the frame loss indication, receive a set of audio frames of the sequence and an indication of a second playback speed from the first device. The one or more processors are configured to initiate playback, via a speaker, of the set of audio frames based on the second playback speed. The second playback speed is greater than a first playback speed of a first set of audio frames of the sequence.
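The receiver-side behaviour in this abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the class name `FrameLossMonitor`, the threshold value, and the frame duration are all hypothetical, and the network transport is left out entirely. The sketch only shows the timeout check that would trigger a frame loss indication, and why a second playback speed greater than the first shortens the time needed to play the recovered frames.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FrameLossMonitor:
    """Hypothetical helper: flags a loss when no frame arrives for threshold_s."""
    threshold_s: float                  # assumed threshold duration, in seconds
    last_rx_s: Optional[float] = None   # arrival time of the last received frame
    loss_reported: bool = False         # avoids sending duplicate indications

    def on_frame(self, now_s: float) -> None:
        """Record the arrival of an audio frame and re-arm the monitor."""
        self.last_rx_s = now_s
        self.loss_reported = False

    def should_report_loss(self, now_s: float) -> bool:
        """Return True once per gap when the silent period reaches the threshold."""
        if self.last_rx_s is None or self.loss_reported:
            return False
        if now_s - self.last_rx_s >= self.threshold_s:
            self.loss_reported = True
            return True
        return False


def playback_duration_s(num_frames: int, frame_s: float, speed: float) -> float:
    """Wall-clock time needed to play num_frames at a given playback speed."""
    return num_frames * frame_s / speed


monitor = FrameLossMonitor(threshold_s=0.1)
monitor.on_frame(now_s=0.0)
if monitor.should_report_loss(now_s=0.15):
    print("send frame loss indication")  # stand-in for the real transmission
```

With these assumed numbers, 50 recovered frames of 20 ms each take about 1.0 s at speed 1.0 and about 0.8 s at speed 1.25, which is how faster playback lets the receiver catch up with the live call.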
-
Publication No.: WO2020263855A1
Publication date: 2020-12-30
Application No.: PCT/US2020/039180
Filing date: 2020-06-23
Applicant: QUALCOMM INCORPORATED
IPC: G10L19/008
Abstract: In general, techniques are described for psychoacoustic audio coding of ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream that includes an encoded foreground audio signal and a corresponding spatial component that defines spatial characteristics of the encoded foreground audio signal. The encoded foreground audio signal may include a coded gain and a coded shape. The one or more processors may perform a gain and shape synthesis with respect to the coded gain and the coded shape to obtain a foreground audio signal, and reconstruct, based on the foreground audio signal and the spatial component, the ambisonic audio data.
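As a rough illustration of gain and shape synthesis, the sketch below treats the shape as a unit-norm vector that is scaled by a scalar gain, then forms ambisonic channels as the outer product of the spatial component and the synthesized signal. This is an assumption-laden sketch and not the codec described in the publication: the function names are hypothetical, the gain and shape are assumed to be already dequantized, and a single foreground signal is used.

```python
import math
from typing import List


def gain_shape_synthesis(gain: float, shape: List[float]) -> List[float]:
    """Scale a shape vector, normalized to unit energy, by the decoded gain."""
    norm = math.sqrt(sum(s * s for s in shape))
    if norm == 0.0:
        return [0.0] * len(shape)  # an all-zero shape carries no signal
    return [gain * s / norm for s in shape]


def reconstruct_ambisonics(foreground: List[float],
                           spatial: List[float]) -> List[List[float]]:
    """Outer product: one output channel per spatial-component coefficient."""
    return [[v * x for x in foreground] for v in spatial]


# Assumed example values: gain 2.0, shape (3, 4), first-order spatial component.
fg = gain_shape_synthesis(2.0, [3.0, 4.0])
hoa = reconstruct_ambisonics(fg, [1.0, 0.5, 0.0, -0.5])
```

Separating gain from shape lets the two be quantized independently, which is the usual motivation for gain-shape coding in general.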
-
Publication No.: WO2021102137A1
Publication date: 2021-05-27
Application No.: PCT/US2020/061274
Filing date: 2020-11-19
Applicant: QUALCOMM INCORPORATED
IPC: H04S7/00 , G10L19/008
Abstract: An example device includes a memory configured to store at least one spatial component and at least one audio source within a plurality of audio streams. The device also includes one or more processors coupled to the memory. The one or more processors are configured to receive, from motion sensors, rotation information. The one or more processors are configured to rotate the at least one spatial component based on the rotation information to form at least one rotated spatial component. The one or more processors are also configured to reconstruct ambisonic signals from the at least one rotated spatial component and the at least one audio source, wherein the at least one spatial component describes spatial characteristics associated with the at least one audio source in a spherical harmonic domain representation.
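The idea of rotating the spatial component before reconstruction, rather than rotating the full ambisonic signal, can be sketched for a deliberately narrow case. The example below assumes a first-order spatial component in ACN channel order (W, Y, Z, X) and a yaw-only rotation about the vertical axis. The function names are hypothetical, and whether a given head movement maps to a positive or negative angle depends on a convention the abstract does not specify.

```python
import math
from typing import List


def rotate_yaw_foa(spatial: List[float], yaw_rad: float) -> List[float]:
    """Rotate a first-order spatial component (W, Y, Z, X) about the z axis.

    Assumption: only yaw is handled; W and Z are unaffected by this rotation.
    """
    w, y, z, x = spatial
    c, s = math.cos(yaw_rad), math.sin(yaw_rad)
    return [w, x * s + y * c, z, x * c - y * s]


def reconstruct(spatial: List[float], source: List[float]) -> List[List[float]]:
    """Ambisonic channels as the outer product of spatial component and source."""
    return [[v * sample for sample in source] for v in spatial]


# A source straight ahead (X = 1, Y = 0), rotated by 90 degrees, ends up on Y.
rotated = rotate_yaw_foa([1.0, 0.0, 0.0, 1.0], math.pi / 2)
hoa = reconstruct(rotated, [0.5, -0.5])
```

Rotating the four-element spatial component once is cheaper than rotating every sample of the reconstructed signal, which is one plausible reason for ordering the steps this way.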
-
Publication No.: WO2021102132A1
Publication date: 2021-05-27
Application No.: PCT/US2020/061268
Filing date: 2020-11-19
Applicant: QUALCOMM INCORPORATED
IPC: G10L19/008 , H04S7/00 , G06F3/01 , H04N19/37 , G02B27/01
Abstract: An example device configured to obtain image data includes a memory configured to store one or more priority values, each of the one or more priority values being associated with a type of image object associated with the image data. The device includes one or more processors coupled to the memory, and configured to associate image objects in the image data with one or more audio sources represented in one or more audio streams. The one or more processors are also configured to assign a respective priority value to each of the one or more audio sources represented in the one or more streams and code ambisonic coefficients based on the assigned priority value.
-
Publication No.: EP4289129A1
Publication date: 2023-12-13
Application No.: EP21839806.3
Filing date: 2021-12-09
Applicant: QUALCOMM INCORPORATED
Inventor: OLIVIERI, Ferdinando , WESTBURG, Reid , THAGADUR SHIVAPPA, Shankar
IPC: H04M3/56
-
Publication No.: WO2022169534A1
Publication date: 2022-08-11
Application No.: PCT/US2021/072831
Filing date: 2021-12-09
Applicant: QUALCOMM INCORPORATED
Inventor: OLIVIERI, Ferdinando , WESTBURG, Reid , THAGADUR SHIVAPPA, Shankar
IPC: H04M3/56
Abstract: A device for communication includes one or more processors configured to receive, during an online meeting, a speech audio stream representing speech of a first user. The one or more processors are also configured to receive a text stream representing the speech of the first user. The one or more processors are further configured to selectively generate an output based on the text stream in response to an interruption in the speech audio stream.
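The selective behaviour described here can be reduced to a small decision function. The sketch below is only an assumption about one way to structure it: `select_output` is a hypothetical name, a missing audio frame (`None`) stands in for an interruption in the speech audio stream, and what is done with the text (display or speech synthesis) is left open.

```python
from typing import Optional, Tuple, Union


def select_output(audio_frame: Optional[bytes],
                  text_segment: Optional[str]) -> Tuple[str, Union[bytes, str, None]]:
    """Prefer received speech audio; fall back to the text stream on interruption."""
    if audio_frame is not None:
        return ("audio", audio_frame)
    if text_segment is not None:
        # Audio is interrupted: generate output from the text stream instead.
        return ("text", text_segment)
    return ("none", None)


print(select_output(b"\x00\x01", "hello"))  # audio is available, so it is used
print(select_output(None, "hello"))         # audio interrupted, text is used
```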
-
Publication No.: WO2020263843A1
Publication date: 2020-12-30
Application No.: PCT/US2020/039158
Filing date: 2020-06-23
Applicant: QUALCOMM INCORPORATED
IPC: G10L19/008 , G10L19/24
Abstract: In general, various aspects of the techniques described in this disclosure are directed to performing psychoacoustic audio coding based on operating conditions. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store the encoded scene-based audio data. The one or more processors may be configured to obtain an operating condition of the device for decoding the encoded scene-based audio data and perform, based on the operating condition, psychoacoustic audio decoding with respect to the encoded scene-based audio data to obtain ambisonic transport format audio data. The one or more processors may also be configured to perform spatial audio decoding with respect to the ambisonic transport format audio data to obtain scene-based audio data.
-
Publication No.: WO2020005970A1
Publication date: 2020-01-02
Application No.: PCT/US2019/039025
Filing date: 2019-06-25
Applicant: QUALCOMM INCORPORATED
Inventor: KIM, Moo Young , OLIVIERI, Ferdinando , SEN, Dipanjan
IPC: G10L19/008
Abstract: In general, techniques are described by which to render different portions of audio data using different renderers. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a plurality of audio renderers. The processor(s) may obtain a first audio renderer of the plurality of audio renderers, and apply the first audio renderer with respect to a first portion of the audio data to obtain one or more first speaker feeds. The processor(s) may next obtain a second audio renderer of the plurality of audio renderers, and apply the second audio renderer with respect to a second portion of the audio data to obtain one or more second speaker feeds. The processor(s) may output, to one or more speakers, the one or more first speaker feeds and the one or more second speaker feeds.
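A minimal sketch of per-portion rendering follows, under the assumption that each renderer is a matrix mapping audio channels to speaker feeds and that each portion is a block of samples. The names `apply_renderer` and `render_portions` and the matrix values are hypothetical, and how renderers are selected for each portion is not modelled.

```python
from typing import List

Matrix = List[List[float]]


def apply_renderer(renderer: Matrix, portion: Matrix) -> Matrix:
    """Multiply a (speakers x channels) renderer by a (channels x samples) portion."""
    n_channels = len(portion)
    n_samples = len(portion[0])
    return [
        [sum(row[c] * portion[c][t] for c in range(n_channels))
         for t in range(n_samples)]
        for row in renderer
    ]


def render_portions(renderers: List[Matrix], portions: List[Matrix]) -> List[Matrix]:
    """Apply the i-th renderer to the i-th portion; returns one feed block per portion."""
    return [apply_renderer(r, p) for r, p in zip(renderers, portions)]


# Assumed example: two channels, two speakers, a different renderer per portion.
renderer_a = [[1.0, 0.0], [0.0, 1.0]]    # pass-through
renderer_b = [[0.5, 0.5], [0.5, -0.5]]   # sum / difference mix
first_portion = [[1.0, 2.0], [3.0, 4.0]]
second_portion = [[2.0], [4.0]]
feeds = render_portions([renderer_a, renderer_b], [first_portion, second_portion])
```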
-
Publication No.: EP4073625A1
Publication date: 2022-10-19
Application No.: EP20824377.4
Filing date: 2020-11-17
Applicant: QUALCOMM INCORPORATED
Inventor: FILOS, Jason , OLIVIERI, Ferdinando , PETERS, Nils Gunther
-
Publication No.: EP4062404A1
Publication date: 2022-09-28
Application No.: EP20824822.9
Filing date: 2020-11-19
Applicant: QUALCOMM INCORPORATED
IPC: G10L19/008 , H04S7/00 , G06F3/01 , H04N19/37 , G02B27/01
-