-
Publication No.: GB2635333A
Publication date: 2025-05-14
Application No.: GB202317056
Application date: 2023-11-07
Applicant: NOKIA TECHNOLOGIES OY
IPC: G06T19/00 , G02B27/01 , G06V10/764 , H04S7/00
Abstract: An apparatus 10 comprises control means 12 for controlling the presentation of content by a rendering device 20 to a user 2. The control means provides spatially-dependent variation of transfer of real-world content by the rendering device to the user. The control means also provides spatially-dependent rendering of virtual content to the user. The content may be visual and/or audio content. The visibility of real-world visual content (captured by one or more cameras) and virtual content (provided by the rendering device) seen by the viewer is dependent on its position in the display. The spatially-dependent provision of real-world content and virtual content can facilitate gradual transitions between the two presented contents or realities, for example by fading in selected regions of a real-world visual/audio scene whilst fading out virtual content, wherein the rate of fade may be varied for different portions of a scene. The apparatus may be implemented in a head-mounted display (HMD) wherein the transparency of the visual/audio presentation is controllable to provide a spatially-dependent variation of transparency to the external real-world ambient scene/audio.
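The spatially-dependent fade described in this abstract can be illustrated with a minimal sketch. It assumes a linear fade and a per-position fade rate; the function name `blend_frame`, the grid representation and the parameters are hypothetical and do not come from the patent.

```python
def blend_frame(real, virtual, fade_rates, t):
    """Cross-fade two equally sized 2-D grids of intensities.

    real, virtual: lists of rows of floats (real-world and virtual content).
    fade_rates:    per-position fade rate in 1/seconds (assumed linear fade).
    t:             seconds elapsed since the transition started.
    """
    out = []
    for real_row, virt_row, rate_row in zip(real, virtual, fade_rates):
        row = []
        for r, v, rate in zip(real_row, virt_row, rate_row):
            # alpha is the share of real-world content at this position,
            # clamped to [0, 1]; the virtual content gets the remainder.
            alpha = min(1.0, max(0.0, rate * t))
            row.append(alpha * r + (1.0 - alpha) * v)
        out.append(row)
    return out
```

Positions with a higher rate reach the real-world content sooner, so different portions of the scene fade at different speeds.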
-
Publication No.: GB2635332A
Publication date: 2025-05-14
Application No.: GB202317042
Application date: 2023-11-07
Applicant: NOKIA TECHNOLOGIES OY
Abstract: An apparatus comprises means for determining a change in head direction of a user within a first predefined time period and determining a first predicted head direction based on the change in head direction. A change in torso direction of the user within a second predefined time period is determined. The first predicted head direction is modified to a second predicted head direction based on the change in torso direction. The first predicted head direction may be modified towards the direction of the change in direction of the torso when determining the second predicted head direction. Resources (bitrate or video resolution) for video transmission via a network can be based upon the second predicted head direction, with more resources being allocated to parts of the video corresponding to the second predicted head direction. The apparatus allows a video presentation apparatus to customise video content based upon predicted head direction. The video may be an immersive video, a spherical video, a virtual reality video, a 360 degree video, a 180 degree video, etc. Inertial measurement units may determine the changes of direction of the head and torso. A method and a computer program related to the apparatus are also disclosed.
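As a rough illustration of the two-stage prediction, the sketch below works on yaw angles only. It assumes linear extrapolation for the first prediction and a weighted shift towards the torso rotation for the second; the abstract does not specify either formula, and `predict_head_yaw`, `torso_weight` and `horizon` are hypothetical names.

```python
def wrap_deg(angle):
    """Wrap an angle in degrees to the range [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0


def predict_head_yaw(head_prev, head_now, dt_head,
                     torso_prev, torso_now, dt_torso,
                     horizon, torso_weight=0.5):
    """Return (first, second) predicted head yaw in degrees.

    first:  head yaw extrapolated linearly over `horizon` seconds
            (an assumed prediction model).
    second: first prediction shifted towards the direction in which the
            torso is turning, scaled by the assumed `torso_weight`.
    """
    head_rate = wrap_deg(head_now - head_prev) / dt_head
    first = wrap_deg(head_now + head_rate * horizon)

    torso_rate = wrap_deg(torso_now - torso_prev) / dt_torso
    second = wrap_deg(first + torso_weight * torso_rate * horizon)
    return first, second
```

A streaming client could then allocate more bitrate or resolution to the video region around the second value.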
-
Publication No.: GB2634721A
Publication date: 2025-04-23
Application No.: GB202315771
Application date: 2023-10-16
Applicant: NOKIA TECHNOLOGIES OY
IPC: G06V40/20 , G10K11/178
Abstract: An apparatus recognises a sound-producing gesture of a user 111 (110, Fig.1; Fig.2) and provides audio feedback to the user based thereon 32. An optical sensor (camera, LIDAR) may be used to sense user movements and a trained machine learning algorithm may classify the user’s movement. The audio feedback may comprise temporarily modifying an active noise cancellation (ANC) 34, particularly: reducing audio cancellation for higher frequencies 132; reducing attenuation of captured ambient audio 134; switching off audio cancellation 136; or providing spatially selective pass-through of an ambient audio source 138. The user may select the temporary modification. Audio feedback to the user may comprise rendering a virtual sound to the user when the gesture noise is below a loudness threshold (164, Fig.5). The virtual sound may be selected from a database (162, Fig.5) in dependence upon the classification. The audio feedback may be spatially aligned to the sound-producing user gesture (161, Fig.6). Ambient noise from the ANC (32, Figs.7&8) may be processed locally (19, Fig.7) or remotely (319, Fig.8). The apparatus may be earphones or another head-worn apparatus such as a head-mounted display (Fig.9).
-
Publication No.: GB2631410A
Publication date: 2025-01-08
Application No.: GB202309834
Application date: 2023-06-29
Applicant: NOKIA TECHNOLOGIES OY
Inventor: MIIKKA TAPANI VILERMO , MIKKO OLAVI HEIKKINEN , ARTO JUHANI LEHTINIEMI
IPC: H04L1/00 , G10L19/005 , G10L19/02
Abstract: The invention provides determining a loss of a packet comprising first and second audio data. As the audio data are transported in the same packet for contemporaneous rendering, they suffer the same transportation loss. The effect of the loss is remedied by generating substitution audio data differently for the first audio data and second audio data. The generating is configured to conceal (obscure) the packet loss by producing audio data having characteristics as expected which creates perceptual similarity. The first generation process can, for example, use silence or simulated noise to mimic speech, without attempting to reproduce previous speech. The second generation process can, for example, mimic background noise and reproduce long-term characteristics of previous background noise. In examples, the packet further comprises metadata for positioning the audio source in three dimensions. The invention may provide packet loss concealment for Immersive Voice and Audio Services (IVAS) calls, for example, in mobile radio telecommunication devices or in Metadata-assisted spatial audio (MASA) format optimized for direct mobile device use.
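The idea of concealing one lost packet with two different substitution processes can be sketched as follows. The sketch assumes silence for the first (speech) stream and Gaussian noise scaled to the average RMS level of earlier background frames for the second stream; `conceal_lost_packet`, the frame representation and the RMS averaging are illustrative assumptions, not the method claimed in the patent or used by IVAS.

```python
import math
import random


def rms(frame):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def conceal_lost_packet(frame_len, background_history, seed=0):
    """Generate substitution audio for both streams of a lost packet.

    Returns (speech_sub, background_sub):
      speech_sub:     silence, with no attempt to reproduce earlier speech.
      background_sub: noise whose level matches the long-term level of the
                      previously received background frames.
    """
    speech_sub = [0.0] * frame_len

    levels = [rms(f) for f in background_history]
    target = sum(levels) / len(levels) if levels else 0.0

    rng = random.Random(seed)  # seeded so the sketch is reproducible
    noise = [rng.gauss(0.0, 1.0) for _ in range(frame_len)]
    noise_level = rms(noise)
    scale = target / noise_level if noise_level > 0 else 0.0
    background_sub = [scale * x for x in noise]
    return speech_sub, background_sub
```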
-
Publication No.: GB2608823A
Publication date: 2023-01-18
Application No.: GB202110058
Application date: 2021-07-13
Applicant: NOKIA TECHNOLOGIES OY
Inventor: MIIKKA TAPANI VILERMO , HANNU JUHANI PULAKKA , ROOPE OLAVI JARVINEN , JORMA JUHANI MÄKINEN
Abstract: An apparatus, method or program for determining, in an audio signal, if sound energy in a first direction is higher, lower, or changing relative to sound energy in another direction by a threshold amount, and controlling the amount of headroom of said signal based on said difference in order to enable an audio zooming effect. The different directions may be achieved using directional microphones or through applying a beamforming method to the audio signal. The amount of headroom may be controlled using an automatic gain control or through applying compression to the signal. The audio zoom may be created by amplifying sound signals inside of the area of interest or through attenuating sound signals outside of the area of interest.
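One way to picture the headroom decision is the sketch below. It assumes that extra headroom is reserved when the energy in the focus direction exceeds the energy in the other direction by more than a threshold; the abstract does not state the mapping, so this rule, the decibel values and the name `choose_headroom_db` are all hypothetical.

```python
import math


def energy_db(frame):
    """Mean energy of a frame in decibels (negative infinity for silence)."""
    energy = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(energy) if energy > 0 else float("-inf")


def choose_headroom_db(focus_frame, other_frame,
                       threshold_db=6.0, base_db=3.0, extra_db=6.0):
    """Pick a headroom target from the directional energy difference.

    Assumed rule: if the focus direction is louder than the other
    direction by more than `threshold_db`, reserve `extra_db` of
    additional headroom so that zoom amplification has room to work.
    """
    diff = energy_db(focus_frame) - energy_db(other_frame)
    if diff > threshold_db:
        return base_db + extra_db
    return base_db
```

The returned value could serve as the target for an automatic gain control or compressor stage.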
-
Publication No.: GB2606176A
Publication date: 2022-11-02
Application No.: GB202106043
Application date: 2021-04-28
Applicant: NOKIA TECHNOLOGIES OY
Inventor: MIIKKA TAPANI VILERMO , HANNU PULAKKA , TONI HENRIK MÄKINEN
IPC: G01S3/808 , H04S7/00 , G10K11/178
Abstract: Examples of the disclosure relate to apparatus, methods and computer programs for controlling amplification and/or attenuation of sound sources based on their position relative to an electronic device. The apparatus can comprise means for obtaining two or more audio signals from a plurality of microphones of an electronic device and determining loudness of one or more sound sources based on the two or more audio signals so as to determine the loudest sound source. The apparatus also comprises means for determining whether the loudest sound source is within a region of interest based on the two or more audio signals and controlling audibility of the one or more sound sources in accordance with whether the loudest sound source is within a region of interest. The audibility of the one or more sound sources is controlled so that if the loudest sound source is not within the region of interest the loudest sound source is de-emphasized relative to one or more other sound sources within the region of interest.
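The decision logic in this abstract can be sketched in a few lines. The sketch assumes that source positions and loudness values have already been estimated from the microphone signals and that the region of interest is an azimuth range; `control_audibility`, the data layout and the attenuation factor are hypothetical.

```python
def control_audibility(sources, roi, attenuation=0.25):
    """De-emphasize the loudest source when it lies outside the region of interest.

    sources:     dict mapping a source name to (azimuth_deg, loudness).
    roi:         (min_azimuth_deg, max_azimuth_deg), assumed not to wrap.
    attenuation: gain applied to a loudest source outside the region.

    Returns (loudest_name, gains), where gains maps each name to a gain.
    """
    loudest = max(sources, key=lambda name: sources[name][1])
    low, high = roi
    in_roi = low <= sources[loudest][0] <= high

    gains = {name: 1.0 for name in sources}
    if not in_roi:
        gains[loudest] = attenuation
    return loudest, gains
```

For example, with a talker at 0 degrees and a louder noise source at 120 degrees, a region of interest of -30 to 30 degrees leaves the talker untouched and attenuates the noise source.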
-
Publication No.: GB2563635A
Publication date: 2018-12-26
Application No.: GB201709909
Application date: 2017-06-21
Applicant: NOKIA TECHNOLOGIES OY
Abstract: A method of receiving a plurality of input signals representing a sound space. The input signals 61 are used to obtain spatial metadata 66 corresponding to the sound space and to obtain a first spatial audio signal corresponding to the spatial metadata. The spatial metadata and first signal are associated. The spatial metadata is used to enable the first spatial audio signal 65 to be processed to obtain a second spatial audio signal 69. The first and second signals may be binaural signals. The second spatial audio signal may be obtained after it is detected that the sound scene to be rendered has changed, for instance due to head orientation 67. Alternatively, a method of receiving a first spatial audio signal and spatial metadata and enabling either a first or second rendering mode. In the first rendering mode, the first spatial audio signal is rendered to a user. In the second rendering mode, the spatial metadata is used to process the first spatial audio signal to obtain a second spatial audio signal, which is rendered to the user.
-
Publication No.: GB2549922A
Publication date: 2017-11-08
Application No.: GB201601489
Application date: 2016-01-27
Applicant: NOKIA TECHNOLOGIES OY
Inventor: TONI HENRIK MAKINEN , MIKKO TAMMI , MIIKKA TAPANI VILERMO
IPC: G10L21/028 , G10L25/18 , H04R3/00
Abstract: An audio signal from a microphone array 41, 43, 45 (on e.g. a phone) is Fourier transformed and beamformed 52, and the size of the beamforming signal is reduced by calculating a mean value for each of B frequency bands before being encoded into a bitstream 57 (e.g. as metadata), allowing a 3D spatial audio scene (fig. 8) to be rendered upon decoding (fig. 5B). Lower frequency bands may be narrower than those of higher frequencies, and a user may select a focus position for audio output via e.g. a touch-screen display (fig. 8).
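The size reduction by per-band means can be shown with a short sketch. It assumes the beamforming output is available as one value per frequency bin and that band boundaries are given as bin indices; `band_means` and the example edges are hypothetical.

```python
def band_means(bin_values, band_edges):
    """Reduce per-bin values to one mean value per frequency band.

    bin_values: one value per frequency bin of the beamformed signal.
    band_edges: increasing bin indices; band i covers
                bin_values[band_edges[i]:band_edges[i + 1]].
    """
    return [
        sum(bin_values[start:stop]) / (stop - start)
        for start, stop in zip(band_edges, band_edges[1:])
    ]


# Eight bins reduced to B = 4 bands; the lower bands are narrower.
values = [1.0, 3.0, 2.0, 4.0, 6.0, 8.0, 10.0, 12.0]
edges = [0, 1, 2, 4, 8]
reduced = band_means(values, edges)  # [1.0, 3.0, 3.0, 9.0]
```

Only the B reduced values would then need to be carried in the bitstream instead of one value per bin.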
-
Publication No.: GB2549776A
Publication date: 2017-11-01
Application No.: GB201607458
Application date: 2016-04-29
Applicant: NOKIA TECHNOLOGIES OY
Inventor: MIIKKA TAPANI VILERMO , KORAY OZCAN , TONI HENRIK MAKINEN
IPC: G10L21/0208 , A45C11/00 , H04M1/725
Abstract: An external cover device 200 for e.g. a phone contains microphones 212, 214, an audio processor 216 for e.g. beamforming, and a conductive docking terminal 210 allowing connection to a primary terminal 100 containing its own microphones 105, 106. A controller in the terminal 106 processes the signals from both sets of microphones (e.g. forward facing in the terminal and rear facing in the cover) according to a cover ID 218, having determined that the cover is attached. The processing may reduce distortion detected in the signals and apply noise cancellation.
-
Publication No.: GB2548614A
Publication date: 2017-09-27
Application No.: GB201605009
Application date: 2016-03-24
Applicant: NOKIA TECHNOLOGIES OY
Inventor: MIIKKA TAPANI VILERMO , TONI HENRIK MAKINEN , LASSE LAAKSONEN , ANSSI SAKARI RAMO
IPC: G10L19/008 , G10L21/0208
Abstract: A spatial audio system receives audio signals from a mic array 23 and divides them into a direct component 45 and an ambient component 46, applies separate noise reduction 47, 48, and then provides the noise-reduced signals to a speaker for rendering 27. Cameras may allow identification of direct components via Image Feature Detection (91, fig. 9) and a third, directional component 50 may be further separated.