-
公开(公告)号:US11282169B2
公开(公告)日:2022-03-22
申请号:US16958698
申请日:2019-01-24
Applicant: Intel Corporation
Inventor: Wayne Cochran , Fai Yeung , Durga Raj Mathur , Gilson Goncalves De Lima , Patrick Youngung Shon , John A. Harrison , Ok Joon Kim , Harleen Gill , Kyle Siehl , Uma Jayaram , Sankar Jayaram , Archie Sharma , Gockcen Clingir , Stanley Baran , Mayuresh Varerkar , Barnan Das , Narayan Biswal , Nilesh Shah , Ritesh Kale , Greg Weinstein
Abstract: An apparatus, system, and method are described for providing real-time capture, processing, and distribution of panoramic virtual reality (VR) content. One embodiment of a graphics processor comprises a video interface to receive a plurality of images from a corresponding plurality of cameras; an image rectifier to perform a perspective re-projection of at least some of the images to a common image plane to generate a rectified plurality of images; a stitcher to analyze overlapping regions of adjacent images in the rectified images and to identify corresponding pixels in the overlapping regions and to stitch the adjacent images in accordance with the corresponding pixels to generate a panoramic image comprising a stitched combination of the rectified plurality of images; and a cylindrical projector to project the panoramic image onto a cylindrical surface to generate a final panoramic video image to be used to implement a VR environment on a VR apparatus.
-
公开(公告)号:US20220036903A1
公开(公告)日:2022-02-03
申请号:US17327379
申请日:2021-05-21
Applicant: Intel Corporation
Inventor: Gokcen Cilingir , Narayan Biswal
IPC: G10L17/04 , G10L17/12 , G10L17/20 , G10L21/0208 , G10L17/06
Abstract: Techniques are provided for reverberation compensation for far-field speaker recognition. A methodology implementing the techniques according to an embodiment includes receiving an authentication audio signal associated with speech of a user and extracting features from the authentication audio signal. The method also includes scoring results of application of one or more speaker models to the extracted features. Each of the speaker models is trained based on a training audio signal processed by a reverberation simulator to simulate selected far-field environmental effects to be associated with that speaker model. The method further includes selecting one of the speaker models, based on the score, and mapping the selected speaker model to a known speaker identification or label that is associated with the user.
-
83.
公开(公告)号:US11178373B2
公开(公告)日:2021-11-16
申请号:US16050322
申请日:2018-07-31
Applicant: Intel Corporation
Inventor: Mayuresh Varerkar , Stanley Baran , Michael Apodaca , Prasoonkumar Surti , Atsuo Kuwahara , Narayan Biswal , Jill Boyce , Yi-Jen Chiu , Gokcen Cilingir , Barnan Das , Atul Divekar , Srikanth Potluri , Nilesh Shah , Archie Sharma
IPC: H04H60/33 , H04N13/111 , H04N19/597 , G06F9/38 , G06F3/01 , G06N20/00
Abstract: A mechanism is described for facilitating adaptive resolution and viewpoint-prediction for immersive media in computing environments. An apparatus of embodiments, as described herein, includes one or more processors to receive viewing positions associated with a user with respect to a display, and analyze relevance of media contents based on the viewing positions, where the media content includes immersive videos of scenes captured by one or more cameras. The one or more processors are further to predict portions of the media contents as relevant portions based on the viewing positions and transmit the relevant portions to be rendered and displayed.
-
公开(公告)号:US20210349527A1
公开(公告)日:2021-11-11
申请号:US17161014
申请日:2021-01-28
Applicant: Intel Corporation
Inventor: Robert J. Johnston , Satyanarayana Avadhanam , Changliang Wang , Narayan Biswal , Archie Sharma , Richmond Hicks , Joydeep Ray , Abhishek R. Appu , Stanley J. Baran , Sang-Hee Lee , Atthar H. Mohammed , Jong Dae Oh , Hiu-Fai Chan , Sumit Mohan , Jill M. Boyce , Yi-Jen Chiu
Abstract: Systems, apparatuses and methods may provide for technology to improve user experience when viewing simulated 3D objects on a display. Head and upper-body movements may be tracked and recognized as gestures to alter the displayed viewing angle. The technology provides for a very natural way to look around, under, or over objects.
-
公开(公告)号:US11017781B2
公开(公告)日:2021-05-25
申请号:US16153756
申请日:2018-10-06
Applicant: INTEL CORPORATION
Inventor: Gokcen Cilingir , Narayan Biswal
Abstract: Techniques are provided for reverberation compensation for far-field speaker recognition. A methodology implementing the techniques according to an embodiment includes receiving an authentication audio signal associated with speech of a user and extracting features from the authentication audio signal. The method also includes scoring results of application of one or more speaker models to the extracted features. Each of the speaker models is trained based on a training audio signal processed by a reverberation simulator to simulate selected far-field environmental effects to be associated with that speaker model. The method further includes selecting one of the speaker models, based on the score, and mapping the selected speaker model to a known speaker identification or label that is associated with the user.
-
公开(公告)号:US20200244882A1
公开(公告)日:2020-07-30
申请号:US16847053
申请日:2020-04-13
Applicant: Intel Corporation
Inventor: Richmond Hicks , Changliang Wang , Satyanarayana Avadhanam , Robert J. Johnston , Narayan Biswal
IPC: H04N5/232 , H04N13/271 , H04N13/25 , H04N13/383 , H04N13/344 , H04N21/218 , H04N19/132 , H04N21/2343 , H04N21/4223 , H04N19/179 , H04N21/4728 , H04N19/17 , H04N21/81 , H04N21/442 , H04N21/4402 , H04N19/167 , H04N13/243 , G02B27/01 , G06F3/01 , H04N5/225 , G03B37/04 , G03B17/56
Abstract: Systems and methods may provide for capturing 360 degree video, and multi-resolution encoding, processing and displaying of the video based on a field of view (FOV) and region of interest (ROI) for a viewer. The ROI may be determined based on eye tracking information (ETI) and the video may be encoded for viewports within the FOV at a high resolution and for other viewports outside the FOV at a lower resolution. ROI in the video may be encoded at a high resolution and areas outside of the ROI may be encoded at a lower resolution. The ETI enables the selective display of one or more warnings based on the gaze of a user to improve the efficiency of the warning. 3D glasses having variable lens may be used to adjust the focal distance of a virtual display to match a virtual distance of an object based on stereo distance cues.
-
公开(公告)号:US10574995B2
公开(公告)日:2020-02-25
申请号:US15483757
申请日:2017-04-10
Applicant: Intel Corporation
Inventor: Atthar H. Mohammed , Abhishek R. Appu , Stanley J. Baran , Sang-Hee Lee , Jong Dae Oh , Hiu-Fai R. Chan , Joydeep Ray , Narayan Biswal , Richmond Hicks , Arthur J. Runyan , Nausheen Ansari
IPC: H04N19/174 , H04N19/142
Abstract: Systems, apparatuses and methods may include a source device that generates a scene change notification in response to a movement of a camera, modifies an encoding scheme associated with the video content captured by the camera in response to the scene change notification, identifies a full-frame difference threshold, wherein scene analysis information includes frame difference data, and compares the frame difference data to an intermediate threshold that is less than the full-frame difference threshold, wherein the scene change notification is generated when the frame difference data exceeds the intermediate threshold. A sink device may obtain transport quality data associated with video content, modify an output parameter of a display based on the transport quality data, determine a view perspective of a still image containing a plurality of image slices, retrieve only a subset of the plurality of image slices based on the view perspective and decode the retrieved subset.
-
公开(公告)号:US10565964B2
公开(公告)日:2020-02-18
申请号:US15494935
申请日:2017-04-24
Applicant: Intel Corporation
Inventor: Richmond Hicks , Arthur J. Runyan , Nausheen Ansari , Narayan Biswal
Abstract: A system for reducing bandwidth and/or reducing power consumed by a display may comprise a display having a background plane and a region of interest plane that may be identified by a gaze tracker. The region of interest may be of a higher quality picture. In some embodiments, the display may be a large panel display and in others a head mounted display (HMD).
-
公开(公告)号:US20190279645A1
公开(公告)日:2019-09-12
申请号:US16153756
申请日:2018-10-06
Applicant: INTEL CORPORATION
Inventor: Gokcen Cilingir , Narayan Biswal
IPC: G10L17/04 , G10L17/06 , G10L17/20 , G10L17/12 , G10L21/0208
Abstract: Techniques are provided for reverberation compensation for far-field speaker recognition. A methodology implementing the techniques according to an embodiment includes receiving an authentication audio signal associated with speech of a user and extracting features from the authentication audio signal. The method also includes scoring results of application of one or more speaker models to the extracted features. Each of the speaker models is trained based on a training audio signal processed by a reverberation simulator to simulate selected far-field environmental effects to be associated with that speaker model. The method further includes selecting one of the speaker models, based on the score, and mapping the selected speaker model to a known speaker identification or label that is associated with the user.
-
公开(公告)号:US10339935B2
公开(公告)日:2019-07-02
申请号:US15626828
申请日:2017-06-19
Applicant: INTEL CORPORATION
Inventor: Gokcen Cilingir , Jonathan J. Huang , Narayan Biswal , Mandar S. Joshi
Abstract: Techniques are provided for training of a text independent (TI) speaker recognition (SR) model. A methodology implementing the techniques according to an embodiment includes measuring context data associated with collected TI speech utterances from a user and identifying the user based on received identity measurements. The method further includes performing a speech quality analysis and a speaker state analysis based on the utterances, and evaluating a training merit value of the utterances, based on the speech quality analysis and the speaker state analysis. If the training merit value exceeds a threshold value, the utterances are stored as training data in a training database. The database is indexed by the user identity and the context data. The method further includes determining whether the stored training data has achieved a sufficiency level for enrollment of a TI SR model, and training the TI SR model for the identified user and context.
-
-
-
-
-
-
-
-
-