Invention Grant
- Patent Title: Translating sound events to speech and AR content
-
Application No.: US16732847Application Date: 2020-01-02
-
Publication No.: US11055533B1Publication Date: 2021-07-06
- Inventor: Willie L Scott, II , Seema Nagar , Charu Pandhi , Kuntal Dey
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Patterson + Sheridan, LLP
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G10L15/22 ; G10L13/027 ; G06N20/00 ; G06T7/13 ; G06T7/73 ; G06F3/01 ; G10L15/26

Abstract:
Embodiments herein provide an augmented reality (AR) system that uses sound localization to identify sounds that may be of interest to a user and generates an audio description of the source of the sound as well as AR content that can be magnified and displayed to the user. In one embodiment, an AR device captures images that have the source of the sound within their field of view. Using machine learning (ML) techniques, the AR device can identify the object creating the sound (i.e., the sound source). A description of the sound source and its actions can outputted to the user. In parallel, the AR device can also generate AR content for the sound source. For example, the AR device can magnify the sound source to a size that is viewable to the user and create AR content that is then superimposed onto a display.
Public/Granted literature
- US20210209365A1 TRANSLATING SOUND EVENTS TO SPEECH AND AR CONTENT Public/Granted day:2021-07-08
Information query