Invention Grant
- Patent Title: System and method for disambiguating a source of sound based on detected lip movement
- Application No.: US16277032
- Application Date: 2019-02-15
- Publication No.: US11200902B2
- Publication Date: 2021-12-14
- Inventor: Nishant Shukla, Ashwin Dharne
- Applicant: DMAI, Inc.
- Applicant Address: US CA Los Angeles
- Assignee: DMAI, Inc.
- Current Assignee: DMAI, Inc.
- Current Assignee Address: US CA Los Angeles
- Agency: Venable LLP
- Main IPC: G10L15/25
- IPC: G10L15/25; G10L15/22

Abstract:
The present teaching relates to a method, system, medium, and implementations for detecting a source of speech sound in a dialogue. A visual signal acquired from a dialogue scene is first received, where the visual signal captures a person present in the dialogue scene. A human lip associated with the person is detected from the visual signal and tracked to determine whether lip movement is observed. If lip movement is detected, a first candidate source of sound is generated corresponding to the area in the dialogue scene where the lip movement occurred.
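
The flow described in the abstract (detect a lip, track it, test for movement, emit a candidate area) can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how lips are detected or how movement is measured. The sketch assumes a hypothetical upstream tracker that already supplies, for each frame, a lip bounding box and a normalized "opening" value, and it assumes movement can be approximated by how much that opening varies over the track. The names `LipObservation`, `CandidateSource`, `lip_movement_detected`, `candidate_from_track`, and the `threshold` value are all illustrative.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in image pixels


@dataclass
class LipObservation:
    """One tracked lip measurement for a single frame (hypothetical tracker output)."""
    box: Box        # where the lip was found in the frame
    opening: float  # assumed measure, e.g. lip gap height divided by lip width


@dataclass
class CandidateSource:
    """A candidate source of sound: the scene area where lip movement occurred."""
    area: Box


def lip_movement_detected(track: List[LipObservation], threshold: float = 0.05) -> bool:
    """Assumed criterion: the lip opening varies by more than `threshold` over the track."""
    if len(track) < 2:
        return False
    openings = [obs.opening for obs in track]
    return max(openings) - min(openings) > threshold


def candidate_from_track(track: List[LipObservation],
                         threshold: float = 0.05) -> Optional[CandidateSource]:
    """Return a candidate source covering the tracked lip area, or None if no movement."""
    if not lip_movement_detected(track, threshold):
        return None
    # Use the union of the per-frame lip boxes as the candidate area.
    left = min(obs.box[0] for obs in track)
    top = min(obs.box[1] for obs in track)
    right = max(obs.box[0] + obs.box[2] for obs in track)
    bottom = max(obs.box[1] + obs.box[3] for obs in track)
    return CandidateSource(area=(left, top, right - left, bottom - top))


# Illustrative tracks: one person whose lips move, one whose lips stay nearly still.
talking = [
    LipObservation((100, 200, 40, 10), 0.10),
    LipObservation((102, 201, 40, 18), 0.35),
    LipObservation((101, 200, 40, 12), 0.15),
]
silent = [
    LipObservation((300, 210, 38, 9), 0.10),
    LipObservation((300, 210, 38, 9), 0.11),
]

print(candidate_from_track(talking))
print(candidate_from_track(silent))
```

With these made-up inputs, only the `talking` track yields a candidate. A real system would need an actual lip detector and tracker in front of this step, and could combine such visual candidates with other cues, which the abstract hints at by calling this a "first" candidate source.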
Public/Granted literature
- US20190251970A1 SYSTEM AND METHOD FOR DISAMBIGUATING A SOURCE OF SOUND BASED ON DETECTED LIP MOVEMENT, Public/Granted day: 2019-08-15