Invention Grant
- Patent Title: Intelligent multi-camera switching with machine learning
- Application No.: US17343424
- Application Date: 2021-06-09
- Publication No.: US11606510B2
- Publication Date: 2023-03-14
- Inventor: Jian David Wang, John Paul Spearman, Varun Ajay Kulkarni, Yong Yan, Xiangdong Wang, Peter L. Chu, David A. Bryan
- Applicant: Plantronics, Inc.
- Applicant Address: US CA Santa Cruz
- Assignee: Plantronics, Inc.
- Current Assignee: Plantronics, Inc.
- Current Assignee Address: US CA Santa Cruz
- Main IPC: H04N5/268
- IPC: H04N5/268; H04N7/18; G10L25/78; H04N7/15; G06V20/40; G06V40/16; H04R1/08

Abstract:
Multiple cameras are placed in a conference room, each pointed in a different direction and each including a microphone array to perform sound source localization (SSL). The SSL is used in combination with the video image to identify the speaker from among multiple individuals who appear in the video image. Neural network or machine learning processing is performed on the identified speaker to determine the quality of the front or facial view of the speaker. The best view of the speaker's face from the various cameras is selected and provided to the far end. If no view is satisfactory, a default view is selected and provided to the far end instead. The use of the SSL allows selection of the proper individual from a group of individuals in the conference room, so that only the speaker's head is analyzed for the best facial view and then framed for transmission.
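
The selection logic described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the data classes, the angle-matching rule in `pick_speaker`, the `frontal_score` field (standing in for the output of the neural network or machine learning stage), and the thresholds `max_offset_deg` and `min_score` are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Face:
    center_deg: float     # horizontal angle of a detected head in this camera's view
    frontal_score: float  # 0..1, assumed output of a face-view quality model


@dataclass
class CameraObservation:
    camera_id: str
    ssl_angle_deg: Optional[float]  # SSL direction from this camera's mic array, None if silent
    faces: Sequence[Face]


def pick_speaker(obs: CameraObservation, max_offset_deg: float = 10.0) -> Optional[Face]:
    """Return the face closest to the SSL direction, or None if nothing matches.

    Matching by angular distance is an assumption of this sketch.
    """
    if obs.ssl_angle_deg is None or not obs.faces:
        return None
    nearest = min(obs.faces, key=lambda f: abs(f.center_deg - obs.ssl_angle_deg))
    if abs(nearest.center_deg - obs.ssl_angle_deg) > max_offset_deg:
        return None
    return nearest


def select_camera(
    observations: Sequence[CameraObservation],
    default_camera_id: str,
    min_score: float = 0.6,
) -> str:
    """Pick the camera with the best facial view of the speaker, else the default."""
    best_id, best_score = default_camera_id, min_score
    for obs in observations:
        speaker = pick_speaker(obs)
        # Only the speaker's face is scored; other people in the frame are ignored.
        if speaker is not None and speaker.frontal_score >= best_score:
            best_id, best_score = obs.camera_id, speaker.frontal_score
    return best_id


if __name__ == "__main__":
    room = [
        CameraObservation("left", 20.0, [Face(18.0, 0.4), Face(-30.0, 0.95)]),
        CameraObservation("right", -15.0, [Face(-12.0, 0.8)]),
    ]
    print(select_camera(room, default_camera_id="center"))
```

In the usage example, the high-scoring face at -30 degrees in the "left" camera is ignored because it is far from that camera's SSL direction, which mirrors the abstract's point that only the speaker's head is analyzed. When no camera's view of the speaker reaches `min_score`, the function returns the default camera.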
Public/Granted literature
- US20220400216A1 INTELLIGENT MULTI-CAMERA SWITCHING WITH MACHINE LEARNING, Public/Granted day: 2022-12-15