Invention Grant
- Patent Title: Speaker identification assisted by categorical cues
-
Application No.: US15476194Application Date: 2017-03-31
-
Publication No.: US10431225B2Publication Date: 2019-10-01
- Inventor: Jeffrey D. Amsterdam , Aaron K. Baughman , Stephen C. Hammer , Mauro Marzorati
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Heslin Rothenberg Farley & Mesiti PC
- Agent Chris K. McLane, Esq.; George S. Blasiak, Esq.
- Main IPC: G10L25/78
- IPC: G10L25/78 ; B60T17/22 ; G06F9/54 ; G10L17/06 ; G06F17/27 ; G10L15/26 ; G10L17/12

Abstract:
Methods, computer program products, and systems are presented. The methods include, for instance: obtaining a media file including a speech by one or more speaker. The language of the speech is identified and biographic data of a speaker of the speech is generated by analyzing semantics and vocal characteristics of the speech. The speaker is diarized and confidence in a resulting speaker label is evaluated against a threshold. The speaker label is adjusted with the language of the speech and biographic data of the speaker and produced as speaker metadata of the media file.
Public/Granted literature
- US20180286412A1 SPEAKER IDENTIFICATION ASSISTED BY CATEGORICAL CUES Public/Granted day:2018-10-04
Information query