System for automatic extraction of structure from spoken conversation using lexical and acoustic features

Invention Grant

US10592611B2 System for automatic extraction of structure from spoken conversation using lexical and acoustic features 有权

Please log in to see more content

Patent Title: System for automatic extraction of structure from spoken conversation using lexical and acoustic features
Application No.: US15332766

Application Date: 2016-10-24
Publication No.: US10592611B2

Publication Date: 2020-03-17
Inventor: Jesse Vig , Harish Arsikere , Margaret H. Szymanski , Luke R. Plurkowski , Kyle D. Dent , Daniel G. Bobrow , Daniel Davies , Eric Saund
Applicant: Palo Alto Research Center Incorporated
Applicant Address: US TX Richardson
Assignee: Conduent Business Services, LLC
Current Assignee: Conduent Business Services, LLC
Current Assignee Address: US TX Richardson
Main IPC: G10L15/00
IPC: G10L15/00 ; G06F17/27 ; G10L25/48 ; G10L15/26 ; H04M3/51

System for automatic extraction of structure from spoken conversation using lexical and acoustic features

Abstract:

Embodiments of the present invention provide a system for automatically extracting conversational structure from a voice record based on lexical and acoustic features. The system also aggregates business-relevant statistics and entities from a collection of spoken conversations. The system may infer a coarse-level conversational structure based on fine-level activities identified from extracted acoustic features. The system improves significantly over previous systems by extracting structure based on lexical and acoustic features. This enables extracting conversational structure on a larger scale and finer level of detail than previous systems, and can feed an analytics and business intelligence platform, e.g. for customer service phone calls. During operation, the system obtains a voice record. The system then extracts a lexical feature using automatic speech recognition (ASR). The system extracts an acoustic feature. The system then determines, via machine learning and based on the extracted lexical and acoustic features, a coarse-level structure of the conversation.

Public/Granted literature

US20180113854A1 SYSTEM FOR AUTOMATIC EXTRACTION OF STRUCTURE FROM SPOKEN CONVERSATION USING LEXICAL AND ACOUSTIC FEATURES Public/Granted day:2018-04-26

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）