- Patent Title: Speech recognition system using machine learning to classify phone posterior context information and estimate boundaries in speech from combined boundary posteriors
-
Application No.: US16103251Application Date: 2018-08-14
-
Publication No.: US10424289B2Publication Date: 2019-09-24
- Inventor: Ozlem Kalinli-Akbacak
- Applicant: Sony Interactive Entertainment Inc.
- Applicant Address: JP Tokyo
- Assignee: SONY INTERACTIVE ENTERTAINMENT INC.
- Current Assignee: SONY INTERACTIVE ENTERTAINMENT INC.
- Current Assignee Address: JP Tokyo
- Agency: JDI Patent
- Agent Joshua Isenberg; Robert Pullman
- Main IPC: G10L15/04
- IPC: G10L15/04 ; G10L25/30 ; G10L25/03 ; G10L15/16

Abstract:
A speech recognition system includes a phone classifier and a boundary classifier. The phone classifier generates combined boundary posteriors from a combination of auditory attention features and phone posteriors by feeding phone posteriors of neighboring frames of an audio signal into a machine learning algorithm to classify phone posterior context information. The boundary classifier estimates boundaries in speech contained in the audio signal from the combined boundary posteriors.
Public/Granted literature
Information query