Patent search ap:("IBM") AND inv:"COHEN PAUL S" Page 1

1.

发明专利
Synchronising audio and video. 未知

公开(公告)号：GB2366110A

公开(公告)日：2002-02-27

申请号：GB0114988

申请日：2001-06-20

Applicant: IBM

Inventor： COHEN PAUL S , DILDINE JOHN R , GLEASON EDWARD J

IPC: G10L15/24 , H04N21/2368 , H04N21/43 , H04N21/434 , H04N7/52

Abstract: A method for eliminating synchronisation errors using speech recognition. Using separate audio and visual speech recognition techniques, the method identifies 110 visemes, or visual cues which are indicative of articulatory type, in the video content, and identifies 120 phones and their articulatory types in the audio content. Once the two recognition techniques have been applied, the outputs are compared 130 to determine the relative alignment and, if not aligned, a synchronisation algorithm is applied to time-adjust one or both of the audio and the visual streams in order to achieve synchronisation. Facial features, such as mouth movements, are used to provide visual cues in the video content.

2.

发明专利
SPEECH RECOGNITION SYSTEM WITH EFFICIENT STORAGE AND RAPID ASSEMBLY OF PHONOLOGICAL GRAPHS 未知

公开(公告)号：CA1242028A

公开(公告)日：1988-09-13

申请号：CA504805

申请日：1986-03-24

Applicant: IBM

Inventor： BAHL LALIT R , COHEN PAUL S , MERCER ROBERT L

IPC: G10L15/08 , G10L15/14 , G10L15/18 , G10L5/06

Abstract: SPEECH RECOGNITION SYSTEM WITH EFFICIENT STORAGE AND RAPID ASSEMBLY OF PHONOLOGICAL GRAPHS A continuous speech recognition system is disclosed having a speech processor and a word recognition computer subsystem, charcterized by means associated with the speech processor for developing a graph of confluent links between confluent nodes; means associated with the speech processor for developing a graph of boundary links between adjacent words; means associated with the speech processor for storing an inventory of confluent links and boundary links as a coding inventory; means associated with the speech processor for converting an unknown utterance into an encoded sequence of confluent links and boundary links corresponding to recognition sequences stored in said word recognition subsystem recognition vocabulary for speech recognition. The invention also includes method for achieving continuous speech recognition by characterizing speech as a sequence of confluent links which are matched with candidate words. The invention also applies to isolated word speech recognition as with continuous speech recognition, except that in such case there are no boundary links.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification