Synchronising audio and video.
    1.
    发明专利

    公开(公告)号:GB2366110A

    公开(公告)日:2002-02-27

    申请号:GB0114988

    申请日:2001-06-20

    Applicant: IBM

    Abstract: A method for eliminating synchronisation errors using speech recognition. Using separate audio and visual speech recognition techniques, the method identifies 110 visemes, or visual cues which are indicative of articulatory type, in the video content, and identifies 120 phones and their articulatory types in the audio content. Once the two recognition techniques have been applied, the outputs are compared 130 to determine the relative alignment and, if not aligned, a synchronisation algorithm is applied to time-adjust one or both of the audio and the visual streams in order to achieve synchronisation. Facial features, such as mouth movements, are used to provide visual cues in the video content.

    SPEECH RECOGNITION SYSTEM WITH EFFICIENT STORAGE AND RAPID ASSEMBLY OF PHONOLOGICAL GRAPHS

    公开(公告)号:CA1242028A

    公开(公告)日:1988-09-13

    申请号:CA504805

    申请日:1986-03-24

    Applicant: IBM

    Abstract: SPEECH RECOGNITION SYSTEM WITH EFFICIENT STORAGE AND RAPID ASSEMBLY OF PHONOLOGICAL GRAPHS A continuous speech recognition system is disclosed having a speech processor and a word recognition computer subsystem, charcterized by means associated with the speech processor for developing a graph of confluent links between confluent nodes; means associated with the speech processor for developing a graph of boundary links between adjacent words; means associated with the speech processor for storing an inventory of confluent links and boundary links as a coding inventory; means associated with the speech processor for converting an unknown utterance into an encoded sequence of confluent links and boundary links corresponding to recognition sequences stored in said word recognition subsystem recognition vocabulary for speech recognition. The invention also includes method for achieving continuous speech recognition by characterizing speech as a sequence of confluent links which are matched with candidate words. The invention also applies to isolated word speech recognition as with continuous speech recognition, except that in such case there are no boundary links.

Patent Agency Ranking