Disambiguating heteronyms in speech synthesis

    公开(公告)号:AU2015261693B2

    公开(公告)日:2017-07-20

    申请号:AU2015261693

    申请日:2015-11-27

    Applicant: APPLE INC

    Abstract: Systems and processes for disambiguating heteronyms in speech synthesis are provided. In one example process, a speech input containing a heteronym can be received from a user. The speech input can be processed using an automatic speech recognition system to determine a phonemic string corresponding to the heteronym as pronounced by the user in the speech input. A correct pronunciation of the heteronym can be determined based on at least one of the phonemic string or using an n-gram language model of the automatic speech recognition system. A dialogue response to the speech input can be generated where the dialogue response can include the heteronym. The dialogue response can be outputted as a speech output. The heteronym in the dialogue response can be pronounced in the speech output according to the correct pronunciation. ( ~(D Z1 _ _ __co_ _f c-z 0o col _n Q)0,c ) ro73 0 .) 0 0 Cf) 02 C)= -i;

    Disambiguating heteronyms in speech synthesis

    公开(公告)号:AU2015261693A1

    公开(公告)日:2016-06-23

    申请号:AU2015261693

    申请日:2015-11-27

    Applicant: APPLE INC

    Abstract: Systems and processes for disambiguating heteronyms in speech synthesis are provided. In one example process, a speech input containing a heteronym can be received from a user. The speech input can be processed using an automatic speech recognition system to determine a phonemic string corresponding to the heteronym as pronounced by the user in the speech input. A correct pronunciation of the heteronym can be determined based on at least one of the phonemic string or using an n-gram language model of the automatic speech recognition system. A dialogue response to the speech input can be generated where the dialogue response can include the heteronym. The dialogue response can be outputted as a speech output. The heteronym in the dialogue response can be pronounced in the speech output according to the correct pronunciation. ( ~(D Z1 _ _ __co_ _f c-z 0o col _n Q)0,c ) ro73 0 .) 0 0 Cf) 02 C)= -i;

Patent Agency Ranking