SPEECH RECOGNITION USING ASSOCIATIVE MAPPING
    11.
    Patent application
    Status: Under examination (published)

    Publication No.: US20160171977A1

    Publication date: 2016-06-16

    Application No.: US15049892

    Filing date: 2016-02-22

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus are described that receive audio data for an utterance. Association data is accessed that indicates associations between data corresponding to uncorrupted audio segments, and data corresponding to corrupted versions of the uncorrupted audio segments, where the associations are determined before receiving the audio data for the utterance. Using the association data and the received audio data for the utterance, data corresponding to at least one uncorrupted audio segment is selected. A transcription of the utterance is determined based on the selected data corresponding to the at least one uncorrupted audio segment.
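
The selection step in the abstract can be illustrated with a minimal sketch. It assumes the association data is a precomputed list of (corrupted, clean) feature-vector pairs and that selection is a nearest-neighbor lookup by Euclidean distance. The abstract does not state either detail; the names `ASSOCIATIONS`, `select_clean`, and `map_utterance` and the toy vectors are hypothetical.

```python
import math

# Hypothetical association data, built before any utterance is received.
# Each entry pairs the feature vector of a corrupted segment with the
# feature vector of the uncorrupted segment it was derived from.
ASSOCIATIONS = [
    ((0.9, 0.2), (1.0, 0.0)),
    ((0.1, 1.1), (0.0, 1.0)),
    ((0.6, 0.7), (0.5, 0.5)),
]


def select_clean(frame, associations=ASSOCIATIONS):
    """Return the clean vector whose corrupted counterpart is nearest to frame.

    Nearest-neighbor matching is an assumption made for illustration.
    """
    _, clean = min(associations, key=lambda pair: math.dist(frame, pair[0]))
    return clean


def map_utterance(frames, associations=ASSOCIATIONS):
    """Map every received frame to data for an uncorrupted segment."""
    return [select_clean(frame, associations) for frame in frames]
```

In this sketch the output of `map_utterance` would then be passed to a recognizer in place of the received audio features, so that the transcription is based on the selected uncorrupted data.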

    UTTERANCE SELECTION FOR AUTOMATED SPEECH RECOGNIZER TRAINING
    12.
    Patent application
    Status: Granted

    Publication No.: US20150379983A1

    Publication date: 2015-12-31

    Application No.: US14314295

    Filing date: 2014-06-25

    Applicant: Google Inc.

    CPC classification number: G10L15/063 G10L2015/0635

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a set of training utterances. The methods, systems, and apparatus include actions of obtaining a target multi-dimensional distribution of characteristics in an initial set of candidate utterances and selecting a subset of the initial set of candidate utterances based on speech recognition confidence scores associated with the candidate utterances. Additional actions include selecting a particular candidate utterance from the subset of the initial set of utterances and determining that adding the particular candidate utterance to a set of training utterances reduces a divergence of a multi-dimensional distribution of the characteristics in the set of training utterances from the target multi-dimensional distribution. Further actions include adding the particular candidate utterance to the set of training utterances.
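
The selection procedure in the abstract can be sketched as a greedy loop. This sketch assumes the divergence is a smoothed Kullback-Leibler divergence, that the confidence-based subset is formed with a fixed threshold, and that an empty training set has infinite divergence. The abstract specifies none of these; `divergence`, `select_training_set`, `min_confidence`, and the tuple layout of the candidates are hypothetical.

```python
import math
from collections import Counter


def divergence(counts, target, eps=1e-6):
    """Smoothed KL(target || empirical); KL is an assumed choice of divergence."""
    total = sum(counts.values())
    if total == 0:
        # Assumption: an empty training set is maximally far from the target.
        return math.inf
    k = len(target)
    d = 0.0
    for key, p in target.items():
        q = (counts.get(key, 0) + eps) / (total + eps * k)
        if p > 0:
            d += p * math.log(p / q)
    return d


def select_training_set(candidates, target, min_confidence=0.8):
    """Greedily keep candidates that move the set closer to the target.

    candidates: list of (utterance_id, characteristics, confidence), where
    characteristics is a hashable tuple such as ("f", "quiet").
    target: dict mapping characteristics tuples to target probabilities.
    """
    # Subset chosen by recognition confidence (the threshold is an assumption).
    subset = [c for c in candidates if c[2] >= min_confidence]
    counts = Counter()
    selected = []
    for utt_id, traits, _ in subset:
        before = divergence(counts, target)
        counts[traits] += 1
        if divergence(counts, target) < before:
            selected.append(utt_id)   # adding it reduced the divergence
        else:
            counts[traits] -= 1       # undo: it did not help
    return selected
```

Each candidate is tested by tentatively adding it, recomputing the divergence, and undoing the addition if the divergence did not drop, which mirrors the "determining that adding ... reduces a divergence" step in the abstract.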
