AUDIO CLASSIFICATION FOR INFORMATION RETRIEVAL USING SPARSE FEATURES
    1.
    发明申请
    AUDIO CLASSIFICATION FOR INFORMATION RETRIEVAL USING SPARSE FEATURES 审中-公开
    使用稀疏特征的信息检索的音频分类

    公开(公告)号:WO2010105089A1

    公开(公告)日:2010-09-16

    申请号:PCT/US2010/027031

    申请日:2010-03-11

    CPC classification number: G10L25/48 G06F17/30743

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, are provided for using audio features to classify audio for information retrieval. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query.

    Abstract translation: 提供方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用音频特征来分类用于信息检索的音频。 通常,本说明书中描述的主题的一个方面可以体现在包括产生听觉图像的集合的动作的方法中,每个听觉图像根据听觉模型从各个音频文件生成; 从集合中的每个听觉图像中提取稀疏特征以生成表示相应音频文件的稀疏特征向量; 并且响应于包括使用稀疏特征向量的一个或多个单词的查询和将稀疏特征向量与查询中的单词相关联的匹配函数进行排序。

    AUDIO CLASSIFICATION FOR INFORMATION RETRIEVAL USING SPARSE FEATURES
    3.
    发明公开
    AUDIO CLASSIFICATION FOR INFORMATION RETRIEVAL USING SPARSE FEATURES 有权
    音频分类检索信息使用稀疏特点

    公开(公告)号:EP2406787A1

    公开(公告)日:2012-01-18

    申请号:EP10712602.1

    申请日:2010-03-11

    Applicant: Google Inc.

    CPC classification number: G10L25/48 G06F17/30743

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, are provided for using audio features to classify audio for information retrieval. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of generating a collection of auditory images, each auditory image being generated from respective audio files according to an auditory model; extracting sparse features from each auditory image in the collection to generate a sparse feature vector representing the corresponding audio file; and ranking the audio files in response to a query including one or more words using the sparse feature vectors and a matching function relating sparse feature vectors to words in the query.

Patent Agency Ranking