Employing user input to facilitate inferential sound recognition based on patterns of sound primitives

Invention Grant

US10198697B2 Employing user input to facilitate inferential sound recognition based on patterns of sound primitives (Active)

Patent Title: Employing user input to facilitate inferential sound recognition based on patterns of sound primitives
Application No.: US15256236

Application Date: 2016-09-02
Publication No.: US10198697B2

Publication Date: 2019-02-05
Inventor: Sebastien J. V. Christian, Thor C. Whalen
Applicant: OtoSense Inc.
Applicant Address: US MA Cambridge
Assignee: OtoSense Inc.
Current Assignee: OtoSense Inc.
Current Assignee Address: US MA Cambridge
Agency: Park, Vaughan, Fleming & Dowler LLP
Main IPC: G10L25/48
IPC: G10L25/48 ; G06N99/00 ; G10L21/10 ; G10L25/27 ; G10L21/14 ; G08B13/16 ; G08B21/18

Abstract:

The disclosed embodiments provide a system that generates sound primitives to facilitate sound recognition. First, the system performs a feature-detection operation on sound samples to detect a set of sound features, wherein each sound feature comprises a measurable characteristic of a window of consecutive sound samples. Next, the system creates feature vectors from coefficients generated by the feature-detection operation, wherein each feature vector comprises a set of coefficients for sound features detected in a window. The system then performs a clustering operation on the feature vectors to produce feature-vector clusters, wherein each feature-vector cluster comprises a set of feature vectors that are proximate to each other in a feature-vector space that contains the feature vectors. After the clustering operation, the system defines a set of sound primitives, wherein each sound primitive is associated with a feature-vector cluster. Finally, the system associates semantic labels with the set of sound primitives.
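The steps in the abstract can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not name specific features or a clustering algorithm, so the sketch assumes two features per window (RMS energy and zero-crossing rate) and a plain k-means clustering. The function names (window_features, cluster_feature_vectors, define_primitives), the synthetic test signal, and the labels "low hum" and "high whine" are all hypothetical.

```python
import numpy as np


def window_features(samples, window_size=256):
    """Build one feature vector per non-overlapping window of samples.

    Assumed features (not specified in the abstract): RMS energy and
    zero-crossing rate.
    """
    n_windows = len(samples) // window_size
    frames = samples[: n_windows * window_size].reshape(n_windows, window_size)
    # Coefficient 1: root-mean-square energy of each window.
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    # Coefficient 2: fraction of adjacent sample pairs whose sign differs.
    sign_change = np.signbit(frames[:, 1:]) != np.signbit(frames[:, :-1])
    zcr = np.mean(sign_change, axis=1)
    return np.column_stack([rms, zcr])


def cluster_feature_vectors(vectors, k, iterations=20, seed=0):
    """Group nearby feature vectors with a basic k-means (an assumed choice)."""
    rng = np.random.default_rng(seed)
    centroids = vectors[rng.choice(len(vectors), size=k, replace=False)].copy()

    def nearest(cents):
        # Euclidean distance from every vector to every centroid.
        dists = np.linalg.norm(vectors[:, None, :] - cents[None, :, :], axis=2)
        return dists.argmin(axis=1)

    for _ in range(iterations):
        assignments = nearest(centroids)
        for j in range(k):
            members = vectors[assignments == j]
            if len(members):  # leave a centroid in place if its cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids, nearest(centroids)


def define_primitives(centroids, labels):
    """Associate each cluster with a sound primitive and a semantic label."""
    return [
        {"id": i, "centroid": c, "label": labels.get(i, "unlabeled")}
        for i, c in enumerate(centroids)
    ]


# Synthetic input: a loud 200 Hz tone followed by a quiet 1800 Hz tone.
rate = 8000
t = np.arange(8192) / rate
hum = 1.0 * np.sin(2 * np.pi * 200 * t)
whine = 0.1 * np.sin(2 * np.pi * 1800 * t)
samples = np.concatenate([hum, whine])

vectors = window_features(samples)
centroids, assignments = cluster_feature_vectors(vectors, k=2)

# Hypothetical user-supplied labels, keyed by the cluster each sound fell into.
labels = {int(assignments[0]): "low hum", int(assignments[-1]): "high whine"}
primitives = define_primitives(centroids, labels)
```

In this sketch the semantic labels are attached after clustering, mirroring the order of steps in the abstract; a real system would presumably use richer features and collect the labels from a user rather than hard-coding them.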

Public/Granted literature

US20160379666A1 EMPLOYING USER INPUT TO FACILITATE INFERENTIAL SOUND RECOGNITION BASED ON PATTERNS OF SOUND PRIMITIVES Public/Granted day: 2016-12-29

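IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L25/00	Speech or voice analysis techniques not restricted to groups G10L 15/00-G10L 21/00 (semiconductor-based muting amplifiers in which a speech detector senses special characteristics of a signal, e.g. sensing the absence of a signal, are classified in H03G3/34)
G10L25/48	. specially adapted for particular use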