Multithreaded speech-to-text processing

Invention Grant

US11776545B2 Multithreaded speech-to-text processing 有权

Please log in to see more content

Patent Title: Multithreaded speech-to-text processing
Application No.: US17994554

Application Date: 2022-11-28
Publication No.: US11776545B2

Publication Date: 2023-10-03
Inventor: Xiaolong Li , Xiaozhuo Cheng , Samuel Norris Henderson , Xu Yang
Applicant: SAS Institute Inc.
Applicant Address: US NC Cary
Assignee: SAS Institute Inc.
Current Assignee: SAS Institute Inc.
Current Assignee Address: US NC Cary
Agency: KDW Firm PLLC
Main IPC: G10L15/26
IPC: G10L15/26 ; G10L15/22 ; G10L15/02 ; G10L15/04 ; G10L25/78 ; G10L25/30

Abstract:

An apparatus includes a processor to: receive a request to perform speech-to-text conversion of a speech data set; perform pause detection to identify a set of likely sentence pauses and/or speaker diarization technique to identify a set of likely speaker changes; based the set of likely sentence pauses and/or the set of likely speaker changes, divide the speech data set into data segments representing speech segments; use an acoustic model with the data segments to derive sets of probabilities of speech sounds uttered; store the sets of probabilities in temporal order within a buffer queue; distribute the sets of probabilities from the buffer queue in temporal order among threads of a thread pool; and within each thread, and based on set(s) of probabilities, derive one candidate word and select either the candidate word or an alternate candidate word derived from a language model as the next word most likely spoken.

Public/Granted literature

US20230098063A1 Multithreaded Speech-to-Text Processing Public/Granted day:2023-03-30

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/26	.语音—正文识别系统（G10L15/08优先）