Audio file annotation

Invention Grant

US11893990B2 Audio file annotation (in force)

Patent Title: Audio file annotation
Application No.: US17486661

Application Date: 2021-09-27
Publication No.: US11893990B2

Publication Date: 2024-02-06
Inventor: Hans-Martin Ramsl
Applicant: SAP SE
Applicant Address: DE Walldorf
Assignee: SAP SE
Current Assignee: SAP SE
Current Assignee Address: DE Walldorf
Agency: SCHWEGMAN LUNDBERG & WOESSNER, P.A.
Main IPC: G10L15/22
IPC: G10L15/22 ; G06F40/295 ; G10L15/26

Abstract:

Speech-to-text translation is used to generate a transcript for an audio file. Text segments are associated with time segments in the transcript. A trained machine learning model determines, based on the text in the transcript, one or more topics for the audio file. The transcript is modified to include the determined one or more topics. A user interface may be presented that allows a user to search for portions of an audio file that relate to a particular topic. In response to the selected or entered topic, the user interface presents segments having a matching topic. The user may use voice or other user interface commands to modify the annotation of the audio file. User commands may also be used to extract data from the transcript and copy the data to a clipboard or to another application.
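
The annotate-then-search flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Segment` class, the `KEYWORDS` table, and the functions `keyword_topics`, `annotate`, and `search` are all hypothetical names introduced here, and a simple keyword lookup stands in for the trained machine learning model. The sketch also assumes topics are stored per transcript segment, which the abstract implies but does not spell out.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class Segment:
    """One transcript entry: a time span of the audio and the text spoken in it."""
    start_s: float
    end_s: float
    text: str
    topics: Set[str] = field(default_factory=set)


# Hypothetical keyword-to-topic table; a stand-in for the trained model.
KEYWORDS = {
    "budget": "finance",
    "invoice": "finance",
    "deadline": "schedule",
    "release": "schedule",
}


def keyword_topics(text: str) -> Set[str]:
    """Return the topics whose keywords occur in the text (illustrative only)."""
    words = {w.strip(".,!?").lower() for w in text.split()}
    return {topic for keyword, topic in KEYWORDS.items() if keyword in words}


def annotate(segments: List[Segment],
             model: Callable[[str], Set[str]] = keyword_topics) -> List[Segment]:
    """Modify the transcript in place so each segment carries its topics."""
    for seg in segments:
        seg.topics = set(model(seg.text))
    return segments


def search(segments: List[Segment], topic: str) -> List[Segment]:
    """Return the segments annotated with the requested topic."""
    wanted = topic.strip().lower()
    return [seg for seg in segments if wanted in seg.topics]


transcript = annotate([
    Segment(0.0, 4.5, "Let's review the budget first."),
    Segment(4.5, 9.0, "The release deadline moved to March."),
    Segment(9.0, 12.0, "Please send the invoice today."),
])

for seg in search(transcript, "finance"):
    print(f"{seg.start_s:.1f}-{seg.end_s:.1f}s: {seg.text}")
```

Because each segment keeps its start and end times, a user interface built on `search` could jump playback directly to the matching portions of the audio file.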

Public/Granted literature

US20230094828A1 AUDIO FILE ANNOTATION Publication date: 2023-03-30

IPC classification:

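G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/22	. Procedures used during a speech recognition process, e.g. man-machine dialogue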