Method for evaluating a speech forced alignment model, electronic device, and storage medium

Invention Grant

US11749257B2 Method for evaluating a speech forced alignment model, electronic device, and storage medium 有权

Please log in to see more content

Patent Title: Method for evaluating a speech forced alignment model, electronic device, and storage medium
Application No.: US18178813

Application Date: 2023-03-06
Publication No.: US11749257B2

Publication Date: 2023-09-05
Inventor: Lizhao Guo , Song Yang , Junfeng Yuan
Applicant: BEIJING CENTURY TAL EDUCATION TECHNOLOGY CO., LTD.
Applicant Address: CN Beijing
Assignee: BEIJING CENTURY TAL EDUCATION TECHNOLOGY CO., LTD.
Current Assignee: BEIJING CENTURY TAL EDUCATION TECHNOLOGY CO., LTD.
Current Assignee Address: CN Beijing
Agency: Emerson, Thomson & Bennett, LLC
Agent Roger D. Emerson; Peter R. Detorre
Priority: CN 2010925650.2 2020.09.07
Main IPC: G10L15/05
IPC: G10L15/05 ; G10L15/01 ; G10L15/02

Method for evaluating a speech forced alignment model, electronic device, and storage medium

Abstract:

A method for evaluating a speech forced alignment model, an electronic device, and a storage medium are provided. The method includes: according to each audio segment in a test set and a text corresponding to each audio segment, acquiring, by using a speech forced alignment model to be evaluated, a phoneme sequence corresponding to each audio segment and a predicted start time and a predicted end time of each phoneme in the phoneme sequence; for each phoneme, obtaining a time accuracy score of the phoneme according to the predicted start time and the predicted end time of the phoneme and a predetermined reference start time and a predetermined reference end time of the phoneme; and determining a time accuracy score of said speech forced alignment model according to the time accuracy score of each phoneme.

Public/Granted literature

US20230206902A1 METHOD FOR EVALUATING A SPEECH FORCED ALIGNMENT MODEL, ELECTRONIC DEVICE, AND STORAGE MEDIUM Public/Granted day:2023-06-29

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/04	.分段；字极限检测
G10L15/05	..字边界检测