-
Publication No.: EP4336854A2
Publication Date: 2024-03-13
Application No.: EP22842574.0
Application Date: 2022-07-14
Applicant: Lemon Inc.
Inventor: ZHENG, Xin; ZHU, Conghui; XIA, Rui; SHANG, Chuxiang; ZHONG, Dejian; JIANG, Yongsen; TU, Ming; DENG, Lelai
IPC: H04N21/854, G10L15/26, H04N21/43
Abstract: Embodiments of the present disclosure provide a multimedia processing method, device, electronic device, and storage medium. The method comprises: obtaining a first multimedia resource; determining an initial text content corresponding to the first multimedia resource by performing speech recognition on audio data of the first multimedia resource, wherein the audio data of the first multimedia resource comprises speech data of the initial text content; determining an invalid text content in the initial text content, wherein the invalid text content is semantically non-informative; determining a first playing position of speech data of the invalid text content in the first multimedia resource; and cropping the first multimedia resource based on the first playing position to obtain a second multimedia resource, wherein audio data of the second multimedia resource comprises speech data of a target text content but does not comprise the speech data of the invalid text content. The embodiments thereby crop invalid content from multimedia resources automatically, improving cropping efficiency and cropping effect.
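The cropping steps in this abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the patent: it assumes a speech recognizer has already returned word-level timestamps sorted by start time, and it treats a fixed filler-word list as the "invalid text content". The names `Word`, `FILLERS`, `invalid_spans`, and `keep_spans` are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # playing position in seconds
    end: float


# Hypothetical stand-in for "semantically non-informative" text.
FILLERS = {"um", "uh", "er", "hmm"}


def invalid_spans(words, fillers=FILLERS):
    """Return merged (start, end) playing positions of filler words.

    Assumes `words` is sorted by start time.
    """
    spans = []
    for w in words:
        if w.text.lower().strip(".,!?") in fillers:
            if spans and w.start <= spans[-1][1]:
                # Adjacent or overlapping filler: extend the previous span.
                spans[-1] = (spans[-1][0], max(spans[-1][1], w.end))
            else:
                spans.append((w.start, w.end))
    return spans


def keep_spans(duration, cut):
    """Return the parts of [0, duration] that are not covered by `cut`."""
    keep, cursor = [], 0.0
    for start, end in cut:
        if start > cursor:
            keep.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < duration:
        keep.append((cursor, duration))
    return keep


words = [
    Word("um", 0.0, 0.4),
    Word("hello", 0.4, 0.9),
    Word("uh", 0.9, 1.2),
    Word("world", 1.2, 1.8),
]
cut = invalid_spans(words)
keep = keep_spans(1.8, cut)
```

The spans in `keep` are the segments a media tool would concatenate to form the second multimedia resource; the actual cutting and re-encoding is outside the scope of this sketch.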
-
Publication No.: EP4537539A1
Publication Date: 2025-04-16
Application No.: EP23820196.6
Application Date: 2023-06-01
Applicant: Lemon Inc.
Inventor: JIANG, Wenqing; USLUBAS, Serhan; LI, Zheng; TU, Ming; PANDIRI, Shiva Shanker
IPC: H04N21/232, G10L15/02, G11B27/02, G06F3/01
-
Publication No.: EP4276827A1
Publication Date: 2023-11-15
Application No.: EP22750129.3
Application Date: 2022-01-31
Applicant: Lemon Inc.
Inventor: XIA, Rui; TU, Ming; DING, Chen; ZHENG, Weiming
Abstract: Embodiments provide a method and an apparatus for determining speech similarity, and a program product, which relate to speech technology. The method includes: playing exemplary audio and acquiring evaluation audio of a user, where the exemplary audio is audio of specified content read in a specified language; acquiring a standard pronunciation feature corresponding to the exemplary audio, and extracting, from the evaluation audio, an evaluation pronunciation feature corresponding to the standard pronunciation feature, where the standard pronunciation feature reflects a specific pronunciation of the specified content in the specified language; and determining a feature difference between the standard pronunciation feature and the evaluation pronunciation feature, and determining the similarity between the evaluation audio and the exemplary audio according to the feature difference. Because only the evaluation pronunciation feature that corresponds to the standard pronunciation feature of the exemplary audio is extracted from the evaluation audio, the module that performs similarity analysis for follow-up reading can be kept relatively small.
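The final step of this abstract, turning a feature difference into a similarity, can be sketched in Python as follows. The abstract does not specify the features, the distance, or the mapping, so everything here is an assumption for illustration: features are taken to be equal-length sequences of per-frame vectors, the difference is the mean Euclidean distance between corresponding frames, and the hypothetical mapping `1 / (1 + d)` converts it to a score in (0, 1].

```python
import math


def feature_difference(standard, evaluation):
    """Mean Euclidean distance between corresponding feature frames.

    Assumption: both inputs are already aligned and have the same length.
    """
    if not standard or len(standard) != len(evaluation):
        raise ValueError("features must be non-empty and aligned frame by frame")
    total = sum(math.dist(s, e) for s, e in zip(standard, evaluation))
    return total / len(standard)


def similarity(difference):
    """Map a non-negative difference to a score in (0, 1] (assumed mapping)."""
    return 1.0 / (1.0 + difference)


standard = [[0.0, 0.0], [1.0, 1.0]]
evaluation = [[3.0, 4.0], [1.0, 1.0]]
diff = feature_difference(standard, evaluation)
score = similarity(diff)
```

A smaller difference yields a score closer to 1; identical features give exactly 1.0.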
-
-