-
1.
公开(公告)号:WO2023063880A2
公开(公告)日:2023-04-20
申请号:PCT/SG2022/050704
申请日:2022-09-29
Applicant: LEMON INC.
Inventor: LU, Wei Tsung , WANG, Ju-Chiang , WON, Minz , CHOI, Keunwoo , SONG, Xuchen
Abstract: Devices, systems and methods related to causing an apparatus to generate music information of audio data using a transformer-based neural network model with a multilevel transformer for audio analysis, using a spectral and a temporal transformer, are disclosed herein. The processor generates a time-frequency representation of obtained audio data to be applied as input for a transformer-based neural network model; determines spectral embeddings and first temporal embeddings of the audio data based on the time-frequency representation of the audio data; determines each vector of a second frequency class token (FCT) by passing each vector of the first FCT in the spectral embeddings through the spectral transformer; determines second temporal embeddings by adding a linear projection of the second FCT to the first temporal embeddings; determines third temporal embeddings by passing the second temporal embeddings through the temporal transformer; and generates music information based on the third temporal embeddings.
-
公开(公告)号:WO2023003505A2
公开(公告)日:2023-01-26
申请号:PCT/SG2022/050404
申请日:2022-06-13
Applicant: LEMON INC.
Inventor: WON, Minz , CHOI, Keunwoo , FENG, Yuanjian
IPC: G06N3/0895 , G06N3/0455 , G10L25/30 , G06N3/0464 , G06N20/00 , G06F16/61 , G06F16/65 , G06F16/683 , G06N3/08 , G10G1/00 , G10H1/0025
Abstract: The present disclosure describes techniques for identifying music attributes. The described techniques comprises receiving audio data of a piece of music; determining at least one attribute of the piece of music based on the audio data of the piece of music using a model; the model comprising a convolutional neural network and a transformer; the model being pre-trained using training data, wherein the training data comprise labelled data associated with a first plurality of music samples and unlabelled data associated with a second plurality of music samples, the labelled data comprise audio data of the first plurality of music samples and label information indicative of attributes of the first plurality of music samples, and the unlabelled data comprise audio data of the second plurality of music samples.
-
公开(公告)号:EP4550173A1
公开(公告)日:2025-05-07
申请号:EP23829734.5
申请日:2023-05-11
Applicant: Beijing Zitiao Network Technology Co., Ltd. , Lemon Inc.
Inventor: LIN, Xiaohui , DAI, Junyu , ZHANG, Luxi , LU, Wei Tsung , WON, Minz , SONG, Xuchen , HE, Jie , XIANG, Haoran
IPC: G06F16/638
Abstract: Provided in the embodiments of the present disclosure are a song list generation method and apparatus, and an electronic device, a computer-readable storage medium, a computer program product and a computer program. The method comprises: acquiring candidate song library information, wherein the candidate song library information comprises feature expressions of candidate songs, and the feature expressions represent song features in a plurality of dimensions; determining a similarity score of at least one candidate song according to the candidate song library information and a target feature expression, wherein the target feature expression is a feature expression of a seed song, and the similarity score represents the similarity between the candidate song and the seed song; and determining a target song on the basis of the similarity score of the candidate song, and generating a recommended song list on the basis of the target song. The similarity between the seed song and the candidate song is evaluated by using the feature expressions which represent the song features in the plurality of dimensions, and therefore a set of target songs which are more consistent with the seed song can be obtained, such that the recommended song list generated on the basis of the target songs has better consistency in the aspects of content, style, etc.
-
-