Abstract:
PURPOSE: A frequency analysis method using an analysis window is provided to supply more reliable and accurate frequency information for melody extraction and music content analysis in an audio signal containing various sounds. CONSTITUTION: An input audio signal is resampled (S10). The time-domain audio signal is converted into a frequency-domain signal (S20). From the spectrum of each resulting frame, the peak amplitude and the frequency at which that peak occurs are extracted (S30). Based on the extracted values, the range in which the melody pitch of each frame lies is reset (S40). Dynamic change information of the melody pitch is extracted by computing an autocorrelation coefficient between frames (S50).
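Steps S20, S30 and S50 can be illustrated with a minimal sketch. It is not the method of the abstract: the naive DFT, the single-peak picking, the function names, and the reading of "autocorrelation coefficient between frames" as a normalized correlation between the magnitude spectra of two frames are all assumptions made for illustration. Resampling (S10) and the pitch-range reset (S40) are left out.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short demo frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_peak(frame, sample_rate):
    """Return (peak amplitude, frequency in Hz of the peak bin) for one frame."""
    mags = magnitude_spectrum(frame)
    k = max(range(len(mags)), key=mags.__getitem__)
    return mags[k], k * sample_rate / len(frame)


def frame_correlation(a, b):
    """Normalized correlation coefficient between two equal-length sequences."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(
        sum((x - mean_a) ** 2 for x in a) * sum((y - mean_b) ** 2 for y in b)
    )
    return num / den if den else 0.0


def sine_frame(freq, sample_rate=8000, size=64):
    """Hypothetical test signal: one frame of a pure sine tone."""
    return [math.sin(2 * math.pi * freq * t / sample_rate) for t in range(size)]
```

With a 64-sample frame at 8000 Hz the bin spacing is 125 Hz, so `spectral_peak(sine_frame(500), 8000)` reports a peak frequency of 500.0 Hz. Comparing the spectra of consecutive frames with `frame_correlation` gives a value near 1 when the pitch is steady and a low value when it changes.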
Abstract:
PURPOSE: A system for extracting a descriptor of multimedia content is provided to extract information about the multimedia content and express that information through the descriptor. CONSTITUTION: A web server (200) provides a web site to a user terminal (100). A multimedia dictionary service server (300) analyzes the multimedia content of the user terminal. The multimedia dictionary service server extracts the descriptor from the multimedia content information and attaches the descriptor to the corresponding multimedia content. The service server searches for the corresponding multimedia content using the descriptor and provides the retrieved multimedia content to the user terminal.
Abstract:
PURPOSE: A method for building a model that recognizes emotion in speech, using a loss function and a max-margin technique based on the WTM (Watson-Tellegen Emotional Model), is provided to substantially improve recognition of the emotion contained in speech. CONSTITUTION: The difference between emotions is determined using the geometric distance between emotion groups of the WTM (310). Based on the values set in the first step, a value of the loss function is obtained (330). Based on the loss function, the parameters of each speech emotion module are obtained through a max-margin method with margin scaling (340).
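The relationship between the distance-based loss and margin scaling can be sketched as follows. This is an illustration under stated assumptions: the 2-D coordinates in `EMOTION_COORDS` are hypothetical placeholders rather than values from the WTM or from the source, and `margin_scaled_hinge` is the generic margin-rescaled hinge loss, which may differ from the exact formulation the abstract refers to.

```python
import math

# Hypothetical 2-D positions of emotion groups (illustrative values only).
EMOTION_COORDS = {
    "happy": (1.0, 0.0),
    "calm": (0.0, -1.0),
    "sad": (-1.0, 0.0),
    "angry": (0.0, 1.0),
}


def emotion_loss(true_label, predicted_label):
    """Loss of predicting one emotion for another: Euclidean distance between groups."""
    return math.dist(EMOTION_COORDS[true_label], EMOTION_COORDS[predicted_label])


def margin_scaled_hinge(scores, true_label):
    """Margin-rescaled hinge loss: the required margin grows with the emotion loss.

    `scores` maps each label to a model score. The true label contributes a
    term of exactly 0, so the result is never negative.
    """
    return max(
        emotion_loss(true_label, label) + score - scores[true_label]
        for label, score in scores.items()
    )
```

Under this loss, confusing two distant emotions (here "happy" and "sad") is penalized more than confusing two nearby ones, so training has to separate distant emotions by a wider margin.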
Abstract:
PURPOSE: A method and an apparatus for content-based image search are provided to locate a specific area of an image based on the image's content and then extract a feature vector from the located area. CONSTITUTION: An area search module (140) extracts a specific area that is darker than its surroundings. A fingerprint extraction module (150) extracts a fingerprint vector of the specific area. A similarity calculation module (160) compares, area by area, the fingerprint vector with the fingerprint vector of another image stored in an image information DB.
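The three modules can be mimicked with a small sketch. Every concrete choice here is an assumption made for illustration, not the technique of the abstract: "darker than its surroundings" is approximated by a threshold below the global image mean, the fingerprint is an intensity histogram, and similarity is cosine similarity.

```python
import math


def dark_region(image, margin=30):
    """Coordinates of pixels darker than the image mean by more than `margin`."""
    pixels = [v for row in image for v in row]
    mean = sum(pixels) / len(pixels)
    return [
        (r, c)
        for r, row in enumerate(image)
        for c, v in enumerate(row)
        if v < mean - margin
    ]


def fingerprint(image, region, bins=4):
    """Normalized histogram of 8-bit intensities over the region's pixels."""
    hist = [0] * bins
    for r, c in region:
        hist[min(image[r][c] * bins // 256, bins - 1)] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def cosine_similarity(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0
```

Images are assumed to be lists of rows of 8-bit grayscale values. A query would compute the fingerprint of its dark region and rank stored fingerprints by `cosine_similarity`.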
Abstract:
An embodiment of the present invention relates to a method of generating a robot dance motion expression. The method includes: an information extraction step of extracting first music information from a music signal; an information prediction step of predicting second music information, using the first music information, so that a dance motion can be expressed; and an information selection step of selecting, from a motion database, dance-motion information suited to the second music information.
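The prediction and selection steps can be pictured with a toy sketch. The abstract does not say what the first and second music information are, so this example assumes, purely for illustration, that the first is a list of detected beat times and the second is the predicted next inter-beat interval. The motion database and its entries are hypothetical.

```python
def predict_next_interval(beat_times):
    """Predict the next inter-beat interval as the mean of the observed intervals."""
    intervals = [b - a for a, b in zip(beat_times, beat_times[1:])]
    return sum(intervals) / len(intervals)


def select_motion(predicted_interval, motion_db):
    """Pick the motion whose nominal beat interval is closest to the prediction."""
    return min(motion_db, key=lambda name: abs(motion_db[name] - predicted_interval))


# Hypothetical motion database: motion name -> nominal seconds per beat.
MOTION_DB = {"sway": 1.0, "step": 0.5, "spin": 0.25}
```

Predicting ahead of the music lets a motion be chosen before the next beat arrives, which is presumably why a prediction step sits between extraction and selection.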
Abstract:
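An image segmentation method according to an embodiment of the present invention includes: a first step of dividing an input image into superpixels; a second step of constructing a hypergraph by connecting two or more adjacent superpixels, among the superpixels obtained in the first step, subject to specific conditions; and a third step of extracting a feature vector for each edge of the hypergraph to construct a joint feature map and partitioning the constructed hypergraph through higher-order correlation clustering.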
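The hypergraph construction in the second step can be sketched from a superpixel label map. The abstract does not state the connection conditions, so this sketch assumes one for illustration: a hyperedge is either a pair of touching superpixels or a triple in which all three touch one another. The feature extraction and higher-order correlation clustering of the third step are not covered.

```python
from itertools import combinations


def adjacency(labels):
    """Pairs of superpixel ids that touch horizontally or vertically in a label map."""
    pairs = set()
    rows, cols = len(labels), len(labels[0])
    for r in range(rows):
        for c in range(cols):
            for dr, dc in ((0, 1), (1, 0)):
                rr, cc = r + dr, c + dc
                if rr < rows and cc < cols and labels[r][c] != labels[rr][cc]:
                    pairs.add(frozenset((labels[r][c], labels[rr][cc])))
    return pairs


def hyperedges(labels):
    """Pairwise edges plus triples of mutually adjacent superpixels (assumed rule)."""
    pairs = adjacency(labels)
    nodes = sorted({n for p in pairs for n in p})
    triples = {
        frozenset(t)
        for t in combinations(nodes, 3)
        if all(frozenset(p) in pairs for p in combinations(t, 2))
    }
    return pairs | triples
```

Here `labels` is a 2-D list in which each entry is the id of the superpixel covering that pixel. Each returned `frozenset` is one hyperedge, and a real pipeline would attach a feature vector to each before clustering.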