METHOD AND APPARATUS FOR CONSTRUCTING VOICE TEMPLATES FOR A SPEAKER-INDEPENDENT VOICE RECOGNITION SYSTEM
    71.
    发明公开
    METHOD AND APPARATUS FOR CONSTRUCTING VOICE TEMPLATES FOR A SPEAKER-INDEPENDENT VOICE RECOGNITION SYSTEM 有权
    方法和设备参考模型的说话人无关的语音识别系统生产

    公开(公告)号:EP1301919A2

    公开(公告)日:2003-04-16

    申请号:EP01952681.3

    申请日:2001-07-11

    Inventor: BI, Ning

    CPC classification number: G10L15/063

    Abstract: A method and apparatus for constructing voice templates for a speaker-independent voice recognition system includes segmenting a training utterance to generate time-clustered segments, each segment being represented by a mean. The means for all utterances of a given word are quantized to generate template vectors. Each template vector is compared with testing utterances to generate a comparison result. The comparison is typically a dynamic time warping computation. The training utterances are matched with the template vectors if the comparison result exceeds at least one predefined threshold value, to generate an optimal path result, and the training utterances are partitioned in accordance with the optical path result. The partitioning is typically a K-means segmentation computation. The partitioned utterances may then be re-quantized and re-compared with the testing utterances until the at least one predefined threshold value is not exceeded.

    METHOD AND APPARATUS FOR SEARCHING AN EXCITATION CODEBOOK IN A CODE EXCITED LINEAR PREDICTION (CLEP) CODER
    72.
    发明公开
    METHOD AND APPARATUS FOR SEARCHING AN EXCITATION CODEBOOK IN A CODE EXCITED LINEAR PREDICTION (CLEP) CODER 失效
    方法和装置一个ERREGUNGSKODEBUCHES作者CELP编码器搜索

    公开(公告)号:EP0917710A1

    公开(公告)日:1999-05-26

    申请号:EP97937095.0

    申请日:1997-07-31

    CPC classification number: G10L19/12 G10L25/06

    Abstract: Method and apparatus for selecting a code vector in an algebraic codebook wherein the analysis window for the coder is extended beyond the length of the target speech frame. An input signal is filtered by a perceptual weighting filter (76). Then, the filter is set to ring out for a number of samples equal to the length of the perceptual weighting filter (76), while a zero input vector is applied as input. By extending the analysis window, the two dimensional impulse response matrix can be stored as a one dimensional autocorrelation matrix in memory (60, 80), greatly saving on the computational complexity and memory required for the search.

    FACIAL EXPRESSION RECOGNITION
    73.
    发明申请

    公开(公告)号:WO2022182447A1

    公开(公告)日:2022-09-01

    申请号:PCT/US2022/013371

    申请日:2022-01-21

    Abstract: Systems and techniques are provided for facial expression recognition. In some examples, a system receives an image frame corresponding to a face of a person. The system also determines, based on a three-dimensional model of the face, landmark feature information associated with landmark features of the face. The system then inputs, to at least one layer of a neural network trained for facial expression recognition, the image frame and the landmark feature information. The system further determines, using the neural network, a facial expression associated with the face.

    METHODS AND SYSTEMS FOR APPLYING COMPLEX OBJECT DETECTION IN A VIDEO ANALYTICS SYSTEM

    公开(公告)号:WO2019083738A9

    公开(公告)日:2019-05-02

    申请号:PCT/US2018/055544

    申请日:2018-10-12

    Abstract: Techniques and systems are provided for tracking objects in one or more video frames. For example, a first set of one or more bounding regions are determined for a video frame based on a trained classification network applied to the video frame. The first set of one or more bounding regions are associated with one or more objects in the video frame. One or more blobs can be detected for the video frame. A blob includes pixels of at least a portion of an object in the video frame. A second set of one or more bounding regions are determined for the video frame that are associated with the one or more blobs. A final set of one or more bounding regions is determined for the video frame using the first set of one or more bounding regions and the second set of one or more bounding regions. Object tracking can then be performed for the video frame using the final set of one or more bounding regions.

    DETECTING FALSE POSITIVES IN FACE RECOGNITION

    公开(公告)号:WO2019046078A1

    公开(公告)日:2019-03-07

    申请号:PCT/US2018/047612

    申请日:2018-08-22

    Abstract: Techniques and systems are provided for detecting false positive faces in one or more video frames. For example, a video frame of a scene can be obtained. The video frame includes a face of a user associated with at least one characteristic feature. The face of the user is determined to match a representative face from stored representative data. The representative face is associated with the at least one characteristic feature. The face of the user is determined to match the representative face based on the at least one characteristic feature. The face of the user can then be determined to be a false positive face based on the face of the user matching the representative face.

    HIGH-LEVEL SIGNALLING FOR FISHEYE VIDEO DATA
    77.
    发明申请

    公开(公告)号:WO2018218047A1

    公开(公告)日:2018-11-29

    申请号:PCT/US2018/034435

    申请日:2018-05-24

    Abstract: An example method includes processing a file including fisheye video data, the file including a syntax structure including a plurality of syntax elements that specify attributes of the fisheye video data, wherein the plurality of syntax elements includes: a first syntax element that explicitly indicates whether the fisheye video data is monoscopic or stereoscopic, and one or more syntax elements that implicitly indicate whether the fisheye video data is monoscopic or stereoscopic; determining, based on the first syntax element, whether the fisheye video data is monoscopic or stereoscopic; and rendering, based on the determination, the fisheye video data as monoscopic or stereoscopic.

    ORIENTED IMAGE STITCHING FOR SPHERICAL IMAGE CONTENT

    公开(公告)号:WO2018093483A1

    公开(公告)日:2018-05-24

    申请号:PCT/US2017/055932

    申请日:2017-10-10

    Abstract: Techniques are described related to generating image content. A graphics processing unit (GPU) is configured to receive a first set of images generated from a first camera device in a first location, the first camera device having a first orientation, render for display the first set of images oriented to an orientation reference, receive a second, different set of images generated from a second, different camera device in a second, different location, the second camera device having a second orientation, the second orientation being different than the first orientation, and render for display the second set of images oriented to the orientation reference.

    METHODS AND SYSTEMS OF DETERMINING COSTS FOR OBJECT TRACKING IN VIDEO ANALYTICS
    79.
    发明申请
    METHODS AND SYSTEMS OF DETERMINING COSTS FOR OBJECT TRACKING IN VIDEO ANALYTICS 审中-公开
    视频分析中确定物体跟踪成本的方法和系统

    公开(公告)号:WO2017204897A1

    公开(公告)日:2017-11-30

    申请号:PCT/US2017/024720

    申请日:2017-03-29

    Abstract: Techniques and systems are provided for processing video data. For example, techniques and systems are provided for determining costs for blob trackers and blobs. A blob can be detected in a video frame. The blob includes pixels of at least a portion of a foreground object. A physical distance between a blob tracker and the blob can be determined. A size ratio between the blob tracker and the blob can also be determined. A cost between the blob tracker and the blob can then be determined using the physical distance and the size ratio. In some cases, a spatial relationship between the blob tracker and the blob is determined, in which case the physical distance can be determined based on the spatial relationship. Blob trackers can be associated with blobs based on the determined costs between the blob trackers and the blobs.

    Abstract translation: 提供了用于处理视频数据的技术和系统。 例如,提供了用于确定斑点跟踪器和斑点的成本的技术和系统。 在视频帧中可以检测到斑点。 该斑点包括前景对象的至少一部分的像素。 可以确定斑点跟踪器和斑点之间的物理距离。 斑点跟踪器和斑点之间的大小比率也可以确定。 然后可以使用物理距离和大小比率来确定斑点跟踪器和斑点之间的成本。 在一些情况下,确定斑点跟踪器和斑点之间的空间关系,在这种情况下,可以基于空间关系来确定物理距离。 根据blob跟踪器和blob之间确定的成本,Blob跟踪器可以与blob关联。

    METHODS AND SYSTEMS OF GENERATING A BACKGROUND PICTURE FOR VIDEO CODING
    80.
    发明申请
    METHODS AND SYSTEMS OF GENERATING A BACKGROUND PICTURE FOR VIDEO CODING 审中-公开
    生成视频编码背景图的方法和系统

    公开(公告)号:WO2017184257A1

    公开(公告)日:2017-10-26

    申请号:PCT/US2017/019854

    申请日:2017-02-28

    Abstract: Techniques and systems are provided for generating a background picture. The background picture can be used for coding one or more pictures. For example, a method of generating a background picture includes generating a long-term background model for one or more pixels of a background picture. The long-term background model includes a statistical model for detecting long-term motion of the one or more pixels in a sequence of pictures. The method further includes generating a short-term background model for the one or more pixels of the background picture. The short-term background model detects short-term motion of the one or more pixels between two or more pictures. The method further includes determining a value for the one or more pixels of the background picture using the long-term background model and the short-term background model.

    Abstract translation: 提供了用于生成背景图片的技术和系统。 背景图片可以用于编码一个或多个图片。 例如,生成背景图片的方法包括为背景图片的一个或多个像素生成长期背景模型。 长期背景模型包括用于检测图片序列中的一个或多个像素的长期运动的统计模型。 该方法还包括为背景图片的一个或多个像素生成短期背景模型。 短期背景模型检测两个或更多个图片之间的一个或多个像素的短期运动。 该方法还包括使用长期背景模型和短期背景模型来确定背景图片的一个或多个像素的值。

Patent Agency Ranking