Patent search ap:("IBM") AND inv:"BROWN LISA MARIE" Page 1

1.

发明申请
SYSTEM AND METHOD FOR GENERATING A VIEWABLE VIDEO INDEX FOR LOW BANDWIDTH APPLICATIONS 审中-公开
Title translation: 用于为低带宽应用生成可视视频索引的系统和方法

公开(公告)号：WO2005081127A3

公开(公告)日：2005-11-03

申请号：PCT/EP2005050757

申请日：2005-02-22

Applicant: IBM , IBM UK , BROWN LISA MARIE , CONNELL JONATHAN HUDSON , COOKE RAYMOND ANDERSON , HAMPAPUR ARUN , PANKANTI SHARATHCHANDRA UMAPAT , SENIOR ANDREW WILLIAM , TIAN YING-LI

Inventor： BROWN LISA MARIE , CONNELL JONATHAN HUDSON , COOKE RAYMOND ANDERSON , HAMPAPUR ARUN , PANKANTI SHARATHCHANDRA UMAPAT , SENIOR ANDREW WILLIAM , TIAN YING-LI

IPC: G06F17/30 , G06T9/00 , H04N7/18 , H04N7/26

CPC classification number: H04N7/181 , G08B13/19604 , G08B13/19608 , G08B13/19673 , G08B13/19684 , H04N19/20

Abstract: A system and method for generating a viewable video index for low bandwidth applications are provided. The exemplary aspects of the present invention solve the problems with the prior art systems by incorporating information for generating a viewable representation of the video data into the index, thus generating a viewable video index. The viewable video index contains information for generating a visual representation of moving objects in the video data, a visual representation of the background of the video capture area, i.e. the scene, a representation of the object trajectory, a representation of the object attributes, and a representation of detected events. The result is that the viewable video index may be transmitted to a low bandwidth application on a client device and may be used along with associated object and background models to generate a representation of the actual video data without requiring that the original video data itself be streamed to the client device.

Abstract translation: 提供了一种用于为低带宽应用生成可视视频索引的系统和方法。本发明的示例性方面通过将用于生成视频数据的可视表示的信息并入索引来解决现有技术系统的问题，从而生成可观看的视频索引。可视视频索引包含用于生成视频数据中的移动对象的视觉表示，视频捕获区域的背景（即场景）的视觉表示，对象轨迹的表示，对象属性的表示以及检测到的事件的表示。结果是可以将可视视频索引传输到客户端设备上的低带宽应用，并且可以与关联的对象和背景模型一起使用以生成实际视频数据的表示，而不需要原始视频数据本身被流化到客户端设备。

2.

发明申请
SEMANTIC PARSING OF OBJECTS IN VIDEO 审中-公开
Title translation: 视频中的对象语义分离

公开(公告)号：WO2012013711A3

公开(公告)日：2013-02-21

申请号：PCT/EP2011062925

申请日：2011-07-27

Applicant: IBM , IBM UK , VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

Inventor： VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

IPC: G06K9/00 , G06K9/46

CPC classification number: G06K9/00718 , G06K9/00369 , G06K9/00664 , G06K9/469 , G06K9/6201 , G06K9/6202 , G06K9/6232 , G06K9/6857

Abstract: The invention provides an improved method to detect semantic attributes of human body in computer vision. In detecting semantic attributes of human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then go through geometric and resolution context analysis by applying the physical structure principles of a human body and by analyzing increasingly higher resolution versions of the image to verify the existence and accuracy of parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image by evaluating appearance features, geometric features, and resolution context features when available on the higher resolution version of the image. Finally, an optimal configuration step is performed via dynamic programming to select an optimal output with both semantic attributes and spatial positions of human body parts on the frame.

Abstract translation: 本发明提供了一种用于检测计算机视觉中人体语义属性的改进方法。在检测计算机视觉中人体的语义属性时，本发明保留了语义属性的列表，每个语义属性对应于人体部分。然后，计算机模块通过为每个段找到最可能的属性来分析数字视频的帧的段以检测每个语义属性。应用阈值来选择帧的候选片段用于进一步分析。然后，帧的候选片段通过应用人体的物理结构原理并通过分析图像的越来越高的分辨率版本来验证部件和属性的存在和准确性来进行几何和分辨率上下文分析。计算机模块基于通过在更高分辨率版本上可用时评估外观特征，几何特征和分辨率上下文特征来计算针对图像的较高分辨率版本的加权平均得分，来计算图像的较低分辨率版本的分辨率上下文得分的图像。最后，通过动态规划执行最佳配置步骤，以选择具有框架上人体部位的语义属性和空间位置的最优输出。

3.

发明申请
MULTI-MODE VIDEO EVENT INDEXING 审中-公开
Title translation: 多模式视频活动指标

公开(公告)号：WO2012022744A3

公开(公告)日：2012-04-26

申请号：PCT/EP2011064088

申请日：2011-08-16

Applicant: IBM , IBM UK , ZHAI YUN , FERIS ROGERIO SCHMIDT , BROWN LISA MARIE , HAMPAPUR ARUN , BOBBITT RUSSELL PATRICK

Inventor： ZHAI YUN , FERIS ROGERIO SCHMIDT , BROWN LISA MARIE , HAMPAPUR ARUN , BOBBITT RUSSELL PATRICK

IPC: G06K9/00

CPC classification number: G06T7/254 , G06K9/00536 , G06K9/00771 , G06K9/4647 , G06K9/6212 , G06T7/20 , G06T7/2053 , G06T7/74 , G06T7/97 , G06T2207/10016 , G06T2207/20224 , G06T2207/30232 , H04N7/18

Abstract: Multi-mode video event indexing includes determining a quality of object distinctiveness with respect to images from a video stream input. A high-quality analytic mode is selected from multiple modes and applied to video input images via a hardware device to determine object activity within the video input images if the determined level of detected quality of object distinctiveness meets a threshold level of quality, else a low-quality analytic mode is selected and applied to the video input images via a hardware device to determine object activity within the video input images, wherein the low-quality analytic mode is different from the high-quality analytic mode.

Abstract translation: 多模式视频事件索引包括确定相对于来自视频流输入的图像的对象特征的质量。从多种模式中选择高质量的分析模式，并通过硬件设备将视频输入图像应用于视频输入图像，以确定检测到的对象特征质量水平达到阈值质量水平时视频输入图像中的对象活动，否则低通过硬件设备选择质量分析模式并将其应用于视频输入图像，以确定视频输入图像内的对象活动，其中低质量分析模式不同于高质量分析模式。

4.

发明申请
CATEGORIZING MOVING OBJECTS INTO FAMILIAR COLOURS IN VIDEO 审中-公开
Title translation: 将移动物体分类为视频中的家庭彩色

公开(公告)号：WO2008113650A3

公开(公告)日：2009-09-03

申请号：PCT/EP2008051823

申请日：2008-02-15

Applicant: IBM , IBM UK , BROWN LISA MARIE

Inventor： BROWN LISA MARIE

IPC: G06T7/20

CPC classification number: G06T7/20 , G06K9/4652

Abstract: An improved solution for categorizing moving objects into familiar colours in video is provided. In an embodiment of the invention, a method for categorizing moving objects into familiar colours in video comprises: receiving a video input; determining at least one object track of the video input; creating a normalized cumulative histogram of the at least one object track; performing a parameterization quantization of the histogram including separating the histogram into regions based on at least one surface curve derived from one of saturation and intensity; and identifying a significant colour of the quantized histogram.

Abstract translation: 提供了一种改进的解决方案，用于将移动对象分类为视频中熟悉的颜色。在本发明的一个实施例中，用于将移动对象分类为视频中熟悉的颜色的方法包括：接收视频输入; 确定视频输入的至少一个物体轨道; 创建所述至少一个物体轨道的归一化累积直方图; 执行直方图的参数化量化，包括基于从饱和度和强度之一导出的至少一个曲面曲线将直方图分离成区域; 并且识别量化直方图的显着颜色。

5.

发明专利
Semantisches Parsen von Objekten in Videos 未知

公开(公告)号：DE112011101927T5

公开(公告)日：2013-09-05

申请号：DE112011101927

申请日：2011-07-27

Applicant: IBM

Inventor： VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

IPC: G06K9/00

Abstract: Die Erfindung stellt ein verbessertes Verfahren zum Erkennen semantischer Attribute des menschlichen Körpers in der Computersicht bereit. Beim Erkennen semantischer Attribute des menschlichen Körpers in der Computersicht unterhält die Erfindung eine Liste semantischer Attribute, von denen jedes einem menschlichen Körperteil entspricht. Dann analysiert ein Computermodul Segmente eines Einzelbildes eines digitalen Videos, um jedes semantische Attribut durch Suchen eines wahrscheinlichsten Attributs für jedes Segment zu erkennen. Ein Grenzwert wird angewandt, um Kandidatensegmente des Einzelbildes für die weitere Analyse auszuwählen. Die Kandidatensegmente des Einzelbildes durchlaufen dann eine geometrische und eine Auflösungskontextanalyse, indem die physischen Aufbauprinzipien eines menschlichen Körpers angewandt werden und indem Versionen des Bildes mit zunehmend höherer Auflösung analysiert werden, um das Vorhandensein und die Genauigkeit der Teile und Attribute zu überprüfen. Ein Computermodul berechnet eine Auflösungskontextzahl für eine Version des Bildes mit niedrigerer Auflösung auf der Grundlage einer für eine Version des Bildes mit höherer Auflösung berechneten Zahl des gewichteten Mittels, indem Auftretensmerkmale, geometrische Merkmale und Auflösungskontextmerkmale ausgewertet werden, falls sie in der Version des Bildes mit höherer Auflösung verfügbar sind. Schließlich wird mittels dynamischer Programmierung ein Schritt für die optimale Konfiguration durchgeführt, um eine optimale Ausgabe mit semantischen Attributen und auch räumlichen Positionen menschlicher Körperteile im Einzelbild auszuwählen.

6.

发明专利
Improved abandoned object recognition using pedestrian detection 未知

公开(公告)号：GB2496266A

公开(公告)日：2013-05-08

申请号：GB201218606

申请日：2012-10-17

Applicant: IBM

Inventor： BROWN LISA MARIE , KJELDSEN FREDERIK CARL , FERIS ROGERIO SCHMIDT , SCHERBAUM KRISTINA

IPC: G06K9/00

Abstract: Methods and apparatus are provided for improved abandoned object recognition using pedestrian detection. An abandoned object is detected in one or more images by determining if one or more detected objects in a foreground of the images comprises a potential abandoned object; applying a trained pedestrian detector 130 to the potential abandoned object to determine if the potential abandoned object comprises at least a portion of a pedestrian; and classifying the potential abandoned object as an abandoned object based on whether the potential abandoned object is not at least a portion of a pedestrian. The trained pedestrian detector is trained using positive training samples comprised of at least portions of human bodies in one or more poses and/or negative training samples comprised of at least portions of abandoned objects.

7.

发明专利
Incorporating video meta-data in 3D models 未知

公开(公告)号：GB2503621B

公开(公告)日：2014-03-12

申请号：GB201318426

申请日：2012-05-02

Applicant: IBM

Inventor： BROWN LISA MARIE , FERIS ROGERIO SCHMIDT , PANKANTI SHARATHCHANDRA UMAPATHIRAO , DATTA ANKUR

IPC: H04N5/232 , G06T13/20 , G06T17/00

Abstract: A moving object tracked within a field of view environment of a two-dimensional data feed of a calibrated video camera is represented by a three-dimensional model. An appropriate three-dimensional mesh-based volumetric model for the object is initialized by using a back-projection of a corresponding two-dimensional image. A texture of the object is projected onto the three-dimensional model, and two-dimensional tracks of the object are upgraded to three-dimensional motion to drive a three-dimensional model.

8.

发明专利
Incorporating video meta-data in 3D models 未知

公开(公告)号：GB2503621A

公开(公告)日：2014-01-01

申请号：GB201318426

申请日：2012-05-02

Applicant: IBM

Inventor： BROWN LISA MARIE , FERIS ROGERIO SCHMIDT , PANKANTI SHARATHCHANDRA UMAPATHIRAO , DATTA ANKUR

IPC: H04N5/232 , G06T13/20 , G06T17/00

Abstract: A moving object detected and tracked within a field of view environment of a 2D data feed of a calibrated video camera is represented by a 3D model through localizing a centroid of the object and determining an intersection with a ground-plane within the field of view environment. An appropriate 3D mesh-based volumetric model for the object is initialized by using a back-projection of a corresponding 2D image as a function of the centroid and the determined ground-plane intersection. Nonlinear dynamics of a tracked motion path of the object are represented as a collection of different local linear models. A texture of the object is projected onto the 3D model, and 2D tracks of the object are upgraded to 3D motion to drive the 3D model by learning a weighted combination of the different local linear models that minimizes an image re-projection error of model movement.

9.

发明专利
Categorizing moving objects into familiar colours in video 未知

公开(公告)号：AU2008228412A1

公开(公告)日：2008-09-25

申请号：AU2008228412

申请日：2008-02-15

Applicant: IBM

Inventor： BROWN LISA MARIE

IPC: G06T7/20 , G06T7/40

Abstract: An improved solution for categorizing moving objects into familiar colors in video is provided. In an embodiment of the invention, a method for categorizing moving objects into familiar colors in video comprises: receiving a video input; determining at least one object track of the video input; creating a normalized cumulative histogram of the at least one object track; and one of: performing a parameterization quantization of the histogram including separating the histogram into regions based on at least one surface curve derived from one of saturation and intensity; or identifying a significant color of the quantized histogram.

10.

发明专利
Semantisches Parsen von Objekten in Videos 未知

公开(公告)号：DE112011101927B4

公开(公告)日：2016-03-17

申请号：DE112011101927

申请日：2011-07-27

Applicant: IBM

Inventor： VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

IPC: G06K9/00

Abstract: Die Erfindung stellt ein verbessertes Verfahren zum Erkennen semantischer Attribute des menschlichen Körpers in der Computersicht bereit. Beim Erkennen semantischer Attribute des menschlichen Körpers in der Computersicht unterhält die Erfindung eine Liste semantischer Attribute, von denen jedes einem menschlichen Körperteil entspricht. Dann analysiert ein Computermodul Segmente eines Einzelbildes eines digitalen Videos, um jedes semantische Attribut durch Suchen eines wahrscheinlichsten Attributs für jedes Segment zu erkennen. Ein Grenzwert wird angewandt, um Kandidatensegmente des Einzelbildes für die weitere Analyse auszuwählen. Die Kandidatensegmente des Einzelbildes durchlaufen dann eine geometrische und eine Auflösungskontextanalyse, indem die physischen Aufbauprinzipien eines menschlichen Körpers angewandt werden und indem Versionen des Bildes mit zunehmend höherer Auflösung analysiert werden, um das Vorhandensein und die Genauigkeit der Teile und Attribute zu überprüfen. Ein Computermodul berechnet eine Auflösungskontextzahl für eine Version des Bildes mit niedrigerer Auflösung auf der Grundlage einer für eine Version des Bildes mit höherer Auflösung berechneten Zahl des gewichteten Mittels, indem Auftretensmerkmale, geometrische Merkmale und Auflösungskontextmerkmale ausgewertet werden, falls sie in der Version des Bildes mit höherer Auflösung verfügbar sind. Schließlich wird mittels dynamischer Programmierung ein Schritt für die optimale Konfiguration durchgeführt, um eine optimale Ausgabe mit semantischen Attributen und auch räumlichen Positionen menschlicher Körperteile im Einzelbild auszuwählen.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification