Patent search ap:("IBM") AND inv:"VAQUERO DANIEL" Page 1

1.

Invention application
SEMANTIC PARSING OF OBJECTS IN VIDEO (status: under examination, published)

Publication (announcement) number: WO2012013711A3

Publication (announcement) date: 2013-02-21

Application number: PCT/EP2011062925

Application date: 2011-07-27

Applicant: IBM , IBM UK , VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

Inventor: VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

IPC: G06K9/00 , G06K9/46

CPC classification number: G06K9/00718 , G06K9/00369 , G06K9/00664 , G06K9/469 , G06K9/6201 , G06K9/6202 , G06K9/6232 , G06K9/6857

Abstract: The invention provides an improved method to detect semantic attributes of human body in computer vision. In detecting semantic attributes of human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then go through geometric and resolution context analysis by applying the physical structure principles of a human body and by analyzing increasingly higher resolution versions of the image to verify the existence and accuracy of parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image by evaluating appearance features, geometric features, and resolution context features when available on the higher resolution version of the image. Finally, an optimal configuration step is performed via dynamic programming to select an optimal output with both semantic attributes and spatial positions of human body parts on the frame.
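
The pipeline described in the abstract (threshold the segments, combine appearance, geometric, and resolution context scores into a weighted average, then pick the best configuration by dynamic programming) can be illustrated with a minimal sketch. It is not taken from the patent: the `Candidate` fields, the weights, the threshold value, the three-part head/torso/legs chain, and the "parts must appear top to bottom" rule are all hypothetical choices made only to keep the example small.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    """A frame segment with its most likely attribute (hypothetical structure)."""
    part: str                 # body part, e.g. "head"
    attribute: str            # most likely semantic attribute, e.g. "hat"
    y: float                  # vertical position in the frame, 0.0 = top
    appearance: float         # appearance score in [0, 1]
    geometric: float          # geometric score in [0, 1]
    resolution_context: Optional[float] = None  # None if no higher-resolution image


def weighted_score(c: Candidate, weights=(0.5, 0.3, 0.2)) -> float:
    """Weighted average over the scores that are available (weights are assumed)."""
    terms = [(c.appearance, weights[0]), (c.geometric, weights[1])]
    if c.resolution_context is not None:
        terms.append((c.resolution_context, weights[2]))
    return sum(s * w for s, w in terms) / sum(w for _, w in terms)


def select_candidates(cands: Sequence[Candidate], threshold: float = 0.4) -> List[Candidate]:
    """Keep only segments whose appearance score passes the (assumed) threshold."""
    return [c for c in cands if c.appearance >= threshold]


def best_configuration(cands: Sequence[Candidate],
                       parts=("head", "torso", "legs")) -> Optional[List[Candidate]]:
    """Chain dynamic program: one candidate per part, ordered top to bottom."""
    layers = [[c for c in cands if c.part == p] for p in parts]
    if any(not layer for layer in layers):
        return None
    neg_inf = float("-inf")
    # best[i][j] = (best total score of a chain ending at layers[i][j], backpointer)
    best = [[(weighted_score(c), None) for c in layers[0]]]
    for i in range(1, len(layers)):
        row = []
        for c in layers[i]:
            # Only predecessors located above this segment are geometrically valid.
            options = [(best[i - 1][k][0], k)
                       for k, prev in enumerate(layers[i - 1]) if prev.y < c.y]
            if options:
                prev_score, k = max(options)
                row.append((prev_score + weighted_score(c), k))
            else:
                row.append((neg_inf, None))
        best.append(row)
    j = max(range(len(best[-1])), key=lambda idx: best[-1][idx][0])
    if best[-1][j][0] == neg_inf:
        return None
    chain = []
    for i in range(len(layers) - 1, -1, -1):  # follow backpointers to the first part
        chain.append(layers[i][j])
        j = best[i][j][1]
    return list(reversed(chain))
```

Calling `best_configuration(select_candidates(candidates))` on a list of `Candidate` objects returns one candidate per body part, which carries both the attribute label and the position, or `None` when no geometrically consistent chain exists.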

2.

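Invention patent
Semantisches Parsen von Objekten in Videos [Semantic parsing of objects in videos] (status: unknown)

Publication (announcement) number: DE112011101927T5

Publication (announcement) date: 2013-09-05

Application number: DE112011101927

Application date: 2011-07-27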

Applicant: IBM

Inventor: VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

IPC: G06K9/00

Abstract: The invention provides an improved method for detecting semantic attributes of the human body in computer vision. In detecting semantic attributes of the human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then undergo geometric and resolution context analysis, in which the physical structure principles of a human body are applied and increasingly higher resolution versions of the image are analyzed to verify the existence and accuracy of the parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image, by evaluating appearance features, geometric features, and resolution context features when they are available in the higher resolution version of the image. Finally, an optimal configuration step is performed by means of dynamic programming to select an optimal output with semantic attributes as well as spatial positions of human body parts in the frame.

3.

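Invention patent
Semantisches Parsen von Objekten in Videos [Semantic parsing of objects in videos] (status: unknown)

Publication (announcement) number: DE112011101927B4

Publication (announcement) date: 2016-03-17

Application number: DE112011101927

Application date: 2011-07-27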

Applicant: IBM

Inventor: VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , HAMPAPUR ARUN , BROWN LISA MARIE

IPC: G06K9/00

Abstract: The invention provides an improved method for detecting semantic attributes of the human body in computer vision. In detecting semantic attributes of the human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then undergo geometric and resolution context analysis, in which the physical structure principles of a human body are applied and increasingly higher resolution versions of the image are analyzed to verify the existence and accuracy of the parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image, by evaluating appearance features, geometric features, and resolution context features when they are available in the higher resolution version of the image. Finally, an optimal configuration step is performed by means of dynamic programming to select an optimal output with semantic attributes as well as spatial positions of human body parts in the frame.

4.

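Invention patent
Semantic parsing of objects in video (status: unknown)

Publication (announcement) number: GB2495881A

Publication (announcement) date: 2013-04-24

Application number: GB201302234

Application date: 2011-07-27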

Applicant: IBM

Inventor: VAQUERO DANIEL , FERIS ROGERIO SCHMIDT , BROWN LISA MARIE , HAMPAPUR ARUN

IPC: G06K9/00 , G06K9/62

Abstract: The invention provides an improved method to detect semantic attributes of human body in computer vision. In detecting semantic attributes of human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then go through geometric and resolution context analysis by applying the physical structure principles of a human body and by analyzing increasingly higher resolution versions of the image to verify the existence and accuracy of parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image by evaluating appearance features, geometric features, and resolution context features when available on the higher resolution version of the image. Finally, an optimal configuration step is performed via dynamic programming to select an optimal output with both semantic attributes and spatial positions of human body parts on the frame.
