-
Publication number: US20190251499A1
Publication date: 2019-08-15
Application number: US16390966
Filing date: 2019-04-22
Applicant: Amazon Technologies, Inc.
Inventor: Dilip Kumar , Elie Micah Kornfield , Alexander Clark Prater , Sridhar Boyapati , Xiaofeng Ren , Chang Yuan
CPC classification number: G06Q10/087 , G07G1/0036
Abstract: This disclosure describes a system for tracking removal or placement of items at inventory locations within a materials handling facility. In some instances, a user may remove an item from an inventory location and the inventory management system may detect that removal and update a user item list associated with the user to include an item identifier representative of the removed item. Likewise, if the user places an item at an inventory location, the inventory management system may detect that placement and update the user item list to remove an item identifier representative of the placed item.
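The list-update behavior in this abstract can be illustrated with a minimal sketch. The class and method names (`UserItemList`, `on_item_removed`, `on_item_placed`) are hypothetical and not taken from the patent, and the sketch assumes the detection step has already produced an item identifier.

```python
from collections import Counter


class UserItemList:
    """Hypothetical per-user item list keyed by item identifier."""

    def __init__(self):
        self._counts = Counter()

    def on_item_removed(self, item_id):
        # The user took an item from an inventory location: add its identifier.
        self._counts[item_id] += 1

    def on_item_placed(self, item_id):
        # The user put an item back: remove one matching identifier, if any.
        if self._counts[item_id] > 0:
            self._counts[item_id] -= 1
            if self._counts[item_id] == 0:
                del self._counts[item_id]

    def item_ids(self):
        # Sorted list with one entry per unit currently held by the user.
        return sorted(self._counts.elements())


item_list = UserItemList()
item_list.on_item_removed("sku-123")
item_list.on_item_removed("sku-456")
item_list.on_item_placed("sku-123")
held = item_list.item_ids()
```

A counter is used so that taking two units of the same item and returning one leaves one identifier on the list.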
-
Publication number: US10326894B1
Publication date: 2019-06-18
Application number: US15836642
Filing date: 2017-12-08
Applicant: Amazon Technologies, Inc.
Inventor: Geoffrey Scott Heller , Chang Yuan
Abstract: A mobile device for projecting images on a surface and for maintaining a position of the image on the surface. The device includes an image rendering module to scan the surface and to modify the image based on data obtained about the three-dimensional nature of the surface. The device further includes an optical image stabilization module to adjust an alignment of the projector based on orientation data of the device, an electronic image stabilization module to shift the image within a projection area based on the orientation data, and an image-tracking module to shift the image within the projection area based on changes in the relative positions of the image and a target.
-
Publication number: US09305226B1
Publication date: 2016-04-05
Application number: US13893175
Filing date: 2013-05-13
Applicant: Amazon Technologies, Inc.
Inventor: Chang Yuan , Geoffrey Scott Heller , Louis LeRoi LeGrand, III , Daniel Bibireata , Neil Cooper , Laura Varnum Finney , Saurabh Verma
IPC: G06K9/18
CPC classification number: G06K9/726 , G06K9/723 , G06K2209/01
Abstract: The accuracy of a text recognition process can be improved using a set of semantic boosting rules, as may be contained in a sequence or other such arrangement. When text is output from a text recognition process, that text can have alternatives and confidence values for different characters or portions of the string. In order to improve the accuracy, this data can be processed using the organized rules, where rules are applied as long as any preconditions for that rule are satisfied, and each rule has the ability to modify the confidence values or modify one or more of the alternatives. When a result is produced with a minimum confidence level, or all applicable rules have been applied, the result can be provided as a refined text output of the recognition process.
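The rule loop described in this abstract might look roughly like the sketch below. Everything here is an illustrative assumption rather than the patented method: alternatives are modeled as a dictionary from candidate string to confidence, and `Rule`, `refine`, and the digit-boosting rule are hypothetical names.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Candidates = Dict[str, float]  # candidate string -> confidence (assumed model)


@dataclass
class Rule:
    precondition: Callable[[Candidates], bool]
    apply: Callable[[Candidates], Candidates]


def refine(candidates: Candidates, rules: List[Rule],
           min_confidence: float = 0.8) -> Tuple[str, float]:
    current = dict(candidates)
    for rule in rules:
        best = max(current, key=current.get)
        if current[best] >= min_confidence:
            break  # a result already meets the minimum confidence level
        if rule.precondition(current):
            current = rule.apply(current)  # may change confidences or alternatives
    best = max(current, key=current.get)
    return best, current[best]


# Hypothetical rule: if any alternative is purely numeric, boost numeric ones.
boost_digits = Rule(
    precondition=lambda c: any(s.isdigit() for s in c),
    apply=lambda c: {s: min(1.0, v + 0.5) if s.isdigit() else v
                     for s, v in c.items()},
)

result = refine({"l23": 0.5, "123": 0.375}, [boost_digits])
```

With these inputs the numeric alternative overtakes the initially preferred `"l23"` once the rule fires.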
-
Publication number: US09298980B1
Publication date: 2016-03-29
Application number: US13789508
Filing date: 2013-03-07
Applicant: Amazon Technologies, Inc.
Inventor: Chang Yuan , Geoffrey Scott Heller
CPC classification number: G06K9/00456 , G06K9/0061 , G06K9/3233 , G06K9/3258
Abstract: Various embodiments provide an image pre-processing method to accelerate image computing tasks. For example, an image is obtained from a camera of a portable computing device. Salient features of the image are identified by a number of techniques, such as identifying feature points, identifying gradients within the image, and other techniques and methods discussed herein. Based at least in part on the identified salient features, a saliency map of the image is computed. The saliency map includes a saliency value for each of at least a portion of the plurality of pixels. In this example, pixels above a threshold saliency value are processed with an optical character recognition engine.
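As a minimal sketch of the saliency-then-threshold idea, the code below uses gradient magnitude alone as the saliency measure, which is only one of the techniques the abstract mentions. The function names, the normalization to [0, 1], and the 0.5 threshold are assumptions made for illustration; the selected pixels would then be handed to an OCR engine, which is not shown.

```python
def saliency_map(img):
    """Gradient-magnitude saliency, normalized to [0, 1].

    `img` is a grayscale image given as a list of rows of numbers.
    """
    h, w = len(img), len(img[0])
    raw = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Central differences, clamped at the image border.
            gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            raw[y][x] = abs(gx) + abs(gy)
    peak = max(max(row) for row in raw)
    if peak == 0:
        return raw  # flat image: nothing is salient
    return [[v / peak for v in row] for row in raw]


def salient_pixels(img, threshold=0.5):
    """Return (x, y) coordinates whose saliency exceeds the threshold."""
    sal = saliency_map(img)
    return [(x, y) for y, row in enumerate(sal)
            for x, v in enumerate(row) if v > threshold]


# A vertical edge between columns 1 and 2.
edge = [[0, 0, 255, 255] for _ in range(3)]
selected = salient_pixels(edge)
```

Only the pixels adjacent to the edge are selected, so a downstream recognizer would skip the uniform regions.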
-
Publication number: US09058644B2
Publication date: 2015-06-16
Application number: US13800951
Filing date: 2013-03-13
Applicant: Amazon Technologies, Inc.
Inventor: David Paul Ramos , Chang Yuan , Keith Harrison Goodman , Avnish Sikka
CPC classification number: G06T5/001 , G06K9/00228 , G06K9/03 , G06K9/3258 , G06K9/34 , G06K9/40 , G06K9/44 , G06K2209/01 , G06T7/73 , G06T2207/10004 , G06T2207/30168 , G06T2207/30201
Abstract: Various embodiments enable regions of text to be identified in an image captured by a camera of a computing device for preprocessing before being analyzed by a visual recognition engine. For example, each of the identified regions can be analyzed or tested to determine whether a respective region contains a quality associated with poor text recognition results, such as poor contrast, blur, noise, and the like, which can be measured by one or more algorithms. Upon identifying a region with such a quality, an image quality enhancement can be automatically applied to the respective region without user instruction or intervention. Accordingly, once each region has been cleared of the quality associated with poor recognition, the regions of text can be processed with a visual recognition algorithm or engine.
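The test-then-enhance step can be sketched for a single quality measure. The example below assumes poor contrast is measured as the intensity range of a region and corrected with a linear contrast stretch; the abstract does not specify these algorithms, and the names and the `min_contrast` value are hypothetical.

```python
def contrast(region):
    """Intensity range of a grayscale region (list of rows)."""
    flat = [v for row in region for v in row]
    return max(flat) - min(flat)


def stretch(region):
    """Linearly rescale intensities to span 0..255."""
    flat = [v for row in region for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [row[:] for row in region]  # uniform region: nothing to stretch
    return [[round((v - lo) * 255 / (hi - lo)) for v in row] for row in region]


def preprocess(regions, min_contrast=100):
    """Enhance only the regions that fail the contrast test."""
    return [stretch(r) if contrast(r) < min_contrast else r for r in regions]


low = [[100, 110], [120, 130]]   # contrast 30: enhanced
good = [[0, 255]]                # contrast 255: left as-is
cleaned = preprocess([low, good])
```

Blur or noise checks would slot into `preprocess` the same way, each paired with its own enhancement.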
-
Publication number: US09912847B1
Publication date: 2018-03-06
Application number: US13626596
Filing date: 2012-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Chang Yuan , Leo B. Baldwin , Geoffrey Scott Heller , Daniel Bibireata , Louis L. LeGrand, III , Christopher S. A. MacGregor , Charles P. Vaughn , Francislav P. Penov
CPC classification number: H04N5/225 , H04N5/23222 , H04N5/2351
Abstract: A user can capture an image of an object, using a computing device, to obtain information about that object. If a specular highlight (or other saturated region) is detected, the device can attempt to determine a location of a light source associated with the highlight. The device can then provide instructions as to a direction to move in order to reduce the presence of the specular highlight in subsequent images. Multiple images of the object can be captured and analyzed to generate a three-dimensional reconstruction of the environment, whereby a position of the light source can be determined. In other embodiments, movement of the specular reflections in response to movement of the device is used to determine a direction of the light source. In other embodiments, an image of the user is captured to determine the position of the light source based on shadows or reflections on the user.
-
Publication number: US09177224B1
Publication date: 2015-11-03
Application number: US13828401
Filing date: 2013-03-14
Applicant: Amazon Technologies, Inc.
Inventor: Geoffrey Scott Heller , Chang Yuan
CPC classification number: G06K9/4671 , G06K9/52 , G06K9/6211
Abstract: Various embodiments utilize geometric hashing to automatically recognize and track an object. For example, a user can capture an image of a product, or other object, and a point detection algorithm can identify particular features of the product and designate feature points of the product within the captured image. Hash values are then determined for each feature point by determining a basis for the image and determining the location of each feature point relative to that basis. A hash table including the identified hash values is then created and compared to hash values for either a stored product when performing object recognition or from a reference image when performing object tracking.
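A textbook form of geometric hashing is sketched below to illustrate the basis-relative hashing the abstract describes. It assumes a two-point basis (giving invariance to translation, rotation, and uniform scale) and uses quantized basis coordinates as the hash key; the function names and the quantization step are illustrative choices, not details from the patent.

```python
from collections import Counter
from itertools import permutations


def basis_coords(p, b0, b1):
    """Coordinates of p in the frame with origin b0 and unit x-axis b0 -> b1."""
    dx, dy = b1[0] - b0[0], b1[1] - b0[1]
    norm2 = dx * dx + dy * dy
    px, py = p[0] - b0[0], p[1] - b0[1]
    return (px * dx + py * dy) / norm2, (py * dx - px * dy) / norm2


def quantize(uv, step=0.25):
    return round(uv[0] / step), round(uv[1] / step)


def build_table(points, step=0.25):
    """Map each quantized coordinate to the model bases that produce it."""
    table = {}
    for i, j in permutations(range(len(points)), 2):
        for k, p in enumerate(points):
            if k not in (i, j):
                key = quantize(basis_coords(p, points[i], points[j]), step)
                table.setdefault(key, set()).add((i, j))
    return table


def vote(table, scene_points, step=0.25):
    """Vote for model bases, using the first two scene points as the basis."""
    votes = Counter()
    b0, b1 = scene_points[0], scene_points[1]
    for p in scene_points[2:]:
        key = quantize(basis_coords(p, b0, b1), step)
        for basis in table.get(key, ()):
            votes[basis] += 1
    return votes


model = [(0, 0), (4, 0), (4, 2), (1, 3)]
# The same points rotated 90 degrees, scaled by 2, and translated.
scene = [(-2 * y + 10, 2 * x + 5) for x, y in model]
votes = vote(build_table(model), scene)
```

The model basis `(0, 1)` receives a vote from every remaining scene point despite the change in pose, which is the property that makes the table usable for both recognition and tracking.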
-
Publication number: US09160993B1
Publication date: 2015-10-13
Application number: US13945823
Filing date: 2013-07-18
Applicant: Amazon Technologies, Inc.
Inventor: Christopher John Lish , Geoffrey Scott Heller , Jim Oommen Thomas , Chang Yuan , Oleg Rybakov
CPC classification number: H04N9/3185 , G06F3/0425 , G06F3/0488 , H04N5/23219 , H04N5/23229 , H04N5/23293 , H04N9/3194
Abstract: Approaches enable the projection of one or more visual elements, such as one or more dynamically changing graphical elements, that can substantially bound, or otherwise at least partially surround or identify, an object recognized by a computing device. The computing device can project the graphical elements to collectively appear as a bounding element for the recognized/actionable object or object portion. As such, the graphical elements can appear as a bounding element that adorns, decorates, highlights, and/or emphasizes, etc., the recognized/actionable object or object portion. The graphical elements can be dynamic. For example, the graphical elements can be projected to move around individually over time, while still appearing to at least partially surround the recognized/actionable object or object portion. Further, the graphical elements can be used to improve various object recognition approaches.
-
Publication number: US20140270528A1
Publication date: 2014-09-18
Application number: US13800951
Filing date: 2013-03-13
Applicant: Amazon Technologies, Inc.
Inventor: David Paul Ramos , Chang Yuan , Keith Harrison Goodman , Avnish Sikka
IPC: G06T5/00
CPC classification number: G06T5/001 , G06K9/00228 , G06K9/03 , G06K9/3258 , G06K9/34 , G06K9/40 , G06K9/44 , G06K2209/01 , G06T7/73 , G06T2207/10004 , G06T2207/30168 , G06T2207/30201
Abstract: Various embodiments enable regions of text to be identified in an image captured by a camera of a computing device for preprocessing before being analyzed by a visual recognition engine. For example, each of the identified regions can be analyzed or tested to determine whether a respective region contains a quality associated with poor text recognition results, such as poor contrast, blur, noise, and the like, which can be measured by one or more algorithms. Upon identifying a region with such a quality, an image quality enhancement can be automatically applied to the respective region without user instruction or intervention. Accordingly, once each region has been cleared of the quality associated with poor recognition, the regions of text can be processed with a visual recognition algorithm or engine.