-
公开(公告)号:US20180129972A1
公开(公告)日:2018-05-10
申请号:US15394708
申请日:2016-12-29
Applicant: Google Inc.
Inventor: Zhifeng Chen , Michael Schuster , Melvin Jose Johnson Premkumar , Yonghui Wu , Quoc V. Le , Maxim Krikun , Thorsten Brants
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for performing machine learning tasks. One method includes receiving (i) a model input, and (ii) data identifying a first machine learning task to be performed on the model input to generate a first type of model output for the model input; augmenting the model input with an identifier for the first machine learning task to generate an augmented model input; and processing the augmented model input using a machine learning model, wherein the machine learning model has been trained on training data to perform a plurality of machine learning tasks including the first machine learning task, and wherein the machine learning model has been configured through training to process the augmented model input to generate a machine learning model output of the first type for the model input.
-
公开(公告)号:US09208232B1
公开(公告)日:2015-12-08
申请号:US13731891
申请日:2012-12-31
Applicant: Google Inc.
Inventor: Sundeep Tirumalareddy , Michael E. Flaster , Eric Lehman , Paul Haahr , Yonghui Wu
IPC: G06F17/30
CPC classification number: G06F17/30864
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating synthetic descriptive text. One of the methods includes identifying a group of linking resources, wherein each of the linking resources includes a link to a respective target resource; determining, from a search engine index, that at least some of the target resources are associated with seed queries; generating term location information that identifies, for each seed query, locations of terms from the seed query in the linking resource that links to the target resource associated with the seed query; generating synthetic descriptive text for the target resources based on the term location information; and associating the synthetic descriptive text with the target resources in the search engine index.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于产生合成描述性文本。 方法之一包括识别一组链接资源,其中每个链接资源包括到相应目标资源的链接; 从搜索引擎索引确定至少一些目标资源与种子查询相关联; 生成术语位置信息,其针对每个种子查询标识来自链接到与种子查询相关联的目标资源的链接资源中的种子查询的术语的位置; 基于术语位置信息为目标资源生成合成描述性文本; 并将合成描述性文本与搜索引擎索引中的目标资源相关联。
-
公开(公告)号:US20150161086A1
公开(公告)日:2015-06-11
申请号:US14211487
申请日:2014-03-14
Applicant: Google Inc.
Inventor: Yonghui Wu , Michael E. Flaster , Randall G. Keller , Paul Haahr
CPC classification number: G06F17/30247 , G06F3/04842 , G06F17/211 , G06F17/212 , G06F17/2785 , G06F17/30011 , G06F17/30047 , G06F17/30253 , G06F17/30289 , G06F17/30876
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating descriptive text for images. In one aspect, a method includes identifying a set of seed descriptors for an image in a document that is hosted on a website. For each seed descriptor, structure information is generated that specifies a structure of the document with respect to the image and the seed descriptor. One or more templates are generated for each seed descriptor using the structure information for the seed descriptor. Each template can include image location information, document structure information, image feature information, and a generative rule that generates descriptive text for other images in other documents. Descriptive text for other images is generated using the templates and the other documents. The descriptive text is associated with the images.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于生成用于图像的描述性文本。 一方面,一种方法包括识别在一个网站上托管的文档中的图像的一组种子描述符。 对于每个种子描述符,生成指定关于图像和种子描述符的文档的结构的结构信息。 使用种子描述符的结构信息为每个种子描述符生成一个或多个模板。 每个模板可以包括图像位置信息,文档结构信息,图像特征信息和生成其他文档中的其他图像的描述文本的生成规则。 使用模板和其他文档生成其他图像的描述性文本。 描述性文字与图像相关联。
-
公开(公告)号:US09436747B1
公开(公告)日:2016-09-06
申请号:US14750483
申请日:2015-06-25
Applicant: Google Inc.
Inventor: Steven D. Baker , Michael E. Flaster , Nitin Gupta , Paul Haahr , Srinivasan Venkatachary , Yonghui Wu
CPC classification number: G06F17/30563 , G06F17/248 , G06F17/30389 , G06F17/3064 , G06F17/30864
Abstract: Methods, systems, and apparatus, including computer program products, for generating synthetic queries using seed queries and structural similarity between documents are described. In one aspect, a method includes identifying embedded coding fragments (e.g., HTML tag) from a structured document and a seed query; generating one or more query templates, each query template corresponding to at least one coding fragment, the query template including a generative rule to be used in generating candidate synthetic queries; generating the candidate synthetic queries by applying the query templates to other documents that are hosted on the same web site as the document; identifying terms that match structure of the query templates as candidate synthetic queries; measuring a performance for each of the candidate synthetic queries; and designating as synthetic queries the candidate synthetic queries that have performance measurements exceeding a performance threshold.
-
-
-