Patent search ap:("Google Inc.") AND inv:"Srinidhi Viswanatha" Page 1

1.

Patent application
AUTOMATIC GENERATION OF TEMPLATES FOR PARSING ELECTRONIC DOCUMENTS

Status: Under examination (published)

Publication number: US20170308517A1

Publication date: 2017-10-26

Application number: US14024147

Application date: 2013-09-11

Applicant: Google Inc.

Inventor: Vanja Josifovski , Srinidhi Viswanatha

IPC: G06F17/24

CPC classification number: G06Q10/10 , G06F16/313

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving a plurality of electronic documents, each electronic document being associated with an identifier that is associated with a source of the electronic document, grouping electronic documents of the plurality of electronic documents into a plurality of base sub-groups based on respective sources, for each base sub-group of the plurality of base sub-groups, automatically processing electronic documents to provide one or more templates, each template mapping content to one or more markers, and storing the one or more templates in memory, each template being accessible by one or more parsers to parse content from subsequently received electronic documents.
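
The abstract above describes grouping documents by source and deriving templates that map content to markers, but it does not say how a template is derived. The following is a minimal sketch under stated assumptions, not the patented method: documents are hypothetical `(source_id, text)` pairs, and a template is built by keeping tokens that are identical across a sub-group and replacing varying positions with numbered markers. The names `group_by_source`, `build_template`, `build_templates`, and the `{FIELD_n}` marker format are invented for illustration.

```python
from collections import defaultdict


def group_by_source(documents):
    """Group (source_id, text) pairs into base sub-groups keyed by source."""
    groups = defaultdict(list)
    for source_id, text in documents:
        groups[source_id].append(text)
    return dict(groups)


def build_template(texts):
    """Assumed derivation: constant tokens stay literal, varying ones become markers.

    Returns None when documents differ in token count, since this naive
    sketch only aligns documents position by position.
    """
    token_rows = [text.split() for text in texts]
    length = len(token_rows[0])
    if any(len(row) != length for row in token_rows):
        return None
    template = []
    marker = 0
    for column in zip(*token_rows):
        if len(set(column)) == 1:
            template.append(column[0])  # same token in every document
        else:
            template.append(f"{{FIELD_{marker}}}")  # content varies here
            marker += 1
    return template


def build_templates(documents):
    """Build one template per source; a dict stands in for template storage."""
    return {
        source_id: build_template(texts)
        for source_id, texts in group_by_source(documents).items()
    }
```

A parser could then read the value at each marker position from a later document sent by the same source. A real system would need alignment that tolerates documents of different lengths, which this sketch deliberately leaves out.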

2.

Granted patent
Generating and applying data extraction templates

Status: In force

Publication number: US09563689B1

Publication date: 2017-02-07

Application number: US14470510

Application date: 2014-08-27

Applicant: Google Inc.

Inventor: Luis Garcia Pueyo , Vanja Josifovski , Amitabh Saikia , Jie Yang , Mike Bendersky , Srinidhi Viswanatha , Marc-Allen Cartright

IPC: G06F17/30

CPC classification number: G06F17/30705

Abstract: Methods, apparatus, and computer-readable media are provided for generating and applying data extraction templates. In various implementations, a corpus of structured communications such as emails may be grouped into clusters based on one or more similarities between the structured communications. A set of structural paths may be identified from structured communications of a particular cluster. One or more structural paths of the set may be classified as transient wherein a count of occurrences of one or more associated segments of text across the particular cluster satisfies a criterion. One or more transient paths may be assigned a semantic data type and/or a confidentiality designation based on various signals. A data extraction template may be generated to extract, from subsequent structured communications, segments of text associated with transient (and in some cases, non-confidential) structural paths.
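
The transient-path classification described in the abstract can be illustrated with a short sketch. It rests on assumptions the abstract does not state: each structured communication is modeled as a hypothetical dict from a structural path (for example an XPath-like string) to its text segment, and the "criterion" is taken to be that the most frequent text at a path appears in at most `max_fraction` of the cluster. The function names and the threshold value are invented for illustration.

```python
from collections import Counter, defaultdict


def classify_paths(cluster, max_fraction=0.5):
    """Label each structural path in a cluster as 'transient' or 'fixed'.

    cluster: list of dicts mapping structural path -> text segment.
    A path is 'transient' when its most common text occurs in at most
    max_fraction of the documents (an assumed criterion).
    """
    texts_by_path = defaultdict(Counter)
    for doc in cluster:
        for path, text in doc.items():
            texts_by_path[path][text] += 1

    labels = {}
    for path, counts in texts_by_path.items():
        most_common_count = counts.most_common(1)[0][1]
        fraction = most_common_count / len(cluster)
        labels[path] = "transient" if fraction <= max_fraction else "fixed"
    return labels


def make_template(labels):
    """A template here is simply the sorted list of transient paths."""
    return sorted(path for path, label in labels.items() if label == "transient")


def apply_template(template, doc):
    """Extract the text segments at the template's paths from a new document."""
    return {path: doc[path] for path in template if path in doc}
```

Boilerplate such as a greeting repeats across the cluster and is labeled fixed, while order numbers or dates vary and are labeled transient. Assigning semantic data types or confidentiality designations, which the abstract also mentions, is outside this sketch.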


3.

Granted patent
Generating and applying data extraction templates

Status: In force

Publication number: US10216838B1

Publication date: 2019-02-26

Application number: US15394610

Application date: 2016-12-29

Applicant: Google Inc.

Inventor: Luis Garcia Pueyo , Vanja Josifovski , Amitabh Saikia , Jie Yang , Mike Bendersky , Srinidhi Viswanatha , Marc-Allen Cartright

IPC: G06F17/30 , G06F21/62 , G06F17/27

Abstract: Methods, apparatus, and computer-readable media are provided for generating and applying data extraction templates. In various implementations, a corpus of structured communications such as emails may be grouped into clusters based on one or more similarities between the structured communications. A set of structural paths may be identified from structured communications of a particular cluster. One or more structural paths of the set may be classified as transient wherein a count of occurrences of one or more associated segments of text across the particular cluster satisfies a criterion. One or more transient paths may be assigned a semantic data type and/or a confidentiality designation based on various signals. A data extraction template may be generated to extract, from subsequent structured communications, segments of text associated with transient (and in some cases, non-confidential) structural paths.
