DISCUSSION SUMMARY
    1.
    发明申请
    DISCUSSION SUMMARY 审中-公开
    讨论摘要

    公开(公告)号:US20150120680A1

    公开(公告)日:2015-04-30

    申请号:US14062307

    申请日:2013-10-24

    CPC classification number: G06F16/345 G06F16/332 G06F16/9535

    Abstract: One or more techniques and/or systems are provided for providing a discussion summary corresponding to a search query and/or for providing discussion session search results. For example, discussion data (e.g., corresponding to real-time messaging, such as a microblog discussion) may be evaluated to identify a discussion topic for a discussion sessions (e.g., a kitchen renovation topic may be assigned to a 1 hour exchange of kitchen renovation messages by a discussion group). A discussion summary of a discussion session may be provided based upon the discussion session having a discussion topic corresponding to a search query topic of a search query. The discussion summary may be provided along with other results for the query and may describe the discussion group, identifiers such as hashtags used by the discussion group, meeting dates/times, average number(s) of participants, other discussion sessions hosted by the discussion group, future discussion sessions, and/or other information.

    Abstract translation: 提供一个或多个技术和/或系统用于提供对应于搜索查询的讨论摘要和/或用于提供讨论会话搜索结果。 例如,可以评估讨论数据(例如,对应于实时消息,例如微博讨论)以识别讨论会议的讨论话题(例如,厨房翻新主题可以被分配给1小时厨房交换 讨论组的翻新消息)。 可以基于具有与搜索查询的搜索查询主题相对应的讨论主题的讨论会话来提供讨论会话的讨论摘要。 讨论摘要可以与查询的其他结果一起提供,并且可以描述讨论组,诸如讨论组使用的标签,标识符,会议日期/时间,参与者的平均次数,由讨论主持的其他讨论会 组,未来讨论会和/或其他信息。

    PRESERVING GEOMETRIC PROPERTIES OF DATASETS WHILE PROTECTING PRIVACY
    2.
    发明申请
    PRESERVING GEOMETRIC PROPERTIES OF DATASETS WHILE PROTECTING PRIVACY 审中-公开
    保护隐私保护数据库的几何属性

    公开(公告)号:US20140196151A1

    公开(公告)日:2014-07-10

    申请号:US13737947

    申请日:2013-01-10

    CPC classification number: G06F16/258

    Abstract: The privacy of a dataset is protected. A private dataset is received that includes multiple rows of multidimensional data. Each row may correspond to a user, and each dimension may be an attribute of the user. A projection matrix is applied to each row to generate a lower dimensional sketch of the row. Noise is added to each of the lower dimensional sketches. The sketches with the added noise may be published together with the projection matrix. The sketches preserve geometric relationships of the original dataset including clustering, distances, and nearest neighbor, and therefore may be useful for data mining purposes while still protecting the privacy of the users.

    Abstract translation: 数据集的隐私受到保护。 收到包含多行多维数据的专用数据集。 每行可以对应于用户,并且每个维度可以是用户的属性。 投影矩阵应用于每一行以生成行的低维草图。 噪音被添加到每个较低维度的草图。 附加噪音的草图可能与投影矩阵一起发布。 草图保留原始数据集的几何关系,包括聚类,距离和最近邻,因此可能对数据挖掘目的有用,同时仍保护用户的隐私。

    DETERMINING SEGMENTS FOR DOCUMENTS
    3.
    发明申请
    DETERMINING SEGMENTS FOR DOCUMENTS 审中-公开
    确定文件的部分

    公开(公告)号:US20160070692A1

    公开(公告)日:2016-03-10

    申请号:US14482015

    申请日:2014-09-10

    CPC classification number: G06F17/27 G06F17/277

    Abstract: A document is received for segmentation. The document includes multiple atomic textual units in a sequence. These units may correspond to sentences, phrases, paragraphs, concept phrases, chapters, etc. A distance function is selected that determines a distance between one set of atomic textual units and another set of atomic textual units. The distance between the sets is large for sets that are dissimilar, and small for sets that are similar. The distance function is applied to the atomic textual units to separate each of the atomic textual units into multiple segments, while maintaining the sequence of the atomic textual units.

    Abstract translation: 收到文档进行分割。 该文档包括序列中的多个原子文本单元。 这些单位可以对应于句子,短语,段落,概念短语,章节等。选择距离函数,其确定一组原子文本单元与另一组原子文本单位之间的距离。 对于不相似的集合,集合之间的距离很大,对于类似的集合,集合之间的距离很大。 距离函数被应用于原子文本单元,以将每个原子文本单元分成多个段,同时保持原子文本单元的序列。

    TOPIC IDENTIFIERS ASSOCIATED WITH GROUP CHATS
    4.
    发明申请
    TOPIC IDENTIFIERS ASSOCIATED WITH GROUP CHATS 审中-公开
    与集团相关的主题标识符

    公开(公告)号:US20140324982A1

    公开(公告)日:2014-10-30

    申请号:US13872175

    申请日:2013-04-29

    CPC classification number: H04L65/403 H04L12/1831 H04L51/04 H04L51/16

    Abstract: Text messages over some period of time are collected. Topic identifiers, such as hashtags, are extracted from the text messages. The text messages associated with each topic identifier are processed to identify which topic identifiers are associated with group chats based on information associated with the text messages such as the times when the text messages were generated and whether the text messages identify user accounts. The topic identifiers that are determined to be associated with the group chats are incorporated into applications that allow users to search for group chats, and to view text messages from past group chats.

    Abstract translation: 收集一段时间内的短信。 从文本消息中提取主题标识符(如主题标签)。 处理与每个主题标识符相关联的文本消息,以基于与文本消息相关联的信息(例如,生成文本消息的时间)以及文本消息是否识别用户帐户来识别哪些主题标识符与组聊天相关联。 被确定为与组聊天相关联的主题标识符被合并到允许用户搜索组聊天并且从过去的组聊天中查看文本消息的应用中。

    Associating content items with document sections

    公开(公告)号:US10216833B2

    公开(公告)日:2019-02-26

    申请号:US14481946

    申请日:2014-09-10

    Abstract: A document such as a book or textbook includes multiple sections such as chapters. Concept phrases are determined for each of the sections based on the text of each section. A set of content items such as videos is received, and each content item is associated with one or more queries that were submitted by users who were provided the content item in a set of search results. These queries are processed to determine concept phrases that are associated with the content items. The content items and their associated concept phrases are compared with the concept phrases associated with the sections to determine, for some or all of the content items, a minimum subset of the sections whose associated concept phrases cover most of the concept phrases that are associated with the content item. The content items are inserted or linked with the sections in their corresponding minimum subsets.

    Ranking relevant discussion groups

    公开(公告)号:US09819618B2

    公开(公告)日:2017-11-14

    申请号:US14307912

    申请日:2014-06-18

    Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.

    Navigational aid for electronic books and documents

    公开(公告)号:US09720914B2

    公开(公告)日:2017-08-01

    申请号:US14523404

    申请日:2014-10-24

    CPC classification number: G06F17/30014 G06F17/30687

    Abstract: Systems, methods, and computer storage media are provided for generating rich navigational study aids for electronic books. For a particular section of interest in a document, one or more related sections for providing additional context to the particular section are determined. The related sections are ranked based on a score indicating significance to the particular section. Based on a user's information processing preference, a set of ranked navigational links to each related section is presented to the user for additional context related to the particular section.

    ASSOCIATING CONTENT ITEMS WITH DOCUMENT SECTIONS
    8.
    发明申请
    ASSOCIATING CONTENT ITEMS WITH DOCUMENT SECTIONS 审中-公开
    与文件部分相关的内容项目

    公开(公告)号:US20160070782A1

    公开(公告)日:2016-03-10

    申请号:US14481946

    申请日:2014-09-10

    Abstract: A document such as a book or textbook includes multiple sections such as chapters. Concept phrases are determined for each of the sections based on the text of each section. A set of content items such as videos is received, and each content item is associated with one or more queries that were submitted by users who were provided the content item in a set of search results. These queries are processed to determine concept phrases that are associated with the content items. The content items and their associated concept phrases are compared with the concept phrases associated with the sections to determine, for some or all of the content items, a minimum subset of the sections whose associated concept phrases cover most of the concept phrases that are associated with the content item. The content items are inserted or linked with the sections in their corresponding minimum subsets.

    Abstract translation: 诸如书籍或教科书的文件包括多个部分,如章节。 基于每个部分的文本为每个部分确定概念短语。 接收诸如视频的一组内容项目,并且每个内容项目与在一组搜索结果中被提供内容项目的用户提交的一个或多个查询相关联。 处理这些查询以确定与内容项相关联的概念短语。 将内容项目及其相关联的概念短语与与该部分相关联的概念短语进行比较,以确定部分或全部内容项目的相关概念短语涵盖大部分与 内容项。 内容项目被插入或链接到相应的最小子集中的部分。

    RANKING RELEVANT DISCUSSION GROUPS
    9.
    发明申请
    RANKING RELEVANT DISCUSSION GROUPS 有权
    排名相关讨论组

    公开(公告)号:US20150370797A1

    公开(公告)日:2015-12-24

    申请号:US14307912

    申请日:2014-06-18

    Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.

    Abstract translation: 收集和处理消息以确定与讨论组相对应的主题标识符。 收到查询,并且与查询相关的多个讨论组基于与讨论组相关联的消息和与讨论组相关联的主题标识符来确定。 相关讨论组使用组偏好模型进行排名,该组偏好模型通过选择作为特定组中的权限的消息作者,并且探索被选择的作者优选的讨论组来模拟考虑讨论组的假设寻求者的行为。 使用固定的马尔可夫过程来模拟寻求者的行为,并用于产生用于对相关讨论组进行排名的概率分布。 针对查询提供了排名相关的讨论组。

Patent Agency Ranking