Invention Grant
- Patent Title: Identifying gibberish content in resources
- Patent Title (中): 识别资源中的乱七八糟的内容
-
Application No.: US12486626Application Date: 2009-06-17
-
Publication No.: US08554769B1Publication Date: 2013-10-08
- Inventor: Shashidhar A. Thakur , Sushrut Karanjkar , Pavel Levin , Thorsten Brants
- Applicant: Shashidhar A. Thakur , Sushrut Karanjkar , Pavel Levin , Thorsten Brants
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
This specification describes technologies relating to providing search results. One aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving a network resource, the network resource including text content; generating a language model score for the resource including applying a language model to the text content of the resource; generating a query stuffing score for the reference, the query stuffing score being a function of term frequency in the resource content and a query index; calculating a gibberish score for the resource using the language model score and the query stuffing score; and using the calculated gibberish score to determine whether to modify a ranking score of the resource.
Public/Granted literature
- US1688908A Refrigerator Public/Granted day:1928-10-23
Information query