Invention Grant
US08554769B1 Identifying gibberish content in resources (Active)

Identifying gibberish content in resources
Abstract:
This specification describes technologies relating to providing search results. One aspect of the subject matter described in this specification can be embodied in methods that include the actions of: receiving a network resource that includes text content; generating a language model score for the resource by applying a language model to the text content of the resource; generating a query stuffing score for the resource, the query stuffing score being a function of term frequency in the resource content and a query index; calculating a gibberish score for the resource using the language model score and the query stuffing score; and using the calculated gibberish score to determine whether to modify a ranking score of the resource.
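
The sequence of actions in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the language model, the exact form of the query stuffing function, how the two scores are combined, or how the ranking score is modified, so every concrete choice here is an assumption made for illustration only: a toy unigram model with a fixed probability for unseen terms, a stuffing score equal to the share of term occurrences that appear in a set-based query index, a combination by maximum, and a multiplicative penalty above a threshold. All function names, parameters, and constants are hypothetical.

```python
import math
from collections import Counter

# Assumed probability for terms the toy language model has never seen.
UNSEEN_PROB = 1e-6


def language_model_score(tokens, unigram_probs):
    """Average log-probability per token under a toy unigram model (assumption)."""
    if not tokens:
        return 0.0
    total = sum(math.log(unigram_probs.get(t, UNSEEN_PROB)) for t in tokens)
    return total / len(tokens)


def query_stuffing_score(tokens, query_index):
    """Share of term occurrences whose term appears in the query index (assumption)."""
    if not tokens:
        return 0.0
    counts = Counter(tokens)  # term frequency in the resource content
    stuffed = sum(n for term, n in counts.items() if term in query_index)
    return stuffed / len(tokens)


def gibberish_score(lm_score, stuffing_score):
    """Combine both signals into [0, 1]; higher means more gibberish-like.

    The language model score is rescaled so that text made only of unseen
    terms maps to 1.0. Taking the maximum is an illustrative choice.
    """
    lm_component = min(1.0, max(0.0, lm_score / math.log(UNSEEN_PROB)))
    return max(lm_component, stuffing_score)


def adjust_ranking(ranking_score, gibberish, threshold=0.6, penalty=0.5):
    """Demote the resource only when the gibberish score crosses the threshold."""
    if gibberish >= threshold:
        return ranking_score * penalty
    return ranking_score


# Usage: a page that repeats popular query terms and little else.
tokens = "cheap flights cheap flights cheap hotels".split()
query_index = {"cheap", "flights", "hotels"}   # hypothetical index of query terms
unigram_probs = {"cheap": 0.001, "flights": 0.0005, "hotels": 0.0005}

lm = language_model_score(tokens, unigram_probs)
stuffing = query_stuffing_score(tokens, query_index)
score = gibberish_score(lm, stuffing)
new_rank = adjust_ranking(10.0, score)
```

In this sketch the threshold decides whether the ranking score is modified at all, which mirrors the final action in the abstract; a real system could equally apply a graded demotion.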