Invention Grant
US09330075B2 Method and apparatus for identifying garbage template article 有权
识别垃圾模板文章的方法和装置

Method and apparatus for identifying garbage template article
Abstract:
Method and apparatus for identifying garbage template articles in network communication field are disclosed. The method includes: extracting a feature from an eligible microblog article to generate an article feature including a punctuation feature, a topic feature, a bracket feature, a link feature and an account name feature; acquiring a garbage template list including garbage template feature, i.e. an article feature whose frequency reaches a preset threshold, wherein they are extracted in a same way; identifying the microblog article as a garbage template article when the article feature is the same as the garbage template feature. The apparatus includes: a feature extracting module, an acquiring module, and an identifying module. Features of a microblog article are extracted to determine whether the microblog article is a garbage template article, so that garbage template articles in the present microblog platform can be identified effectively and search engine resources are saved.
Public/Granted literature
Information query
Patent Agency Ranking
0/0