Invention Grant
- Patent Title: Method and apparatus for identifying garbage template article
- Patent Title (中): 识别垃圾模板文章的方法和装置
-
Application No.: US14428314Application Date: 2013-09-17
-
Publication No.: US09330075B2Publication Date: 2016-05-03
- Inventor: Zhixin Hao , Jianguo He , Guoqiang Zhang , Xiaochen He
- Applicant: Tencent Technology (Shenzhen) Company Limited
- Applicant Address: CN
- Assignee: Tencent Technology (Shenzhen) Company Limited
- Current Assignee: Tencent Technology (Shenzhen) Company Limited
- Current Assignee Address: CN
- Agency: BrainSpark Associates, LLC
- International Application: PCT/CN2013/083613 WO 20130917
- International Announcement: WO2014/040570 WO 20140320
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F17/22 ; G06F17/27 ; H04L29/06 ; H04L12/58 ; H04L29/08

Abstract:
Method and apparatus for identifying garbage template articles in network communication field are disclosed. The method includes: extracting a feature from an eligible microblog article to generate an article feature including a punctuation feature, a topic feature, a bracket feature, a link feature and an account name feature; acquiring a garbage template list including garbage template feature, i.e. an article feature whose frequency reaches a preset threshold, wherein they are extracted in a same way; identifying the microblog article as a garbage template article when the article feature is the same as the garbage template feature. The apparatus includes: a feature extracting module, an acquiring module, and an identifying module. Features of a microblog article are extracted to determine whether the microblog article is a garbage template article, so that garbage template articles in the present microblog platform can be identified effectively and search engine resources are saved.
Public/Granted literature
- US20150227497A1 METHOD AND APPARATUS FOR IDENTIFYING GARBAGE TEMPLATE ARTICLE Public/Granted day:2015-08-13
Information query