Invention Grant
US07870474B2 System and method for smoothing hierarchical data using isotonic regression
有权
使用等渗回归平滑分层数据的系统和方法
- Patent Title: System and method for smoothing hierarchical data using isotonic regression
- Patent Title (中): 使用等渗回归平滑分层数据的系统和方法
-
Application No.: US11800235Application Date: 2007-05-04
-
Publication No.: US07870474B2Publication Date: 2011-01-11
- Inventor: Deepayan Chakrabarti , Kunal Punera , Shanmugasundaram Ravikumar
- Applicant: Deepayan Chakrabarti , Kunal Punera , Shanmugasundaram Ravikumar
- Applicant Address: US CA San Jose
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA San Jose
- Main IPC: G06F17/00
- IPC: G06F17/00

Abstract:
An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
Public/Granted literature
- US20080275890A1 System and method for smoothing hierarchical data using isotonic regression Public/Granted day:2008-11-06
Information query