Invention Grant
- Patent Title: Incorrect hyperlink detecting apparatus and method
- Patent Title (中): 超链接检测装置和方法不正确
-
Application No.: US11623135Application Date: 2007-01-15
-
Publication No.: US08359294B2Publication Date: 2013-01-22
- Inventor: Noriko Ohshima
- Applicant: Noriko Ohshima
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Steven E. Bach
- Priority: JP2006-006720 20060113
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/00

Abstract:
An incorrect hyperlink detecting apparatus which can detect a semantic inconsistency of a hyperlink with high accuracy is provided. An incorrect hyperlink detecting apparatus 10 includes a link source text extracting unit 12 for extracting a text from an HTML file 26 of a link source, a link destination text extracting unit 14 for extracting a text from the HTML file 26 of a link destination, a morpheme analysis unit 16 for dissolving the extracted texts into words, a weighting unit 18 for assigning a weightier every part of speech, a consistency rate calculating unit 20 for calculating a rate that the words of the link source are included in the words of the link destination as a consistency rate from the link source to the link destination and a rate that the words of the link destination are included in the words of the link source as a consistency rate from the link destination to the link source, degree of association calculating unit 22 for calculating a degree of association which indicates a probability of the hyperlink in response to both of the consistency rates, and a CSV output unit 24 for outputting the consistency rate and the degree of association in a CSV form.
Public/Granted literature
- US20080172220A1 Incorrect Hyperlink Detecting Apparatus and Method Public/Granted day:2008-07-17
Information query