Invention Grant
US07957957B2 Method and apparatus for discovering and classifying polysemous word instances in web documents
有权
在Web文档中发现和分类多义词实例的方法和装置
- Patent Title: Method and apparatus for discovering and classifying polysemous word instances in web documents
- Patent Title (中): 在Web文档中发现和分类多义词实例的方法和装置
-
Application No.: US11957190Application Date: 2007-12-14
-
Publication No.: US07957957B2Publication Date: 2011-06-07
- Inventor: Richard Michael King
- Applicant: Richard Michael King
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Hickman Palermo Truong & Becker LLP
- Main IPC: G06F17/21
- IPC: G06F17/21

Abstract:
A method and apparatus for discovering polysemous words and classifying polysemous words found in web documents. All document corpi in any natural language have words that have multiple usage contexts or words that have multiple meanings. Semantic analysis is not feasible for classifying all word occurrences in all documents on the web, which contain trillions of words in total. In addition, semantic analysis typically cannot distinguish multiple usages of a given meaning of a given word. In one embodiment of this invention, polysemous words in natural languages can be discovered by analyzing the co-occurrence of other words with the polysemous word in web documents. In one embodiment, the multiple meanings and usages of a polysemous word can be determined by analyzing the co-occurrences of other words with the polysemous word. No semantic analysis is used in discovering or classifying polysemous words.
Public/Granted literature
- US20090157390A1 Method and Apparatus for Discovering and Classifying Polysemous Word Instances in Web Documents Public/Granted day:2009-06-18
Information query