Invention Grant
- Patent Title: Refining a dictionary for information extraction
- Patent Title (中): 修改信息提取字典 (Modifying an information extraction dictionary)
- Application No.: US13598946
- Application Date: 2012-08-30
- Publication No.: US08775419B2
- Publication Date: 2014-07-08
- Inventor: Laura Chiticariu, Vitaly Feldman, Frederick R. Reiss, Huaiyu Zhu, Sudeepa Roy
- Applicant: Laura Chiticariu, Vitaly Feldman, Frederick R. Reiss, Huaiyu Zhu, Sudeepa Roy
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent: Jeffrey T. Holman
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A method for refining a dictionary for information extraction, the operations including: inputting a set of extracted results from execution of an extractor comprising the dictionary on a collection of text, wherein the extracted results are labeled as correct results or incorrect results; processing the extracted results using an algorithm configured to set a score of the extractor above a score threshold, wherein the score threshold balances a precision and a recall of the extractor; and outputting a set of candidate dictionary entries corresponding to a full set of dictionary entries, wherein the candidate dictionary entries are candidates to be removed from the dictionary based on the extracted results.
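The flow described in the abstract (labeled results in, removal candidates out) can be illustrated with a minimal sketch. This is not the patented algorithm: it assumes an F-beta score as the measure that balances precision and recall, assumes each extracted result is attributed to exactly one dictionary entry, and uses a simple greedy search. The names `f_score` and `candidate_removals` are hypothetical.

```python
from collections import defaultdict


def f_score(tp, fp, total_correct, beta=1.0):
    """F-beta score from true positives, false positives, and the number
    of correct results the unrefined extractor produced (assumed measure)."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / total_correct
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def candidate_removals(results, threshold, beta=1.0):
    """Greedily pick dictionary entries to remove until the score exceeds
    the threshold or no single removal improves it.

    results: iterable of (entry, is_correct) pairs, one per extracted
    result, where `entry` is the dictionary entry that produced it.
    Returns (list of candidate entries, final score).
    """
    counts = defaultdict(lambda: [0, 0])  # entry -> [correct, incorrect]
    for entry, is_correct in results:
        counts[entry][0 if is_correct else 1] += 1

    tp = sum(c for c, _ in counts.values())
    fp = sum(i for _, i in counts.values())
    total_correct = tp  # recall is measured against the original output
    score = f_score(tp, fp, total_correct, beta)
    removed = []

    while score <= threshold:
        best_entry, best_score = None, score
        for entry, (c, i) in counts.items():
            s = f_score(tp - c, fp - i, total_correct, beta)
            if s > best_score:
                best_entry, best_score = entry, s
        if best_entry is None:
            break  # no single removal improves the score
        c, i = counts.pop(best_entry)
        tp, fp = tp - c, fp - i
        removed.append(best_entry)
        score = best_score

    return removed, score
```

In this sketch, recall can only fall as entries are removed, so the loop trades a small loss in recall for a larger gain in precision. A user would then review the returned candidates before actually deleting anything from the dictionary.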
Public/Granted literature
- US20130318076A1 REFINING A DICTIONARY FOR INFORMATION EXTRACTION, Public/Granted day: 2013-11-28