Invention Grant
US09075796B2 Text mining for large medical text datasets and corresponding medical text classification using informative feature selection
有权
用于大型医学文本数据集的文本挖掘和使用信息特征选择的相应的医学文本分类
- Patent Title: Text mining for large medical text datasets and corresponding medical text classification using informative feature selection
- Patent Title (中): 用于大型医学文本数据集的文本挖掘和使用信息特征选择的相应的医学文本分类
-
Application No.: US13715186Application Date: 2012-12-14
-
Publication No.: US09075796B2Publication Date: 2015-07-07
- Inventor: Marianthi Markatou , Robert Ball , Taxiarchis Botsis , Michael D. Nguyen , Emily J. Woo
- Applicant: The National Institutes of Health, a component of the US Department of Health and Human Services , International Business Machines Corporation
- Applicant Address: US NY Armonk US DC Washington
- Assignee: International Business Machines Corporation,The National Institutes of Health, A Component of the US Department of Health and Human Services
- Current Assignee: International Business Machines Corporation,The National Institutes of Health, A Component of the US Department of Health and Human Services
- Current Assignee Address: US NY Armonk US DC Washington
- Agency: Harrington & Smith
- Agent Louis J. Percello
- Main IPC: G06F17/28
- IPC: G06F17/28 ; G06F17/30 ; G06F19/00

Abstract:
Techniques include performing text mining on a set of case reports in text format to determine a set of grammar rules to be used to determine whether case reports meet a medical condition. The text mining includes performing feature selection, used to determine the set of grammar rules, that combines standardized case definitions with experience of medical officers for the medical condition and outputting the set of grammar rules. Another technique includes applying grammar rule(s) to new case report(s), the grammar rule(s) previously determined at least by performing text mining comprised of performing feature selection, used to determine the set of grammar rules, that combines standardized case definitions with experience of medical officers for the medical condition. Indication(s) are output of whether the new case report(s) meet or do not meet the medical condition. The techniques may be performed by a method, an apparatus, and a program product.
Public/Granted literature
Information query