Invention Grant
- Patent Title: System and method for extraction of off-topic part from conversation
- Patent Title (中): 脱离话题部分提取的系统和方法
- Application No.: US13740473
- Application Date: 2013-01-14
- Publication No.: US09002843B2
- Publication Date: 2015-04-07
- Inventor: Nobuyasu Itoh , Masafumi Nishimura , Yuto Yamaguchi
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Fleit Gibbons Gutman Bongini & Bianco PL
- Agent: Jose Gutman
- Priority: JP2012-004802 (2012-01-13)
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F17/27

Abstract:
A system and method extract off-topic parts from a conversation. The system includes: a first corpus containing documents from a plurality of fields; a second corpus containing only documents from the field to which the conversation belongs; a determination means that designates as a lower limit subject word any word whose IDF value for the first corpus and IDF value for the second corpus are each below a first predetermined threshold value; a score calculation part that calculates a TF-IDF value as the score of each word included in the second corpus; a clipping part that sequentially cuts out intervals from the text data representing the contents of the conversation; and an extraction part that extracts as an off-topic part any interval in which the average score of the words in the clipped interval is larger than a second predetermined threshold value.
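The pipeline in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. Several details are assumptions that the abstract does not state: documents are pre-tokenized word lists, IDF is `log(N / df)`, TF is the relative frequency of a word in the second corpus, lower limit subject words receive a fixed `floor_score`, words absent from the second corpus receive a caller-chosen `default` score, and intervals are fixed-size windows advanced one token at a time. All function and parameter names are hypothetical.

```python
import math
from collections import Counter


def idf(corpus):
    """IDF per word, assumed here to be log(N / df) over tokenized documents."""
    n_docs = len(corpus)
    df = Counter(word for doc in corpus for word in set(doc))
    return {word: math.log(n_docs / count) for word, count in df.items()}


def lower_limit_words(idf_general, idf_domain, threshold):
    """Words whose IDF is below the first threshold in BOTH corpora."""
    return {
        word
        for word, value in idf_domain.items()
        if value < threshold and idf_general.get(word, math.inf) < threshold
    }


def tf_idf_scores(domain_corpus, idf_domain, floor_words, floor_score=0.0):
    """Score each word of the second corpus by TF-IDF.

    Assumptions: TF is relative frequency in the second corpus, and
    lower limit subject words are pinned to floor_score.
    """
    counts = Counter(word for doc in domain_corpus for word in doc)
    total = sum(counts.values())
    return {
        word: floor_score if word in floor_words
        else (count / total) * idf_domain[word]
        for word, count in counts.items()
    }


def clip_intervals(tokens, size):
    """Sequentially cut fixed-size windows, advancing one token at a time."""
    return [(i, tokens[i:i + size]) for i in range(len(tokens) - size + 1)]


def off_topic_intervals(tokens, scores, size, threshold, default=0.0):
    """Return (start, end) spans whose mean word score exceeds the second threshold."""
    hits = []
    for start, window in clip_intervals(tokens, size):
        mean = sum(scores.get(word, default) for word in window) / size
        if mean > threshold:
            hits.append((start, start + size))
    return hits
```

A typical call sequence would compute `idf` for both corpora, pass the results to `lower_limit_words`, build the score table with `tf_idf_scores`, and then run `off_topic_intervals` over the tokenized conversation. The window size and both thresholds are tuning parameters that the abstract leaves unspecified.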
Public/Granted literature
- US20130185308A1 SYSTEM AND METHOD FOR EXTRACTION OF OFF-TOPIC PART FROM CONVERSATION, Public/Granted day: 2013-07-18