Invention Grant
US08880391B2 Natural language processing apparatus, natural language processing method, natural language processing program, and computer-readable recording medium storing natural language processing program 有权
自然语言处理装置,自然语言处理方法,自然语言处理程序和存储自然语言处理程序的计算机可读记录介质

  • Patent Title: Natural language processing apparatus, natural language processing method, natural language processing program, and computer-readable recording medium storing natural language processing program
  • Patent Title (中): 自然语言处理装置,自然语言处理方法,自然语言处理程序和存储自然语言处理程序的计算机可读记录介质
  • Application No.: US13581660
    Application Date: 2011-11-28
  • Publication No.: US08880391B2
    Publication Date: 2014-11-04
  • Inventor: Satoshi SekineHajime Wakahara
  • Applicant: Satoshi SekineHajime Wakahara
  • Applicant Address: JP Tokyo
  • Assignee: Rakuten, Inc.
  • Current Assignee: Rakuten, Inc.
  • Current Assignee Address: JP Tokyo
  • Agency: Sughrue Mion, PLLC
  • International Application: PCT/JP2011/077418 WO 20111128
  • International Announcement: WO2012/081386 WO 20120621
  • Main IPC: G06F17/27
  • IPC: G06F17/27 G06F17/28
Natural language processing apparatus, natural language processing method, natural language processing program, and computer-readable recording medium storing natural language processing program
Abstract:
A natural language processing apparatus includes a result acquisition unit that acquires a plurality of analysis results indicating parts of speech of morphemes contained in one or more common sentences from a plurality of types of morphological analyzers, a pattern acquisition unit that detects a common segmentation point in the plurality of analysis results, extracts one or more parts of speech corresponding to a character string segmented at the common segmentation point from each of the analysis results, and acquires a set of the parts of speech as a part-of-speech differing pattern, and a candidate specifying unit that extracts the part-of-speech differing pattern with the number of appearances being equal to or less than a predetermined threshold and specifies the character string corresponding to the extracted part-of-speech differing pattern as a character string containing a candidate for an unknown word.
Information query
Patent Agency Ranking
0/0