• Patent Title: Document analysis apparatus, document analysis method, and computer-readable recording medium
  • Application No.: US17441338
    Application Date: 2019-03-29
  • Publication No.: US11645448B2
    Publication Date: 2023-05-09
  • Inventor: Ayako Hoshino
  • Applicant: NEC Corporation
  • Applicant Address: JP Tokyo
  • Assignee: NEC CORPORATION
  • Current Assignee: NEC CORPORATION
  • Current Assignee Address: JP Tokyo
  • International Application: PCT/JP2019/014200 2019.03.29
  • International Announcement: WO2020/202324A 2020.10.08
  • Date entered country: 2021-09-21
  • Main IPC: G06F40/137
  • IPC: G06F40/137 G06F40/166
Document analysis apparatus, document analysis method, and computer-readable recording medium
Abstract:
A document analysis apparatus 10 includes: a candidate generation unit 11 that, for each line included in a document that is a target of structural analysis, specifies another line in a parallel relationship with the line by performing extraction of a marker indicating a hierarchy, and generates a candidate for a hierarchical structure of the document that is the target based on the result of the specification of each line; and a candidate evaluation unit 12 that, if two or more candidates have been generated, performs evaluation on each candidate for the hierarchical structure, and selects one candidate for the hierarchical structure as the hierarchical structure of the document that is the target based on the evaluation result.
Information query
Patent Agency Ranking
0/0