Invention Grant
- Patent Title: Method and system for motif extraction in electronic documents
- Patent Title (中): 电子文件中图案提取的方法和系统
-
Application No.: US13608312Application Date: 2012-09-10
-
Publication No.: US09483463B2Publication Date: 2016-11-01
- Inventor: Matthias Galle , Jean-Michel Renders
- Applicant: Matthias Galle , Jean-Michel Renders
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Jones Robb, PLLC
- Main IPC: G10L15/22
- IPC: G10L15/22 ; G10L15/187 ; G06Q30/04 ; G06Q30/02 ; G06Q40/00 ; G06F17/27 ; G06F17/24 ; G06F19/18 ; G06F19/22 ; G06F7/24

Abstract:
A method, system, and computer program product for extracting text motifs from the electronic documents is disclosed. A user provides a largest-maximal repeat or a super-maximal repeat as a first text block. The occurrences of the first text block are detected to identify the second text blocks in the vicinity of the occurrences of the first text block on the basis of pre-defined parameters. The text motifs are determined by combining the first text block and the second text block. Finally, the text motifs are extracted from the electronic documents.
Public/Granted literature
- US20140074455A1 METHOD AND SYSTEM FOR MOTIF EXTRACTION IN ELECTRONIC DOCUMENTS Public/Granted day:2014-03-13
Information query