Invention Grant
- Patent Title: Header-token driven automatic text segmentation
- Application No.: US14724269
- Application Date: 2015-05-28
- Publication No.: US09529862B2
- Publication Date: 2016-12-27
- Inventor: Badrul M. Sarwar, John A. Mount
- Applicant: PayPal, Inc.
- Applicant Address: US CA San Jose
- Assignee: PAYPAL, INC.
- Current Assignee: PAYPAL, INC.
- Current Assignee Address: US CA San Jose
- Agency: Maschoff Brennan
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F17/27

Abstract:
A method and a system to automatically segment text based on header tokens are described. A relevance value and an irrelevance value are determined for each token in a description, assuming no tokens are left out of computations. The irrelevance value is based on occurrences of a token in a sample set of descriptions. The relevance value is an estimated probability of relevance based on the header of the description being segmented.
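
The two per-token values in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not give the formulas, so both estimators here are assumptions chosen for illustration. Irrelevance is taken as the fraction of sample descriptions containing the token, and relevance as the fraction of those containing descriptions whose header shares at least one token with the target header. The names `tokenize` and `token_scores`, the whitespace tokenizer, and the sample data layout are all hypothetical.

```python
def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the source does not specify one.
    return text.lower().split()


def token_scores(header, description, samples):
    """Return {token: (relevance, irrelevance)} for each token in `description`.

    `samples` is a list of (header, description) pairs standing in for the
    "sample set of descriptions". Both formulas below are illustrative
    assumptions, not taken from the patent.
    """
    header_tokens = set(tokenize(header))
    # Pre-tokenize the sample set once.
    sample_sets = [(set(tokenize(h)), set(tokenize(d))) for h, d in samples]
    total = len(sample_sets)

    scores = {}
    # dict.fromkeys keeps each distinct token once, in first-seen order,
    # so no token of the description is left out of the computation.
    for tok in dict.fromkeys(tokenize(description)):
        # Headers of the sample descriptions that contain this token.
        containing = [h for h, d in sample_sets if tok in d]
        # Irrelevance (assumed): how common the token is across the sample set.
        irrelevance = len(containing) / total if total else 0.0
        # Relevance (assumed): of the descriptions containing the token,
        # the share whose header overlaps the target header.
        matching = [h for h in containing if header_tokens & h]
        relevance = len(matching) / len(containing) if containing else 0.0
        scores[tok] = (relevance, irrelevance)
    return scores


if __name__ == "__main__":
    samples = [
        ("red shoes", "red leather shoes free shipping"),
        ("blue hat", "blue wool hat free shipping"),
        ("red hat", "red cotton hat"),
    ]
    for tok, (rel, irr) in token_scores(
        "red shoes", "leather shoes free shipping", samples
    ).items():
        print(f"{tok}: relevance={rel:.2f} irrelevance={irr:.2f}")
```

In this sketch a token such as "free", which appears under unrelated headers, gets a lower relevance and a higher irrelevance than "leather", which appears only under a matching header. How the two values would be combined into segment boundaries is not stated in the abstract and is left out here.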
Public/Granted literature
- US20150261761A1 HEADER-TOKEN DRIVEN AUTOMATIC TEXT SEGMENTATION Public/Granted day: 2015-09-17