-
Publication No.: GB2579957A
Publication date: 2020-07-08
Application No.: GB202003261
Filing date: 2018-08-02
Applicant: IBM
Inventor: BENJAMIN SEGAL, BRANIMIR BOGURAEV, ESME MANANDISE
IPC: G06F40/20
Abstract: A method includes obtaining an input text, identifying a first term in the input text, and accessing lexicon data to identify a first entry corresponding to the first term. The first entry includes core data corresponding to domain-independent lexical information for the first term, and non-core data corresponding to domain-specific lexical information for the first term. The method also includes determining that the non-core data of the first entry identifies a second term in the input text as a modifier of the first term. The method further includes generating a partially parsed and bracketed version of the input text. The partially parsed and bracketed version indicates that the second term modifies the first term in the input text. The method also includes generating a parsed version of the input text based on the partially parsed and bracketed version of the input text.
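Illustrative sketch (not taken from the patent): the bracketing step described in the abstract could look roughly like the Python below. The `LexiconEntry` class, the `modifiers` key, the `bracket_modifiers` function, and the sample "hard drive" data are all hypothetical names chosen for illustration. The sketch also assumes that a modifier immediately precedes the term it modifies, which the abstract does not state.

```python
from dataclasses import dataclass, field


@dataclass
class LexiconEntry:
    """Hypothetical lexicon entry split into core and non-core data."""
    term: str
    # Domain-independent lexical information (e.g. parts of speech).
    core: dict = field(default_factory=dict)
    # Domain-specific lexical information (e.g. known modifiers).
    non_core: dict = field(default_factory=dict)


def bracket_modifiers(tokens, lexicon):
    """Bracket each (modifier, head) pair licensed by non-core data.

    Assumption: the modifier is the token directly before the head term.
    """
    out = []
    i = 0
    while i < len(tokens):
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        entry = lexicon.get(nxt.lower()) if nxt is not None else None
        if entry and tokens[i].lower() in entry.non_core.get("modifiers", set()):
            # The non-core data names tokens[i] as a modifier of nxt.
            out.append(f"[{tokens[i]} {nxt}]")
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


# Made-up sample data for an IT-support domain.
lexicon = {
    "drive": LexiconEntry(
        term="drive",
        core={"pos": ["noun", "verb"]},
        non_core={"modifiers": {"hard", "flash"}},
    ),
}

partially_parsed = bracket_modifiers("Replace the hard drive".split(), lexicon)
print(partially_parsed)
```

In this sketch the bracketed token list stands in for the "partially parsed and bracketed version"; a full parser would then consume it to produce the final parse, a step not shown here.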
-
Publication No.: GB2579326A
Publication date: 2020-06-17
Application No.: GB202003195
Filing date: 2018-08-02
Applicant: IBM
Inventor: BENJAMIN PATRICK SEGAL, BRANIMIR BOGURAEV, ESME MANANDISE
IPC: G06F16/00
Abstract: A method includes performing, at a device, an analysis of a domain-specific corpus to identify a base term and a modifier term. The modifier term modifies the base term in at least a portion of the domain-specific corpus. The method also includes accessing, by the device, a first entry in lexicon data. The first entry includes core data corresponding to domain-independent lexical information for the base term. The method further includes adding, based on the analysis, non-core data to the first entry. The non-core data corresponds to domain-specific lexical information for the base term. The non-core data identifies the modifier term as a domain-specific modifier of the base term.
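Illustrative sketch (not taken from the patent): one simple way to approximate the corpus analysis and lexicon update described in the abstract is to count words that directly precede a base term and record the frequent ones as domain-specific modifiers. The function names `find_modifiers` and `add_non_core`, the `min_count` threshold, the dictionary layout, and the sample corpus are all assumptions made for illustration. The abstract does not say how modifiers are detected.

```python
from collections import Counter


def find_modifiers(corpus_sentences, base_term, min_count=2):
    """Return words that precede base_term at least min_count times.

    Assumption: adjacency plus a frequency threshold is a stand-in for
    whatever analysis the actual method performs.
    """
    counts = Counter()
    for sentence in corpus_sentences:
        words = sentence.lower().split()
        for prev, cur in zip(words, words[1:]):
            if cur == base_term:
                counts[prev] += 1
    return {word for word, n in counts.items() if n >= min_count}


def add_non_core(lexicon, base_term, modifiers):
    """Add domain-specific modifiers to an entry, leaving core data untouched."""
    entry = lexicon[base_term]
    entry.setdefault("non_core", {}).setdefault("modifiers", set()).update(modifiers)
    return entry


# Made-up lexicon with only core (domain-independent) data to start.
lexicon = {"drive": {"core": {"pos": ["noun", "verb"]}}}

# Made-up domain-specific corpus.
corpus = [
    "the hard drive failed",
    "replace the hard drive",
    "a drive letter",
]

mods = find_modifiers(corpus, "drive")
add_non_core(lexicon, "drive", mods)
print(lexicon["drive"])
```

Keeping the modifiers under a separate `non_core` key mirrors the abstract's split between domain-independent and domain-specific information, so the same core entry could in principle be reused across domains.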
-