Invention Grant
- Patent Title: Exploiting structured content for unsupervised natural language semantic parsing
-
Application No.: US13773269Application Date: 2013-02-21
-
Publication No.: US10235358B2Publication Date: 2019-03-19
- Inventor: Gokhan Tur , Dilek Hakkani-Tur , Larry Heck , Minwoo Jeong , Ye-Yi Wang
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Main IPC: G06F17/28
- IPC: G06F17/28 ; G06F17/27

Abstract:
Structured web pages are accessed and parsed to obtain implicit annotation for natural language understanding tasks. Search queries that hit these structured web pages are automatically mined for information that is used to semantically annotate the queries. The automatically annotated queries may be used for automatically building statistical unsupervised slot filling models without using a semantic annotation guideline. For example, tags that are located on a structured web page that are associated with the search query may be used to annotate the query. The mined search queries may be filtered to create a set of queries that is in a form of a natural language query and/or remove queries that are difficult to parse. A natural language model may be trained using the resulting mined queries. Some queries may be set aside for testing and the model may be adapted using in-domain sentences that are not annotated. The models may be tested using these implicitly annotated natural-language-like queries in an unsupervised fashion.
Public/Granted literature
- US20140236575A1 EXPLOITING THE SEMANTIC WEB FOR UNSUPERVISED NATURAL LANGUAGE SEMANTIC PARSING Public/Granted day:2014-08-21
Information query