Humanitarian crisis analysis using secondary information gathered by a focused web crawler
Abstract:
A network is crawled using a trained learning model to identify a set of secondary-source documents related to an event. A hub page from the set of secondary-source documents is identified that includes a link predicted to link to a new relevant secondary-source document. The new document is added to the set of secondary-source documents. Information is extracted from the set of secondary-source documents. Feedback is received indicative of a relevancy level for the extracted information as applied to the event. Each document is classified into one or more categories related to the event, based on the extracted information and the received feedback information. A learning model is trained based on the received feedback.
Information query
Patent Agency Ranking
0/0