Self-orchestrated system for extraction, analysis, and presentation of entity data
Abstract:
A method for operating a self-orchestrated system for extraction, analysis, and presentation of entity data involves extracting a web page to object-based storage including web page content, web page metadata and a globally unique identifier. The method extracts the web page metadata from the object-based storage. The method inputs the web page metadata to a queue. The method pulls web page content from a content store. The method receives RegEx from a model parameter store. The method parses the web page content using RegEx and web page metadata. The method passes web page metadata and extracted content from the web page and positions of extracted content to an advanced analysis function decider (AAF Decider) for analysis. The method streams web page metadata and extracted content from the web page and positions of the extracted content to a JSON file batch for flattening.
Information query
Patent Agency Ranking
0/0