Invention Grant
- Patent Title: Automated client sitemap generation
- Patent Title (中): 自动客户端站点地图生成
-
Application No.: US12028502Application Date: 2008-02-08
-
Publication No.: US08126869B2Publication Date: 2012-02-28
- Inventor: Ian V. Hollier , Martina Hiemstra
- Applicant: Ian V. Hollier , Martina Hiemstra
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Shook Hardy & Bacon LLP
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
Methods and computer-storage media for automated generation of domain sitemap files are provided. A universal resource locator (URL) for a web site having a plurality of web pages associated therewith is received. Log files and permission controls are analyzed to ascertain whether each web page has been previously crawled and which web pages may be crawled and/or indexed. The permitted, not-previously-crawled web pages are subsequently crawled and the relational structure of the web site is ascertained. Other items of metadata, such as web page modification frequency or priority values, also are determined. Once the structure and metadata are available, a current sitemap is generated that provides the hierarchy and related details in the form of metadata. The sitemap file is then written to a disk and may then be sent to search engines as generated or in a compressed format.
Public/Granted literature
- US20090204638A1 AUTOMATED CLIENT SITEMAP GENERATION Public/Granted day:2009-08-13
Information query