[XML4Lib] Re: [Web4lib] google sitemaps

Roy Tennant roy.tennant at ucop.edu
Tue Jun 28 13:05:52 EDT 2005


Google sitemaps are solely aimed at assisting web crawling software at 
finding all the pages it should crawl. This in no way replaces (or, in 
a sense, even supplements) metadata harvesting via OAI-PMH, which is a 
distinctly different activity. A parseable sitemap such as this should 
be very useful for web crawlers, however, whether you are Google or an 
individual library wanting to index a selected set of sites.
Roy

On Jun 28, 2005, at 9:46 AM, Eric Lease Morgan wrote:

>
> To what degree has anybody here learned how to exploit Google sitemap 
> files? See:
>
>   http://www.google.com/webmasters/sitemaps/docs/en/about.html
>
> Apparently these sitemaps are XML files that Google can read to create 
> more accurate crawls of a website. They seem to be an alternative or 
> supplement to OAI.
>
> -- 
> Eric Lease Morgan
> University Libraries of Notre Dame
>
> (574) 631-8604
>
> _______________________________________________
> Web4lib mailing list
> Web4lib at webjunction.org
> http://lists.webjunction.org/web4lib/
>



More information about the XML4Lib mailing list