[XML4Lib] Re: [Web4lib] google sitemaps

Roy Tennant roy.tennant at ucop.edu
Tue Jun 28 13:05:52 EDT 2005

Google sitemaps are aimed solely at helping web crawling software 
find all the pages it should crawl. This in no way replaces (or, in 
a sense, even supplements) metadata harvesting via OAI-PMH, which is a 
distinctly different activity. A parseable sitemap such as this should 
be very useful for web crawlers, however, whether you are Google or an 
individual library wanting to index a selected set of sites.
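
As a rough illustration of what "parseable" buys a crawler, here is a 
minimal Python sketch that pulls the page URLs out of a sitemap. The 
sample document, its placeholder namespace URI, the example.edu 
addresses, and the helper names local_name and sitemap_urls are all 
assumptions made for illustration; check Google's documentation for the 
actual schema. The code matches elements by local name, so it should not 
depend on which namespace a given sitemap declares.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample sitemap. The namespace URI and the URLs below are
# placeholders, not values taken from Google's documentation.
SAMPLE = """<urlset xmlns="http://example.org/schemas/sitemap">
  <url>
    <loc>http://library.example.edu/</loc>
    <lastmod>2005-06-01</lastmod>
  </url>
  <url>
    <loc>http://library.example.edu/catalog/</loc>
  </url>
</urlset>
"""


def local_name(tag):
    """Strip any '{namespace}' prefix that ElementTree puts on a tag."""
    return tag.rsplit("}", 1)[-1]


def sitemap_urls(xml_text):
    """Return the text of every <loc> element, in document order."""
    root = ET.fromstring(xml_text)
    return [
        el.text.strip()
        for el in root.iter()
        if local_name(el.tag) == "loc" and el.text
    ]


if __name__ == "__main__":
    # A crawler would feed these URLs into its fetch queue.
    for url in sitemap_urls(SAMPLE):
        print(url)
```

Note that this only yields a list of pages to fetch. It says nothing 
about the content of those pages, which is why it is no substitute for 
harvesting metadata records via OAI-PMH.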

On Jun 28, 2005, at 9:46 AM, Eric Lease Morgan wrote:

> To what degree has anybody here learned how to exploit Google sitemap 
> files? See:
>   http://www.google.com/webmasters/sitemaps/docs/en/about.html
> Apparently these sitemaps are XML files that Google can read to create 
> more accurate crawls of a website. They seem to be an alternative or 
> supplement to OAI.
> -- 
> Eric Lease Morgan
> University Libraries of Notre Dame
> (574) 631-8604
