Sitemap

The Sitemap protocol allows webmasters to inform search engines about the pages on their sites that are available for crawling.

Using the Sitemap protocol does not guarantee that web pages are included in search engine indexes, but it gives web crawlers hints that help them do a better job of crawling your site.

Sitemap 0.90 is offered under the terms of the Attribution-ShareAlike Creative Commons License and has wide adoption, including support from Google, Yahoo!, and Microsoft.

Official page

<?xml version="1.0" encoding="UTF-8"?>

<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>http://www.example.com/</loc>
      <lastmod>2005-01-01</lastmod>
      <changefreq>monthly</changefreq>
      <priority>0.8</priority>
   </url>
</urlset> 
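
As a rough sketch, a document like the one above could be generated with Python's standard xml.etree.ElementTree module. The function name build_sitemap and the dictionary-based entry format are assumptions made for this illustration; they are not part of the Sitemap protocol.

```python
import xml.etree.ElementTree as ET

# Namespace taken from the example document above.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Return a sitemap document as UTF-8 encoded bytes.

    `entries` is an iterable of dicts (a hypothetical format chosen for
    this sketch). "loc" is expected in every dict; "lastmod",
    "changefreq" and "priority" are written only when present.
    """
    urlset = ET.Element("urlset", {"xmlns": SITEMAP_NS})
    for entry in entries:
        url = ET.SubElement(urlset, "url")
        for tag in ("loc", "lastmod", "changefreq", "priority"):
            if tag in entry:
                ET.SubElement(url, tag).text = str(entry[tag])
    # xml_declaration=True requires Python 3.8 or later.
    return ET.tostring(urlset, encoding="UTF-8", xml_declaration=True)


if __name__ == "__main__":
    document = build_sitemap([
        {
            "loc": "http://www.example.com/",
            "lastmod": "2005-01-01",
            "changefreq": "monthly",
            "priority": 0.8,
        }
    ])
    print(document.decode("utf-8"))
```

ElementTree escapes special characters such as & in the URL text, which is one reason to prefer it over building the XML by string concatenation.
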
tag name description
urlset Root element of Sitemap protocol.
url Parent tag for each URL entry. The remaining tags are children of this tag.
loc URL of the page. This URL must begin with the protocol (such as http) and end with a trailing slash if your web server requires it. This value must be less than 2,048 characters.

Be careful when rebuilding your sitemap. If you set the lastmod date of every page to the current date, as some online tools do, search engines like Google might re-index all pages. This can mean that the pages are removed from the index and only added again after a (long) time.

lastmod Optional tag that specifies the date the page was last modified. This date should be in W3C Datetime format, for example 1997-07-16T19:20:30+01:00, which corresponds to YYYY-MM-DDThh:mm:ssTZD. You can omit the time portion, if desired, and use YYYY-MM-DD.
changefreq Optional tag that specifies how frequently the page is likely to change. This information is used as a hint, not as a command. Possible values are:
  • always - used to describe documents that change each time they are accessed
  • hourly
  • daily
  • weekly
  • monthly
  • yearly
  • never - used to describe archived URLs
priority Optional tag that specifies the priority of this URL relative to other URLs on your site, so that crawlers know which pages you consider most important. Valid values range from 0.0 to 1.0. The default priority of a page is 0.5.
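
The constraints in the table above can be checked before a sitemap is published. The following Python sketch is one possible way to do that; the names check_entry and is_w3c_date and the dictionary-based entry format are assumptions made for this illustration. It only accepts the two lastmod forms mentioned above (full date and time with time zone, or date only), and its protocol check is deliberately crude.

```python
from datetime import datetime

# Values listed for changefreq in the table above.
CHANGEFREQ_VALUES = {
    "always", "hourly", "daily", "weekly", "monthly", "yearly", "never",
}
MAX_LOC_LENGTH = 2048  # loc must be shorter than this


def is_w3c_date(value):
    """Accept YYYY-MM-DD or YYYY-MM-DDThh:mm:ssTZD.

    strptime is somewhat lenient (for example about zero padding),
    so this is a plausibility check, not a strict validator.
    """
    for fmt in ("%Y-%m-%d", "%Y-%m-%dT%H:%M:%S%z"):
        try:
            datetime.strptime(value, fmt)
            return True
        except ValueError:
            continue
    return False


def check_entry(entry):
    """Return a list of problems found in one URL entry (empty if none)."""
    problems = []

    loc = entry.get("loc", "")
    if "://" not in loc:
        problems.append("loc must begin with the protocol, such as http")
    if len(loc) >= MAX_LOC_LENGTH:
        problems.append("loc must be less than 2,048 characters")

    lastmod = entry.get("lastmod")
    if lastmod is not None and not is_w3c_date(lastmod):
        problems.append("lastmod is not in a recognised W3C date format")

    changefreq = entry.get("changefreq")
    if changefreq is not None and changefreq not in CHANGEFREQ_VALUES:
        problems.append("changefreq is not one of the allowed values")

    priority = entry.get("priority")
    if priority is not None:
        try:
            in_range = 0.0 <= float(priority) <= 1.0
        except (TypeError, ValueError):
            in_range = False
        if not in_range:
            problems.append("priority must be between 0.0 and 1.0")

    return problems
```

An entry that omits the optional tags passes the check, since only loc is required.
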