Enter a domain name
Enter a domain name
Search engines index the site gradually, level by level, starting from the main page. If the site has many levels of page nesting, then it will take many months until all the pages of the site are indexed. So, it is necessary to create a sitemap on which there will be all links to all pages of the site at once, so that the robot does not waste time navigating through the nesting levels, right? Yes and no. The answer depends on why the map is being made and what it will be. Let's take a look at what sitemaps are .
Sitemaps are divided into 2 main types or formats: html sitemap and xml file sitemap. An HTML sitemap is a page on a site that lists links. Usually these are links to the most important sections and pages of the site. The HTML sitemap is more intended for people than for robots and helps to quickly navigate in the main sections of the site. There are serious restrictions on the number of links per page for a sitemap in the form of an html page. If there are too many links on the page, not all links can be indexed, or the sitemap page can be excluded from the search for an excessive number of links, even internal links .
In order for the html sitemap to be correctly indexed and adequately perceived by visitors, you should not place more than 100 links on the page. This is more than enough to place on the page all the sections and subsections that did not fit into the main menu.
Typically, an HTML sitemap has a tree structure with expanded sections and subsections. Unnecessarily cumbersome HTML sitemaps are often decorated with graphic elements, CSS-styles and supplemented with Java-script. However, the html sitemap doesn't really matter to search engines.
An HTML sitemap is not a complete sitemap. What if the site has hundreds, thousands, tens of thousands of pages? To do this, you need to place links to all pages in the sitemap in xml format.
An XML sitemap is an xml file of the form sitemap.xml , which is usually found at the root of the site. The xml sitemap has many advantages over the html sitemap. Sitemap xml is a special sitemap format that is defined by all popular search engines such as Google and Yandex. You can specify up to 50,000 links in the xml sitemap . Moreover, in the sitemap xml, you can specify the relative priority and frequency of page refresh.
It should be said that the content of the sitemap is only a recommendation for the search robot. For example, if you set an annual refresh rate for a page on your site, search engine crawlers will still visit more often. And if you set the page refresh rate to hourly, this does not mean that robots will index the page every hour.
The xml sitemap has a specific syntax, let's look at the structure of the xml sitemap in more detail.
The correct sitemap.xml must be UTF8 encoded . The content of the sitemap.xml file looks like this:
<urlset xmlns = "http://www.sitemaps.org/schemas/sitemap/0.9"> <url> <loc> http://mysite.ru/ </loc> <lastmod> 2014-09-18T18: 54: 13 + 04: 00 </lastmod> <changefreq> always </changefreq> <priority> 1.0 </priority> </url> <url> <loc> http://mysite.ru/category/ </loc> <lastmod> 2014-09-18T18: 57: 09 + 04: 00 </lastmod> <changefreq> hourly </changefreq> <priority> 0.8 </priority> </url> <url> <loc> http://mysite.ru/page/ </loc> <lastmod> 2014-09-18T18: 59: 37 + 04: 00 </lastmod> <changefreq> daily </changefreq> <priority> 0.6 </priority> </url> </urlset>
Where the following required tags are used:
Further, optional tags follow:
Also, the xml-formatted sitemap file must contain an indication of the XML namespace:
xmlns = "http://www.sitemaps.org/schemas/sitemap/0.9"
If the sitemap file contains more than 50 thousand links or the size of the sitemap.xml exceeds 10 MB, it is recommended to split the sitemap into several files. In this case, several links to different map files must be specified in the sitemap.
<sitemapindex xmlns = "http://www.sitemaps.org/schemas/sitemap/0.9"> <sitemap> <loc> http://mysite.ru/sitemaps/sitemap01.xml </loc> <lastmod> 2014-09-18T18: 54: 13 + 04: 00 </lastmod> </sitemap> <sitemap> <loc> http://mysite.ru/sitemaps/sitemap02.xml </loc> <lastmod> 2014-09-18T18: 54: 13 + 04: 00 </lastmod> </sitemap> </sitemapindex>
It uses the familiar <loc> and <lastmod> tags, as well as the required tags:
Another way to design a sitemap as a file can be a sitemap in txt format:
1.http: //mysite.ru/ 2.http: //mysite.ru/page/ 3.http: //mysite.ru/page1/
It's simple. The sitemap.txt file lists all the necessary links line by line. A sitemap in txt format is an "option for the lazy". A similar sitemap xml limit of 50,000 links works here. However, the TXT sitemap lacks the ability to indicate last modified date and page priority.
Creating a sitemap is an important process in which it is necessary to clearly indicate which pages of the site need to be indexed and how best to index them. Depending on what type of sitemap you are talking about, there are different ways to create a sitemap . There is no point in discussing how to create an html sitemap separately. Let's consider how to make a map map in xml file format. There are a few basic ways to create a sitemap, but what they all have in common is where the sitemap is located and how the sitemap is identified by search engines.
As mentioned above, the sitemap file is located at the root of the site . Search engines are able to locate the sitemap file on their own. But there are several ways to provide a direct link to the file (s) of the sitemap (s) for faster detection by search engines. The easiest way to specify the location of the sitemap file is to directly specify the link or several links to the sitemap files in the webmaster tools from Yandex and Google. There you can also check the sitemap , analyze the sitemap for correctness , the correspondence of which pages from the sitemap were found by the search engine and how many of them are in indexation.