Disallow pages that ends with number only in robots.txt -


Is it possible for Google to crawl this page

  / blog / page / 10 / blog / page / 20 ... / blog / page / 100  

These are essentially Ajax calls that bring blog post data.

I created it in robots.txt:

  User-agent: * Disallow: / blog / page / *  

But now I have another page that should allow me

  / blog / page / start  

Is there any way I can get the robot I agree that pages ending with only one page such as

  user-agent: * reject: / blog / page / (: num)  

When I want to validate the robots.txt file I got an error after verb:

enter image details here

P> / / code>, with any number ( / blog / pages / 9129831823 ) Blog / pages / 9.html , / blog / pages / 5/10 / foo etc.). So you should not attach to * character (this The original robots.txt specification does not contain a wildcard, and in your case for those bots which the wildcard Area is interpreted as).

Google supports some features for robots.txt which are not part of the original robots.txt specification, and therefore (all) are not supported by other bots, e.g., allow < / Code> field but as the above robots.txt works, there is no need to use it.


Comments

Popular posts from this blog

python - Overriding the save method in Django ModelForm -

html - CSS autoheight, but fit content to height of div -

qt - How to prevent QAudioInput from automatically boosting the master volume to 100%? -