GoIndexed Help
Blocked by Robots.txt — How to Find and Fix the Issue
If Google can't crawl your page because of a robots.txt rule, it won't be indexed. This guide shows you how to find the rule, test it, and fix it without breaking your site.
What robots.txt blocking means
Your site's robots.txt file tells search engine crawlers which URLs they are allowed to visit. If a URL matches a Disallow rule for Googlebot, Google cannot crawl it — and a page Google can't crawl won't be indexed.
This is different from a noindex tag: robots.txt blocks the crawl entirely, so Google may still show the URL in results with no content snippet, based on links pointing to it.
How to find your robots.txt file
Your robots.txt is always at the root of your domain:
Visit yourdomain.com/robots.txt
Open your browser and go to your domain followed by /robots.txt. You'll see the plain text file.
Look for Disallow rules
Find any lines starting with Disallow: under a User-agent: * or User-agent: Googlebot block. Check if your URL matches any of them.
How to test a URL against robots.txt
Use Google Search Console's robots.txt tester
In Search Console, go to Settings → robots.txt. Paste the URL you want to test and click Test — it will tell you if Googlebot is blocked.
How to fix it
- Remove or narrow the Disallow rule that's blocking the URL — e.g. change Disallow: / to Disallow: /private/
- If the block is intentional (admin pages, internal tools), make sure the URL you want indexed is not in a blocked path
- After fixing, use GoIndexed's Re-inspect button or request indexing in Search Console to prompt Google to re-crawl
Common mistakes that cause accidental blocks
- Disallow: / blocks the entire site — usually left over from a staging environment
- WordPress's "Discourage search engines" setting adds Disallow: / automatically
- Overly broad wildcard patterns like Disallow: /*?* that accidentally match real URLs
Related articles
Help
Blocked by Noindex Tag — How to Find and Remove It
A noindex meta tag or HTTP header tells Google not to include your page in search results. This guide shows you where to find it and how to remove it safely.
Help
Crawled, Currently Not Indexed — What It Means and How to Fix It
The "Crawled, currently not indexed" status means Google visited your page but decided not to include it in search results. This guide explains why it happens and what to do.
Help
404 Page Not Found in Google Search Console — How to Fix It
When Google tries to crawl a URL and gets a 404 response, it cannot index the page. This guide explains why 404s appear in Search Console and how to clean them up.
Ready to put this into practice?
Open GoIndexed