Question 1

What is a robots.txt file?

Accepted Answer

robots.txt is a plain-text file at the root of your domain (e.g. https://example.com/robots.txt) that tells well-behaved web crawlers which URLs they may or may not request. It uses a simple User-agent / Disallow / Allow grammar standardized by Google and Bing.

Question 2

Where do I upload the file?

Accepted Answer

Place it at the root of your domain so a request to /robots.txt returns it. On most hosts that means uploading it to /public, /static or the document root; on Next.js you can put it at app/robots.ts or public/robots.txt. It must be reachable over HTTPS without redirects.

Question 3

Does robots.txt actually stop a page from being indexed?

Accepted Answer

Not directly. robots.txt only stops crawling — it tells bots not to fetch the URL. If other sites link to a disallowed URL, Google may still index it as a link with no snippet. To completely de-index, use a noindex meta tag (or X-Robots-Tag header) on an allowed page.

Question 4

How do I block AI crawlers like GPTBot or ClaudeBot?

Accepted Answer

Use the “Block AI crawlers” preset — it adds a block listing GPTBot, ChatGPT-User, OAI-SearchBot, anthropic-ai, ClaudeBot, Google-Extended, PerplexityBot, CCBot, FacebookBot, Bytespider, Amazonbot and others, all with Disallow: /. You can edit the list to add or remove agents as new ones appear.

Question 5

Why is there an Allow rule? Doesn't Disallow do the work?

Accepted Answer

Allow lets you carve exceptions out of broader Disallow rules. For example, WordPress sites commonly disallow /wp-admin/ but allow /wp-admin/admin-ajax.php, because that endpoint is needed by some plugins on the public site.

Question 6

What does the path tester do?

Accepted Answer

It applies your rules to a given URL path for a given User-agent using the same longest-match algorithm Google and Bing use. It tells you whether the URL is Allowed or Disallowed and which exact rule decided it — handy for debugging tricky cases before you ship.

Question 7

What about Crawl-delay?

Accepted Answer

Crawl-delay is honored by Bing, Yahoo Slurp and Yandex; Google ignores it. To rate-limit Googlebot, use Google Search Console's crawl-rate setting instead.

Question 8

Should I list my sitemap in robots.txt?

Accepted Answer

Yes — both Google and Bing recommend listing one or more Sitemap: URLs in robots.txt as a backup discovery mechanism in addition to submitting sitemaps in their respective Search Consoles.

Question 9

Is my data sent anywhere?

Accepted Answer

No. The generator runs entirely in your browser. Your draft is saved in localStorage so a refresh doesn't lose work, but nothing is uploaded to Toollyz or anywhere else.

Question 10

Is this Robots.txt Generator free?

Accepted Answer

Completely free with no signup and no limits. Build, test and download as many robots.txt files as you like — privately in your browser.

Robots.txt Generator

What is the Robots.txt Generator?

How to use it

Benefits

Frequently asked questions

What is a robots.txt file?

Where do I upload the file?

Does robots.txt actually stop a page from being indexed?

How do I block AI crawlers like GPTBot or ClaudeBot?

Why is there an Allow rule? Doesn't Disallow do the work?

What does the path tester do?

What about Crawl-delay?

Should I list my sitemap in robots.txt?

Is my data sent anywhere?

Is this Robots.txt Generator free?

Meta Tag Generator

URL Shortener

Slugify

DNS Lookup Tool

What is the Robots.txt Generator?

How to use it

Benefits

Frequently asked questions

What is a robots.txt file?

Where do I upload the file?

Does robots.txt actually stop a page from being indexed?

How do I block AI crawlers like GPTBot or ClaudeBot?

Why is there an Allow rule? Doesn't Disallow do the work?

What does the path tester do?

What about Crawl-delay?

Should I list my sitemap in robots.txt?

Is my data sent anywhere?

Is this Robots.txt Generator free?

Related tools

Meta Tag Generator

URL Shortener

Slugify

DNS Lookup Tool