Skip to main content
Toollyz

Search tools

Search for a command to run...

Robots.txt Generator

Build a standards-conformant robots.txt with multiple User-agent blocks, sitemap URLs, AI-crawler blocking presets and a live path tester. Free, private, generated entirely in your browser.

What is the Robots.txt Generator?

A robots.txt Generator is a tool that builds a /robots.txt file from a structured editor instead of hand-edited text. Toollyz Robots.txt Generator models your file as a list of rule blocks — each with one or more User-agent lines, Allow/Disallow paths and an optional Crawl-delay — plus a flat list of Sitemap URLs at the bottom. It ships with one-click presets (Allow everything, Disallow everything, WordPress, Next.js App Router, Shopify, generic e-commerce, and a Block all AI crawlers preset that adds GPTBot, ChatGPT-User, OAI-SearchBot, anthropic-ai, ClaudeBot, Google-Extended, PerplexityBot, CCBot, Bytespider and more) and an integrated path tester that uses Google's longest-match rule to tell you exactly which Allow or Disallow line decides a given URL for a given agent. The output is rendered in real time, validated for common mistakes (paths that don't start with /, sitemap URLs that aren't absolute) and ready to copy or download as robots.txt — generated entirely in your browser.

How to use it

  1. Pick a preset (or start from scratch) and edit User-agent, Allow and Disallow lines per block.
  2. Add sitemap URLs at the bottom — one absolute URL per line.
  3. Use the path tester to verify a real URL against a real bot (Googlebot, Bingbot, GPTBot, …).
  4. Copy the generated robots.txt or download it and upload to the root of your domain.

Benefits

  • Visual rule-block editor — no manual `User-agent:` / `Disallow:` formatting.
  • One-click presets for WordPress, Next.js, Shopify, e-commerce, allow-all and disallow-all.
  • Block AI crawlers preset covers 19 known LLM/training agents in a single click.
  • Live path tester using the official longest-match rule honored by Google and Bing.
  • Sitemap section with absolute-URL validation.
  • Real-time validation for paths, agents, crawl-delay range and empty blocks.
  • One-click Copy or Download as robots.txt — ready to deploy.
  • 100% private — generated in your browser, draft saved to localStorage.

Frequently asked questions

What is a robots.txt file?

robots.txt is a plain-text file at the root of your domain (e.g. https://example.com/robots.txt) that tells well-behaved web crawlers which URLs they may or may not request. It uses a simple User-agent / Disallow / Allow grammar standardized by Google and Bing.

Where do I upload the file?

Place it at the root of your domain so a request to /robots.txt returns it. On most hosts that means uploading it to /public, /static or the document root; on Next.js you can put it at app/robots.ts or public/robots.txt. It must be reachable over HTTPS without redirects.

Does robots.txt actually stop a page from being indexed?

Not directly. robots.txt only stops crawling — it tells bots not to fetch the URL. If other sites link to a disallowed URL, Google may still index it as a link with no snippet. To completely de-index, use a noindex meta tag (or X-Robots-Tag header) on an allowed page.

How do I block AI crawlers like GPTBot or ClaudeBot?

Use the “Block AI crawlers” preset — it adds a block listing GPTBot, ChatGPT-User, OAI-SearchBot, anthropic-ai, ClaudeBot, Google-Extended, PerplexityBot, CCBot, FacebookBot, Bytespider, Amazonbot and others, all with Disallow: /. You can edit the list to add or remove agents as new ones appear.

Why is there an Allow rule? Doesn't Disallow do the work?

Allow lets you carve exceptions out of broader Disallow rules. For example, WordPress sites commonly disallow /wp-admin/ but allow /wp-admin/admin-ajax.php, because that endpoint is needed by some plugins on the public site.

What does the path tester do?

It applies your rules to a given URL path for a given User-agent using the same longest-match algorithm Google and Bing use. It tells you whether the URL is Allowed or Disallowed and which exact rule decided it — handy for debugging tricky cases before you ship.

What about Crawl-delay?

Crawl-delay is honored by Bing, Yahoo Slurp and Yandex; Google ignores it. To rate-limit Googlebot, use Google Search Console's crawl-rate setting instead.

Should I list my sitemap in robots.txt?

Yes — both Google and Bing recommend listing one or more Sitemap: URLs in robots.txt as a backup discovery mechanism in addition to submitting sitemaps in their respective Search Consoles.

Is my data sent anywhere?

No. The generator runs entirely in your browser. Your draft is saved in localStorage so a refresh doesn't lose work, but nothing is uploaded to Toollyz or anywhere else.

Is this Robots.txt Generator free?

Completely free with no signup and no limits. Build, test and download as many robots.txt files as you like — privately in your browser.

See all seo tools