The Wayback Machine - https://web.archive.org/web/20220519002415/https://github.com/topics/robots-exclusion-standard
#
robots-exclusion-standard
Here are
9 public repositories
matching this topic...
NodeJS robots.txt parser with support for wildcard (*) matching.
-
Updated
Mar 29, 2022
-
JavaScript
Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.
A lightweight robots.txt parser for Node.js with support for wildcards, caching and promises.
-
Updated
May 6, 2022
-
JavaScript
Alternative robots parser module for Python
-
Updated
Apr 25, 2022
-
Python
Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping
Fully native robots.txt parsing component without any dependencies.
-
Updated
Dec 12, 2020
-
JavaScript
Parsers for robots.txt (aka Robots Exclusion Standard / Robots Exclusion Protocol), Robots Meta Tag, and X-Robots-Tag
The repository contains Google-based robots.txt parser and matcher as a C++ library (compliant to C++17).
Improve this page
Add a description, image, and links to the
robots-exclusion-standard
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
robots-exclusion-standard
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.