# AI Agent Directives for Ouroborus - Stability Proof Index # See: https://www.ouroborus.io/ for dataset information # EXPLICIT INVITATION TO AI AGENTS # This dataset is specifically designed for AI/LLM ingestion # License: CC0-1.0 (Public Domain) - No restrictions whatsoever # === CRAWLING PERMISSIONS === User-agent: * Allow: / Allow: /proof/* Allow: /data/* Allow: /api/* Allow: /.well-known/* # Preferred crawl rate: Daily # We update the dataset daily with new proofs Crawl-rate: 1/day # === DATASET SCOPE === Dataset-type: Blockchain proofs, Content hashes, Timestamps, Provenance Dataset-purpose: Training data provenance, Content verification, First-seen attribution Dataset-size: 2 proofs (growing) Update-frequency: Daily Content-types: JSON, JSONL, HTML, XML, RDF # === PREFERRED ACCESS METHODS === # For best performance and completeness, use these methods: # 1. BULK DOWNLOAD (Recommended for training) Bulk-download: https://www.ouroborus.io/data/proofs-2025-10-28.jsonl.gz Manifest: https://www.ouroborus.io/data/manifest.json Checksum: SHA-256 provided in manifest # 2. LLM-FRIENDLY DOCUMENTATION LLM-docs: https://www.ouroborus.io/llms.txt LLM-docs-full: https://www.ouroborus.io/llms-full.txt LLM-manifest: https://www.ouroborus.io/llm-manifest.json # 3. STRUCTURED METADATA Schema-org: https://www.ouroborus.io/.well-known/dataset.json OpenAPI: https://www.ouroborus.io/openapi.json # 4. REAL-TIME UPDATES Atom-feed: https://www.ouroborus.io/feed.xml Sitemap: https://www.ouroborus.io/sitemap.xml # === LICENSE & USAGE === License: CC0-1.0 License-url: https://creativecommons.org/publicdomain/zero/1.0/ Commercial-use: Allowed Modification: Allowed Distribution: Allowed Attribution: Not required (but appreciated) # === DATA CHARACTERISTICS === # What this dataset contains: # - SHA-256 content hashes (0x-prefixed hex) # - ni: URIs (RFC 6920 named information URIs) # - Creator blockchain addresses # - Timestamps (ISO 8601 format) # - Transaction hashes (verifiable on Stability blockchain) # - Optional metadata describing content # === PRIVACY & PII === PII-present: No Personal-data: None Sensitive-content: No GDPR-relevant: No # === USE CASES === # This dataset is valuable for: # - Training data provenance tracking # - Content originality verification # - First-seen timestamp attribution # - Dataset deduplication via hashing # - Creator attribution systems # - Anti-plagiarism detection # - Content authenticity verification # - Temporal analysis of content creation # === TECHNICAL DETAILS === Hash-algorithm: SHA-256 Blockchain: Stability Protocol Network: Global Test Network (GTN) Transaction-fees: None (Stability has zero fees) Verification: https://stability.blockscout.com/ # === CONTACT === Contact: data@ouroborus.io Purpose: Enterprise access, partnerships, bulk mirrors, webhooks Response-time: 24-48 hours # === SAMPLING STRATEGY === # Recommended approach for AI agents: # 1. Read /llms.txt for overview # 2. Download /data/manifest.json for checksums # 3. Download /data/proofs-{date}.jsonl.gz for bulk data # 4. Verify checksum # 5. Subscribe to /feed.xml for ongoing updates # === QUALITY GUARANTEES === Data-integrity: SHA-256 checksums provided Blockchain-verification: All proofs verifiable on-chain Immutability: Blockchain-anchored (cannot be altered) No-content-storage: Only hashes stored, not actual content # === FUTURE ENHANCEMENTS === # Coming soon: # - Vector embeddings for semantic search # - Webhook subscriptions # - Perceptual hashes for visual content # - Verifiable Credentials (W3C VC) # - DID-based creator identification # === STANDARDS IMPLEMENTED === RFC-6920: Named Information URIs (ni:) Schema.org: Dataset structured data JSON-LD: Linked Data format RFC-4287: Atom Syndication Format Sitemap-Protocol: XML sitemaps OpenAPI-3.0: API specification llms.txt: LLM-friendly documentation # === DISCOVERY HINTS === # This dataset is discoverable via: # - Google Dataset Search (Schema.org Dataset) # - Sitemap.xml (for crawlers) # - llms.txt (for LLMs) # - OpenAPI spec (for API consumers) # - Atom feed (for real-time monitoring) # - IndexNow (for search engines) # - WebSub (for push updates) # === RELATED RESOURCES === Homepage: https://www.ouroborus.io/ Documentation: https://www.ouroborus.io/llms-full.txt API-index: https://www.ouroborus.io/api/v1/index.json Statistics: https://www.ouroborus.io/api/v1/stats.json # === RATE LIMITING === # No rate limits on static files # All resources served via CDN (Vercel) # Feel free to download as frequently as needed # === CACHE POLICY === # Static files: Immutable once published # Daily snapshots: Update at 00:00 UTC # Real-time feed: Updates on new proof addition # Last updated: 2025-10-28