Excellent article on running YaCy #DecentralizedSearch #WebScraping in a professional, secure, and scalable way for web scraping and data extraction:
https://scrapingant.com/blog/decentralized-web-scraping-data-extraction-yacy
It covers private/Robinson setups with WireGuard, public-edge reverse proxies (Caddy), container hardening (NIST SP 800-190), crawl profiles, index lifecycle management, performance tuning, and monitoring, JVM/IO tuning, SLOs, ethical scraping, sitemaps-first — everything needed for serious deployments.