Discussion
Loading...

#Tag

  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
d@nny disc@ mc²
d@nny disc@ mc² boosted
Clemens
@neverpanic@chaos.social  ·  activity timestamp 2 months ago

I've had it with the aggressive #AI #crawlers now. Some bot has been hitting #MacPorts with a legitimate enough user agent that I can't block it without also blocking users.

Yesterday, it sent 377k requests (62 % of the total), 369k to URLs forbidden in robots.txt from 274k unique IPs. Most of it for content that could be analyzed quicker using svn checkout or git clone.

Dynamic content on the #web is broken. There's just no way to do that anymore. What a waste of energy.

  • Copy link
  • Flag this post
  • Block
Clemens
@neverpanic@chaos.social  ·  activity timestamp 2 months ago

I've had it with the aggressive #AI #crawlers now. Some bot has been hitting #MacPorts with a legitimate enough user agent that I can't block it without also blocking users.

Yesterday, it sent 377k requests (62 % of the total), 369k to URLs forbidden in robots.txt from 274k unique IPs. Most of it for content that could be analyzed quicker using svn checkout or git clone.

Dynamic content on the #web is broken. There's just no way to do that anymore. What a waste of energy.

  • Copy link
  • Flag this post
  • Block
alcinnz
algernon the zellij stan
alcinnz and 1 other boosted
Stefano Marinelli
@stefano@mastodon.bsd.cafe  ·  activity timestamp 2 months ago

If your crawler is spoofing its identity because I have disallowed it from my sites, I consider that an attempted server breach. You are no better than someone trying to compromise my SSH connection.

#IT#SysAdmin#Perplexity#Crawlers

  • Copy link
  • Flag this post
  • Block
Stefano Marinelli
@stefano@mastodon.bsd.cafe  ·  activity timestamp 2 months ago

If your crawler is spoofing its identity because I have disallowed it from my sites, I consider that an attempted server breach. You are no better than someone trying to compromise my SSH connection.

#IT#SysAdmin#Perplexity#Crawlers

  • Copy link
  • Flag this post
  • Block
Alfonso Siciliano
@alfonsosiciliano@mastodon.bsd.cafe  ·  activity timestamp 3 months ago

🖥️ My ultra-budget server powering http://websysctl.alfonsosiciliano.net has been running smoothly for the past 2 months. So far, so good!

📈 #Crawlers hit tens of thousands of sysctl parameter pages daily. That's fine, since robots.txt allows it. But why keep requesting non-existent pages as if the site were built with WordPress 😤 ? Fortunately, the stack (#FreeBSD freebsd + #OpenResty 🌐 + #Lapis ✏️ + a custom-built #database 📦 ) stays well within the limited resources of my $5/month cloud server.

The code might soon be #OpenSource stay tuned!

#UNIX #sysctl #WebDev #WebServer #ThePowerToServe #coding #Lua #kernel

Screenshot of the "Tree MIB" page from the WebSysctl site. The left panel shows an expandable tree view of the FreeBSD sysctl MIB hierarchy, with nodes like sysctl, kern, vm, sys, security, and their subcategories. The security.mac.mmap_revocation_via_cow node is selected. The right panel displays detailed information about this sysctl parameter, including its link, OID, name, description ("Revoke mmap access to files via copy-on-write semantics, or by removing all write access"), type (integer), format (I), flags (RD, WR, MPSAFE), label, and handler status (Defined). The top navigation bar includes links: Home, Docs, Table, Tree (highlighted), Update, Login, and Contacts.
Screenshot of the "Tree MIB" page from the WebSysctl site. The left panel shows an expandable tree view of the FreeBSD sysctl MIB hierarchy, with nodes like sysctl, kern, vm, sys, security, and their subcategories. The security.mac.mmap_revocation_via_cow node is selected. The right panel displays detailed information about this sysctl parameter, including its link, OID, name, description ("Revoke mmap access to files via copy-on-write semantics, or by removing all write access"), type (integer), format (I), flags (RD, WR, MPSAFE), label, and handler status (Defined). The top navigation bar includes links: Home, Docs, Table, Tree (highlighted), Update, Login, and Contacts.
Screenshot of the "Tree MIB" page from the WebSysctl site. The left panel shows an expandable tree view of the FreeBSD sysctl MIB hierarchy, with nodes like sysctl, kern, vm, sys, security, and their subcategories. The security.mac.mmap_revocation_via_cow node is selected. The right panel displays detailed information about this sysctl parameter, including its link, OID, name, description ("Revoke mmap access to files via copy-on-write semantics, or by removing all write access"), type (integer), format (I), flags (RD, WR, MPSAFE), label, and handler status (Defined). The top navigation bar includes links: Home, Docs, Table, Tree (highlighted), Update, Login, and Contacts.
  • Copy link
  • Flag this post
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0-rc.3.5 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct
Home
Login