Discussion
Loading...

Post

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Neil Craig
@tdp_org@mastodon.social  ·  activity timestamp 4 months ago

I noticed that a lot of the crawlers/bots we see on www.bbc.co.uk & www.bbc.com are spoofed e.g. a "Meta" crawler coming from 10s of different small ISPs across the world (the real one comes from a Meta ASN).
I deployed a change this morning which adds source ASN validation (alongside user-agent string analysis) to our "known crawlers/bots" classifier & well, the results speak for themselves. Attached graphs show RPS from "known crawlers/bots" to www.bbc.co.uk & www.bbc.com.
#WebDev#BBC#Bots

Graph of requests from "known crawlers/bots" over time to www.bbc.com for today.
The graph is relatively steady until about 08:45 UTC when it drops by about 90%
Graph of requests from "known crawlers/bots" over time to www.bbc.com for today. The graph is relatively steady until about 08:45 UTC when it drops by about 90%
Graph of requests from "known crawlers/bots" over time to www.bbc.com for today. The graph is relatively steady until about 08:45 UTC when it drops by about 90%
Graph of requests from "known crawlers/bots" over time to www.bbc.co.uk for today.
The graph is relatively steady until about 08:45 UTC when it drops by about 90%
Graph of requests from "known crawlers/bots" over time to www.bbc.co.uk for today. The graph is relatively steady until about 08:45 UTC when it drops by about 90%
Graph of requests from "known crawlers/bots" over time to www.bbc.co.uk for today. The graph is relatively steady until about 08:45 UTC when it drops by about 90%
  • Copy link
  • Flag this post
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0-rc.2.11 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct