Discussion
Loading...

Post

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Sander Meijer
Sander Meijer
@bergmeister@mountains.social  ·  activity timestamp last month

ℹ️ (increased) scraping ℹ️

Since roughly a week there is increased scraping taking place on mountains.social. This results in higher load on the server (which you might have noticed) and also an increase in media storage (as older posts are read more frequently, they are no candidate anymore for daily housekeeping).

I remove media for remote posts older than 4 days (they will be fetched again when someone needs that media again). The used media storage was stable around 90 GB, but has increased to 200 GB in the past week. Of course this will increase storage costs.

As a first measure I have changed the housekeeping to delete media from remote posts older than 3 days instead of 4. In the coming days I will have to setup blocking IP addresses / ranges from the seen scrapers, as the currently configured robots.txt ("please do no scrape") is just plainly ignored by them.

Will not write down what I think about them, but I guess you can figure (hint: it is quite explicit).

#MountainsAdmin

  • Copy link
  • Flag this post
  • Block
Sander Meijer
Sander Meijer
@bergmeister@mountains.social replied  ·  activity timestamp last month

I have started to block the first IP-ranges (the most obvious ones).

Their traffic will still reach the webserver (consuming CPU), but will be dropped there. It should not reach the actual posts anymore (and therefore will also not touch their retention time. The cleanup of media from older remote posts will therefore be possible.

This is of course a cat and mouse game, so let's see how this evolves.

In case you notice negatives side-effects from your side, please give me a shout.

#MountainsAdmin

  • Copy link
  • Flag this comment
  • Block
Billy
Billy
@parslii@mountains.social replied  ·  activity timestamp last month

@bergmeister this is a bummer, but I also think it's an unfortunate inevitability on the contemporary internet.

  • Copy link
  • Flag this comment
  • Block

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.2-alpha.7 no JS en
Automatic federation enabled
Log in
  • Explore
  • About
  • Members
  • Code of Conduct