Discussion
Loading...

Post

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
raffaele
raffaele
@raffaele@digipres.club  ·  activity timestamp 2 weeks ago

wow, this is huge: World's largest Internet Domain Database https://ip.thc.org/
can be relevant for #webarchiving activities.

bulk data available, in parquet format: https://ip.thc.org/docs/bulk-data-access

Bulk Data Access | ip.thc.org

We publish our database in its entirety at the end of each month in CSV and parquet formats, the latest data can downloaded from the links specified below. we recommend the parquet format since its much smaller when uncompressed.
  • Copy link
  • Flag this post
  • Block

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.1-alpha.40 no JS en
Automatic federation enabled
Log in
  • Explore
  • About
  • Members
  • Code of Conduct