Have you ever needed to extract text from images embedded in a #PDF? I can highly recommend the open source #CLI tool #OCRmyPDF which is easy to automate in for example a #DataPipeline.

It uses #Tesseract#OCR under the hood and has many options to experiment with to get the best possible accuracy for your language and PDF content.

You can get started with just a few commands:

https://samuelplumppu.se/blog/automated-text-extraction-from-pdf-images-with-ocrmypdf

🏕️ my adventures in #selfhosting: day 259 (slow down edition) 🐌​

a blog post that discusses the sense of urgency I felt to learn #Docker (because it will become mandatory for #Ghost)... and how a recent discovery has pushed back my deadline. I can learn slowly, yay!

🔗​: https://news.elenarossini.com/my-so-called-sudo-life/my-adventures-in-self-hosting-day-259-slow-down-edition/

#MySoCalledSudoLife #CLI

🏕️ my adventures in #selfhosting: day 259 (slow down edition) 🐌​

a blog post that discusses the sense of urgency I felt to learn #Docker (because it will become mandatory for #Ghost)... and how a recent discovery has pushed back my deadline. I can learn slowly, yay!

🔗​: https://news.elenarossini.com/my-so-called-sudo-life/my-adventures-in-self-hosting-day-259-slow-down-edition/

#MySoCalledSudoLife #CLI

Emelia 👸🏻
Jeff Sikes 🍎
Emelia 👸🏻 and 1 other boosted

🎉 Big thanks to @2chanhaeng for his first contribution to #Fedify! He implemented the new fedify webfinger command in PR #278, which allows isolated #WebFinger lookups for testing configurations. This addresses the need for developers to test WebFinger functionality without performing comprehensive object retrieval.

The contribution includes:

  • A new fedify webfinger command that accepts @user@domain format handles or URIs
  • Clean JSON output of WebFinger JRD results
  • Proper error handling for invalid handles and lookup failures
  • Complete #CLI integration with help text and usage examples

This was originally filed as issue #260 and marked as a good first issue—perfect for newcomers to learn the codebase structure while contributing meaningful functionality. The PR has been merged and will be included in the upcoming Fedify 1.8.0 release.

We appreciate all first-time contributors who help make Fedify better for the entire #fediverse community. Welcome aboard, ChanHaeng!

#opensource #fedidev

🎉 Big thanks to @2chanhaeng for his first contribution to #Fedify! He implemented the new fedify webfinger command in PR #278, which allows isolated #WebFinger lookups for testing configurations. This addresses the need for developers to test WebFinger functionality without performing comprehensive object retrieval.

The contribution includes:

  • A new fedify webfinger command that accepts @user@domain format handles or URIs
  • Clean JSON output of WebFinger JRD results
  • Proper error handling for invalid handles and lookup failures
  • Complete #CLI integration with help text and usage examples

This was originally filed as issue #260 and marked as a good first issue—perfect for newcomers to learn the codebase structure while contributing meaningful functionality. The PR has been merged and will be included in the upcoming Fedify 1.8.0 release.

We appreciate all first-time contributors who help make Fedify better for the entire #fediverse community. Welcome aboard, ChanHaeng!

#opensource #fedidev

I had the honour to be invited by @ZFAVClub to participate in an online live workshop exploring the mighty and wonderful CLI tool of the @internetarchive, in collaboration with @dweb 💕

If you are interested in data preservation and digital commons, please do watch it, as we did our best to shed some light on the most powerful yet little known features of this software.

The #InternetArchive is an invaluable resource for unlimited, public, open, and free storage for the whole Universe. It seems too good to be true, but it is!

We also touched upon the value of #FreeSoftware, #Linux, #decentralization, and #publicDomain.

https://archive.org/details/youtube-8Pr05zlXdlo

#archive#commonGood #librarian #library #hacker #hacking #hackerLibrarian#DWeb#CLI #terminal #tutorial#Python #workshop

⚡ Use the Internet Archive like a Hacker-Librarian! 🔮

Join a hands-on workshop where we’ll demystify the CLI and turn it into your super power for archiving, accessing, and uploading content to the @internetarchive.

Hosted by the ZkAv Club Foundry, with the participation of @tommi, we’ll learn and share our experience using the awesome ia #CLI tool!

🗓️ Jul 3, 16:00 UTC (9 PDT, 18 CEST)
PARTICIPATION IS FREE

ℹ️ Info + registration: https://lu.ma/zv3blohp

#DWeb#InternetArchive#IA #terminal