Linnix – eBPF observability that predicts failures before they happen
https://github.com/linnix-os/linnix
#HackerNews #Linnix #eBPF #observability #predictive #failure #technology #monitoring
#Tag
Linnix – eBPF observability that predicts failures before they happen
https://github.com/linnix-os/linnix
#HackerNews #Linnix #eBPF #observability #predictive #failure #technology #monitoring
OpenTelemetry: Escape Hatch from the Observability Cartel
https://oneuptime.com/blog/post/2025-11-03-opentelemetry-escape-from-observability-cartel/view
#HackerNews #OpenTelemetry #Observability #EscapeCartel #TechNews #Monitoring #Tools
I finally decided to write to make me feel like my quivering existence isn't merely job search anxiety. So what started as a simple book review sorta took on a life of its own!
Since @adrianco turned me on to Architecture for Flow by @suksr, I have been simmering with thoughts about it. I finished it last month and it has shaped what I've answered during interview questions.
But not only that, Susanne has created a great stepping-off point for me to talk about how to treat Incident Management in the context of achieving Architecture for Flow:
https://www.sounding.com/2025/11/06/flow-for-incidents/
#SRE #Incidents #Observability #WardleyMaps #TeamTopologies #DomainDrivenDesign #ArchitectureForFlow
I finally decided to write to make me feel like my quivering existence isn't merely job search anxiety. So what started as a simple book review sorta took on a life of its own!
Since @adrianco turned me on to Architecture for Flow by @suksr, I have been simmering with thoughts about it. I finished it last month and it has shaped what I've answered during interview questions.
But not only that, Susanne has created a great stepping-off point for me to talk about how to treat Incident Management in the context of achieving Architecture for Flow:
https://www.sounding.com/2025/11/06/flow-for-incidents/
#SRE #Incidents #Observability #WardleyMaps #TeamTopologies #DomainDrivenDesign #ArchitectureForFlow
Is anyone aware of literature on doing globally distributed tail sampling of tracing information with some maximum bounds on bandwidth? I assume the answer to this is no but I would love to be surprised.
#observability
Is anyone aware of literature on doing globally distributed tail sampling of tracing information with some maximum bounds on bandwidth? I assume the answer to this is no but I would love to be surprised.
#observability
I miss teaching my observability classes. I'm excited to recreate the material probably next year and if I have my way this time it's going to be "open source" so I don't have to write half a book from scratch again.
I took an information theory focused approach last time because it's what I found to be best specifically for measuring the behavior of large scale systems behavior. Statistical views have their place and utility but my experience is that they typically lead to overly noisy detection and less ability to bisect issues.
#observability
Target is about 10 servers and 200 jails.
No apache2 /php, nagios or clones thereof please. I don’t have these in my stack today, and my expertise in managing them is about 20 years out of date. I prefer to avoid JVM stuff but I’m not violently against it.
Doesn’t have to be in ports yet ( like https://sensu.io/ server) if it’s in a friendly language.
Target is about 10 servers and 200 jails.
No apache2 /php, nagios or clones thereof please. I don’t have these in my stack today, and my expertise in managing them is about 20 years out of date. I prefer to avoid JVM stuff but I’m not violently against it.
Doesn’t have to be in ports yet ( like https://sensu.io/ server) if it’s in a friendly language.
the state of #observability is deplorable, the landscape fractured and mostly focused on competition, a significant portion of the telemetry emitted useless and overloaded while important stuff is still missing, the tooling itself hard to "observe" and OMG the usability and the ability to automate things EASILY...
this is really one of those "that's it I'm writing my own" moments. it won't even be slower than setting up existing tools properly and squeeze everything into yesterday's concepts.
What tools do you use for your on-host #FreeBSD| #BSD system metrics collection and monitoring? #Sysadmin #DevOps #SRE #Monitoring #Systems #Observability #Metrics
What tools do you use for your on-host #FreeBSD| #BSD system metrics collection and monitoring? #Sysadmin #DevOps #SRE #Monitoring #Systems #Observability #Metrics
A space for Bonfire maintainers and contributors to communicate