Post by @hipsterelectron@circumstances.run

@hipsterelectron@circumstances.run · 7 hours ago

This difficulty, a direct consequence of the use of indirection,

how are you still negging the reader like this

can be broken down as the aliasing [14] and frame [61] problems.

oh my GOD!!!!! ok so these fucking citations my god

[14] this is literally about virtual memory conforming to the C standard https://eis.mdx.ac.uk/staffpages/r_bornat/papers/MPC2000.pdf

The final difficulty is the complexity of the proofs: not only do we have to reason formally about sets, sequences, graphs and trees, we have to make sure that the locality of assignment operations is reflected in the treatment of assertions about the heap.

EVEN THAT PAPER'S AUTHOR IS TELLING HIM TO DO HIS FUCKING JOB LOL

For all of these reasons, Hoare logic isn’t widely used to verify pointer programs. Yet most low-level and all object-oriented programs use heap pointers freely. If we wish to prove properties of the kind of programs that actually get written and used, we shall have to deal with pointer programs on a regular basis.

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 7 hours ago

literally nothing will prepare you for [61]

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 7 hours ago

[61] McCarthy and P. Hayes. Some philosophical problems from the
standpoint of artificial intelligence. In D. Michie and B. Meltzer, editors,
Machine Intelligence 4, pages 463–502. Edinburgh University Press,
1969.

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 7 hours ago

only possible alternative is he mistyped the reference address making a crucial point in his own phd thesis

[62] F. Mehta and T. Nipkow. Proving pointer programs in higher-order
logic. Information and Computation, 199(1-2):200–227, 2005.

and yes, it still assumes the heap. even though, if you're managing physical memory, you do not have a heap

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

god fuck and even this example is literally impossible

For an example of aliasing, consider a program with two pointer variables int * p and int * q and the following triple:
{| True |} ∗p = 37 ; ∗q = 42 ; {| ∗p = ? |}

not only has he just said "triple" without a citation like that's a well-known thing, this is the problem with it:

We are unable to ascertain the value pointed to by p as it may refer to the same location as q.

so you're telling me this C code:

int * p;
int * q;
*p = 37;
*q = 42;

demonstrates a classic aliasing problem....................does this guy even know about restrict or the concept of Undefined Behavior
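
for reference, a minimal sketch of what the triple is getting at once the pointers actually point somewhere (the snippet as written dereferences uninitialized pointers, which is already undefined behavior). `store_both` and `store_both_restrict` are hypothetical names of mine, not anything from the thesis:

```c
#include <assert.h>

/* Hypothetical helper: the two stores from the triple, then a read of *p. */
int store_both(int *p, int *q) {
    *p = 37;
    *q = 42;
    return *p; /* 42 if p and q alias, 37 otherwise */
}

/* Same body, but restrict promises the compiler that p and q never alias.
 * Calling this with p == q is undefined behavior. */
int store_both_restrict(int *restrict p, int *restrict q) {
    *p = 37;
    *q = 42;
    return *p;
}
```

so `store_both(&a, &b)` reads back 37 and `store_both(&a, &a)` reads back 42, which is the whole "? " in the postcondition.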

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

turns out a "hoare triple" is the fuckboy term for "precondition and postcondition" when you think you're the first to ever create an abstract machine for the C programming language

he never cites anything about a Hoare triple either

Hoare triples, where a block of code is preceded by a pre-condition and followed by a post-condition, have already appeared in §1.1.1.

he did just refer you, with a hyperlinked section heading, to the line above where he just says "triple"
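
for anyone who hasn't met the term: a rough sketch of a triple written as plain C, with the pre- and post-condition checked at run time by `assert` instead of being proved. `successor` and the particular conditions are made up for illustration:

```c
#include <assert.h>
#include <limits.h>

/* Hypothetical triple: {0 <= x < INT_MAX}  y = x + 1  {y > 0} */
int successor(int x) {
    assert(x >= 0 && x < INT_MAX); /* pre-condition */
    int y = x + 1;                 /* the command */
    assert(y > 0);                 /* post-condition */
    return y;
}
```

a verifier would discharge the second assert for every x allowed by the first one, rather than for the handful of inputs a test happens to try.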

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

The aliasing problem is much worse for inductively-defined data structures, where it is possible that structural invariants can be violated, and where we need more sophisticated recursive predicates to stipulate aliasing conditions.

this problem is so bad he has no example

These predicates appear in specifications, invariants and proofs, and their discovery is often a time consuming trial-and-error process.

sorry that you hate your job?

The aliasing situation becomes untenable when code is type-unsafe and we are forced to seek improved methods.

if the C compiler can unambiguously interpret it then maybe the C compiler is a better proof framework than isabelle/HOL

If instead we had a variable float * p:
{| True |} ∗p = 3.14 ; ∗q = 42 ; {| ∗p = ? |}
then not only do we have to consider aliasing between pointers of different types, but also the potential for p to be pointing inside the encoding of ∗q and vice versa.

i don't know what language you're verifying but C has semantics here
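
to be concrete about the semantics i mean: as far as i understand the effective type rules, reading an `int` object through a `float *` is undefined, while copying the bytes with `memcpy` is defined. a small sketch, where `float_bits` is a hypothetical helper and the expected bit pattern assumes IEEE 754 binary32 floats:

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: read the object representation of a float as a
 * 32-bit integer. memcpy is a defined way to do this; dereferencing a
 * cast pointer such as *(uint32_t *)&f would break the effective type rules. */
uint32_t float_bits(float f) {
    uint32_t bits;
    _Static_assert(sizeof bits == sizeof f, "assumes a 32-bit float");
    memcpy(&bits, &f, sizeof bits);
    return bits;
}
```

on an IEEE 754 target `float_bits(1.0f)` is `0x3f800000`.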

We talk about this phenomenon as inter-type aliasing.

who's we

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

While specifications may mention some state that is affected by the intended behaviour of a program, it is hard to capture the state that is not changed.

literally a nonsense sentence

In the above example, a client verification that also dereferences a pointer r, not mentioned in the specification, has no information on its value after execution of the code fragment.

meaningless

This limits reusability and hence scalability of verifications.

the C compiler is a better proof framework than isabelle/HOL
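
for what it's worth, here is my best guess at what the quoted passage is describing, as a sketch. `client` is a hypothetical name; the point is that a spec about `*p` and `*q` says nothing about `*r` unless you also know `r` aliases neither of them:

```c
#include <assert.h>

/* Hypothetical client of the fragment above. The spec only talks about
 * *p and *q; the frame condition is the extra fact that *r is untouched,
 * which only holds if r aliases neither p nor q. */
int client(int *p, int *q, int *r) {
    int before = *r;
    *p = 37;
    *q = 42;
    return *r == before; /* 1 when the frame condition held */
}
```

with three distinct ints this returns 1; pass the same address for `q` and `r` and it returns 0.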

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

now he's explaining why he trusts machine output more than anyone

Formal reasoning with the usual process of pen-and-paper mathematical proof neither scales as we would like in software verification nor has the expected degree of rigour.

you know humans made isabelle/HOL too right

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

"scales" i think he literally just means deskilling here. he's literally saying actually formal verification is more reliable than engineers you pay money to

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

now he's trying to mansplain "algorithmic verification" as if model checking in TLA+ https://lamport.azurewebsites.net/tla/high-level-view.html isn't actually extremely widely used in this exact space already

Research in this area has produced impressive results in recent years with improvements in the underlying theory and increased available computing power.

theory and computing power are the only things that produce impressive results. no i can't give you a citation it's not like this is my dissertation

They can catch increasingly wider classes of programmer errors

this is still a fucking hateful way to talk about formal verification

and even guarantee the absence of certain types of bugs.

such a half-assed phd thesis. do you even believe in what you're saying

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

Proof creation has a high cost associated with it.

yeah you keep saying it sucks. aren't you supposed to fix things like that

To give an idea, it has been the author’s experience [93, 94, 96] that, in an interactive theorem prover, verifying the functional correctness of C code can require between one and two orders of magnitude more proof steps than line count,

(a) yeah maybe cause you keep trying to verify functional correctness guarantees about heap allocations in ring 0
(b) "line count" appeals to the VC class. it doesn't work on programmers

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

literally everyone go read aaron turon's paper on weak atomic memory orderings right now https://plv.mpi-sws.org/gps/paper.pdf yes there's a coq proof but that's not what a paper is for!!!

check out this future work section:

However, the C11 model allows programmers to freely mix memory orderings, and ideally program logics should support such mixed reasoning as well.

literally it's that easy to give a shit about making people safe and providing powerful robust guarantees. this is why rust used to be good

Early investigation suggests that the C11 model has some corner cases when mixing memory orderings that may obstruct compositional reasoning principles.

i get nerd sniped every time i read that line lmao
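
a tiny sketch of what "freely mix memory orderings" looks like in C11, not taken from the paper: the payload uses relaxed accesses and only the flag uses release/acquire. `publish`, `try_consume`, `payload` and `ready` are names i made up:

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical message-passing pair that mixes orderings on purpose. */
static atomic_int payload;
static atomic_bool ready;

void publish(int value) {
    atomic_store_explicit(&payload, value, memory_order_relaxed);
    /* release: a reader that acquires `ready` also sees the payload store */
    atomic_store_explicit(&ready, true, memory_order_release);
}

/* Returns true and fills *out once the flag has been observed. */
bool try_consume(int *out) {
    /* acquire: pairs with the release store in publish() */
    if (!atomic_load_explicit(&ready, memory_order_acquire))
        return false;
    *out = atomic_load_explicit(&payload, memory_order_relaxed);
    return true;
}
```

the intended use is one thread calling `publish` and another polling `try_consume`; calling both from a single thread also works and is the easy way to poke at it.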

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

We believe the extra generality of GPS is important because it enables us to verify a wider class of weak memory programs, including those whose observable behavior is not SC. The circular buffer and Michael-Scott queue are good examples of this (see the appendix [1]). Singh et al. [35] argue that one should not expose the high-level programmer to such non-SC data structures, but GPS shows that in fact it is possible to reason sensibly and modularly about them.

keep in mind seL4 doesn't even represent concurrent behavior at all in their 2009 proof (as far as i can tell), even though concurrency semantics are a feature on any system with a motherboard. and aaron turon shows actually we can make it easier to write complex code with formal verification

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

honestly i should totally mess around with coq semantics for my ring buffer from hell
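
not my actual ring buffer and not the circular buffer from the GPS appendix, just the textbook single-producer single-consumer shape such a proof would start from. `struct ring`, `RING_CAP`, `ring_push` and `ring_pop` are all hypothetical names:

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

#define RING_CAP 4u /* hypothetical capacity, must be a power of two */

struct ring {
    int slots[RING_CAP];
    atomic_size_t head; /* next slot to read, written only by the consumer */
    atomic_size_t tail; /* next slot to write, written only by the producer */
};

/* Producer side: returns false when the ring is full. */
bool ring_push(struct ring *r, int v) {
    size_t tail = atomic_load_explicit(&r->tail, memory_order_relaxed);
    size_t head = atomic_load_explicit(&r->head, memory_order_acquire);
    if (tail - head == RING_CAP)
        return false;
    r->slots[tail % RING_CAP] = v;
    /* release: publishes the slot write to the consumer */
    atomic_store_explicit(&r->tail, tail + 1, memory_order_release);
    return true;
}

/* Consumer side: returns false when the ring is empty. */
bool ring_pop(struct ring *r, int *out) {
    size_t head = atomic_load_explicit(&r->head, memory_order_relaxed);
    size_t tail = atomic_load_explicit(&r->tail, memory_order_acquire);
    if (head == tail)
        return false;
    *out = r->slots[head % RING_CAP];
    /* release: tells the producer the slot can be reused */
    atomic_store_explicit(&r->head, head + 1, memory_order_release);
    return true;
}
```

this only holds up with exactly one producer thread and one consumer thread; anything more needs a different design.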

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

oh yeah the other guy was telling me how much it sucks to verify stuff

and in a single person year, we may be limited to verifying no more than something in the order of 1000 lines-of-code (LoC). Little data exists for how proof-based projects scale, but it is unlikely to be linear.

still just ridiculous that this guy is still talking "verification" without doing the work in the c compiler. i know people do that

The economics of verification have two significant consequences.

so bleak to talk about your research focus like this

First, the range of systems we can hope to verify is limited, but is still large enough to be practically interesting.

that's literally not a "consequence" why would you invoke proof jargon incorrectly lmao

Modern microkernels, with implementations around 10,000 LoC are hopefully within the realm of possibility.

you were just a moment ago saying int * p; int * q; was beyond your abilities

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 6 hours ago

Verification of such systems can bring significant improvements to the reliability of the entire software stack, as above the microkernel layer hardware protection domains limit the impact any incorrectly behaving software has on the trusted computing base [83].

microkernels don't consider it their problem to provide any sort of correctness guarantees except for their own behavior, so this is just a lie.
the MMU isolation is from the CPU, not the microkernel

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

i will literally die mad about how casually they mentioned fucking shared memory pages are a replacement for sequenced writes https://trustworthy.systems/publications/nicta_full_text/8988.pdf

In original L4, “long” messages could specify multiple buffers in a single IPC invocation to amortise the hardware mode- and context-switch costs.

a single crumb of structured I/O

While long IPC provides functionality that cannot be emulated

literally the actual criterion for minimality

Shared buffers can avoid any explicit copying between address spaces

"microkernel layer hardware protection domains" cmon

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

The result was significant kernel complexity, with many tricky corner cases that risked bugs in the implementation.

i thought that's why we used formal verification? that's why microkernels were worth the cost of proofs?

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

after reading all this my impression continues to be that microkernels don't do enough isolation at all!!! i even dug up build systems a la carte https://www.microsoft.com/en-us/research/wp-content/uploads/2018/03/build-systems-final.pdf where simon peyton-jones tried to pull this same shit about build systems

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

i do actually appreciate that seL4 has a lot of use in single-core embedded applications where you're typically not just greedy for i/o like me and the purpose of an OS actually aligns reasonably well with the atomic i/o APIs

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

For seL4 there are even stronger reasons for staying away from supporting long messages: The formal verification approach explicitly avoided any concurrency in the kernel [Klein et al. 2009], and nested exceptions introduce a degree of concurrency.

i also very specifically want to avoid introducing subtle concurrency bugs but i'm doing that by expanding "isolation" beyond the MMU and expanding named "synchronization contexts" to structure literally all the externally-visible state changes like i/o

i absolutely don't think i could do seL4 better, and i'm not planning to inject tons of confusing and poorly-documented semantics like linux

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

at first i was thinking "let's literally just add buffers between everything" but then i got hooked on transactions

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

the one concurrency i will have to figure out is multiple processes writing to the same synchronization domain at once. i think i'm gonna try my damndest to avoid having to use any red-black trees. maybe i'll make it possible to open the same file/shm mapping rw in two+ threads/processes at once but you have to explicitly tell me you actually want me to handle possibly-concurrent write requests to this shared resource

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

i also got upset about pipes when i learned even though userspace uses them like ring buffers their semantics just encode the whole monolithic memory architecture i h8. they're literally just a fixed-size queue for atomically pushing/pulling some floating pages
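
the userspace view of that queue, as a minimal sketch assuming a POSIX system. `pipe_round_trip` is a hypothetical helper; the only real calls are `pipe`, `write`, `read` and `close`:

```c
#include <assert.h>
#include <string.h>
#include <unistd.h>

/* Hypothetical round trip through a POSIX pipe: bytes written to fds[1]
 * sit in a kernel-side buffer until they are read from fds[0].
 * Returns the number of bytes read back, or -1 on failure. */
int pipe_round_trip(const char *msg, char *out, size_t out_len) {
    int fds[2];
    if (out_len == 0 || pipe(fds) != 0)
        return -1;
    size_t len = strlen(msg);
    /* POSIX makes writes of at most PIPE_BUF bytes atomic */
    ssize_t written = write(fds[1], msg, len);
    close(fds[1]);
    ssize_t got = -1;
    if (written == (ssize_t)len)
        got = read(fds[0], out, out_len - 1);
    close(fds[0]);
    if (got < 0)
        return -1;
    out[got] = '\0';
    return (int)got;
}
```

this keeps the message small on purpose: a write larger than the pipe's capacity would block here because nothing is reading yet.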

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

i have such a negative ranty post i haven't sent from many hours ago but seL4's autobio paper ending with the very clear remark "we can't figure out how to schedule anything, nothing works" -- i didn't see that as like indicative of moral decline. to me it was clarifying!

i also felt this way learning that linux and openbsd also schedule their processes the exact same way seL4 does (to my mind at least), which is generally round-robin

it's actually kinda absurd thinking about how scheduling based upon something besides fair slicing ends up imposing this huge huge huge change in the way the entire system operates!

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

not just me being contrarian when i say driving scheduling from the active data dependency graph is really fascinating to consider too because that's also exactly where it would make sense to update the page attribute table

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

and telling the CPU to schedule my pages while then scheduling the task that's gonna want them sounds so cute

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

like the only reason i will ever get around to actually doing this is because i want to have extremely deep control over where and how memory flows (including persisted)

and i'm still excited about this bc there's no atomic globally visible changes ever (maybe i/o devices) which is the stuff that makes my brain hurt when linux does it

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

how simple do my stateful message queues need to get before i can start pretending it's kind of like formal verification

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

this other citation "why do people still use c for high-reliability environments" https://dl.acm.org/doi/10.1145/1215995.1216004 because nowhere else is willing to maintain a lingua franca out of the goodness of their heart

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

i was thinking about this too after i learned about the caveats with side effect sequencing. i don't think there's anything terribly special about C other than i know gcc devs are genuinely sweet and thoughtful and passionate

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 5 hours ago

if i wanted to make a language for the macrokernel i would have to decide to understand a lot more than i do now about what the hell a kernel is and especially boot logic. and then i'd have to learn about disk persistence.

i think as an implementation language for managing memory and cpu structures there definitely could be better frontends. and i think once i feel more comfortable about the hardware semantics (particularly x86 cpu + ssd nvme) then i will want to start generating structures that translate the macrokernel user API in my head to match the requirements from the hardware

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

like an interlocking web of formal models dancing together in the memory page prairie is actually what i see in my head when i think of the end goal

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

look up this paper on "model checking c source code for embedded systems" https://link.springer.com/article/10.1007/s10009-009-0106-5

buy it for $40! thanks!

two just horrifying suggestions to purchase below that:

A Model Checker Collection for the Model Checking Contest Using Docker and Machine Learning
Finding software vulnerabilities in large C projects via bounded model checking

SpringerLink

Model checking C source code for embedded systems - International Journal on Software Tools for Technology Transfer

In this paper, the applicability of model checking to C code for embedded systems is studied. The paper is divided into two parts. In the first part, 13 existing model checkers for C code are detailed and evaluated for their applicability in the verification of C code for embedded systems. A case study is presented that applied CBMC as one representative C code model checker to an exemplary microcontroller program. As a consequence of this case study, we decided to develop a new model checker for source code for microcontrollers, called [mc]square. It is described in the second part of this paper. We present the architecture and the peculiarities of [mc]square, and we successfully applied [mc]square to the same microcontroller program used in the case study.

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

never sure how to take lines like these https://sci-hub.st/10.1007/s10009-009-0106-5

The disadvantage is that all specific knowledge of the C code and the underlying hardware has to be used in the abstraction process as the general purpose model checkers are not aware of these peculiarities.

i assumed everyone doing this sort of thing was the author of the C code they're checking and that everything of course has to be specialized to the particular CPU. i don't know what anyone would expect to get out of model checking otherwise

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

actually delighted to hear not only that gcc tends to be the de facto here but that CIL is compatible. CIL sounds sick

If the GCC compiler supports the chosen microcontroller, the adjustments are less costly since many of the C code model checkers use the GCC compiler or a compatible framework such as CIL for preprocessing.

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

yet another seemingly-legit ARPANET paper that seems like it was made to be found discarded in an abandoned laboratory https://people.mpi-sws.org/~gummadi/teaching/sp07/sys_seminar/arpanet.pdf

Attempts at computer networks have been made in the past

"but they weren't evil enough for our purposes".

dude is absolutely crashing out about "load sharing", claiming it will never be worth the cost, and computer programs are incompatible, etc. given that i know that worked for parallel scala compiles, it seemed confusing until the next section

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

Data Sharing:
The program is sent to a remote computer where a large data bank exists.
This type of operation will be particularly useful where data files are too large to be duplicated economically.

so our boy lawrence g. roberts totally predicted bazel cloud builds and github actions.

Access to this data base will be required simply to make an inquiry or may involve executing a complex program using the data base.

mysterious access control mechanisms? potential surveillance? it gets better:

This type of use is particularly important to the military for command and control, information, retrieval, logistics and war gaming applications.
In these cases, one command would send a program to be executed at another center where the data base existed.

i really never know if they're just saying intentionally ridiculous shit

note how he distinguishes "send a program" -- clearly an RPC call, which were definitely around at the time

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

fidonet did a ton of load sharing on a per-file basis, including some really interesting locality-based queueing

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

Program Sharing: Data is sent to a program located at a remote computer and the answer is returned. Software of particular efficiency or capability exists on certain machines.

literally this is all google tech lmao it's like he's salivating over this

The use of specialized programs at remote facilities makes possible large gains in performance.
Perhaps even more important is the potential saving in reprogramming effort.

ridiculous shit

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

yeah and then he mentions three separate times how scientists can use it to do new science together. it seems important that scientists are on it at all

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

and i found all that about DARPA salivating over people never writing their own programs again because of:

the seL4 paper which got best paper https://web.archive.org/web/20110219113850/http://www.ok-labs.com/releases/release/open-kernel-labs-paper-on-formal-verification-wins-top-prize-at-prestigious

still think this paper is terrible. it keeps saying it made compromises for verifiability and wildly overstates the guarantees

turns out that conference SOSP is literally the (ACM) conference whose first year was when a big ARPANET thing was unveiled https://en.wikipedia.org/wiki/Symposium_on_Operating_Systems_Principles
there's these unstructured notes from the fucking pentagon lmao https://web.archive.org/web/20150405055923/https://web.stanford.edu/dept/SUL/library/extra4/sloan/mousesite/EngelbartPapers/B1_F20_CompuMtg.html
and finally this is IETF at its best https://en.wikipedia.org/wiki/Shared_resource choose the most generic possible term that sounds like page cache, but it specifically means network share

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 4 hours ago

oh this is a great tidbit

Before 2023, SOSP was held every other year, alternating with the conference on Operating Systems Design and Implementation (OSDI);
starting 2024, SOSP began to be held every year.

lots of weird things like this

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

oh then i found this guy who was at the arpanet conference http://royalsocietypublishing.org/rsbm/article-pdf/doi/10.1098/rsbm.2002.0006/911101/rsbm.2002.0006.pdf

so like this guy is easily off the charts evil imho. this is him saying he was smarter and braver than alan turing:

My few contacts with Turing were not encouraging. I wanted to talk to him about the remarkable results of his paper ‘On computable numbers’. Reading this paper I had found numerous errors in the formal specification of the universal computer. Some were trivial but others were quite subtle and I was not sure that my solutions were correct. When I came to this point, Turing became more and more agitated, until I could see that no sensible discussion was possible. Clearly he felt the errors to be irrelevant and my drawing attention to them rather foolish.

then he mysteriously advises on "cryptography" from the late 80s until he finally fucked off this planet

Retirement did not by any means imply inactivity. For the next 15 years Davies practised
as a consultant in security engineering for the financial and media industries. This was at a
time when systems based on cryptographic and similar techniques were coming into wide use
both for cash cards and pay television.

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

omg

Davies was very much an engineer rather than a scientist, and he was always on the lookout
for topics in which his ingenuity and insight could be deployed.

guy who steals people's ideas

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

He was early in the field of civil uses of cryptography and always had a healthy scepticism of claims to perfection.

terrifying

Mathematical proof of the security of a system struck him as dubious, because it is much easier to prove resistance to attacks one has thought about than it is to prove resistance to attacks one has not thought about.

guy who knows how cryptography works

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

omg the EROS author is literally validating all my ideas about the macrokernel love that for me

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

Systems programs are strongly driven by bulk I/O
performance.

he keeps talking about multiple aliasing being problematic and of course this is why i decided to never share anything and instead have layered i/o queues

In systems code, the effect of representation and data placement can be extreme. Bonwick et al. discuss some of these effects [5], noting that the performance of system-level benchmarks can change by 50% through careful management of cache residency and collisions.

but how do you manage something "carefully" if all the interfaces allow for is urgency???

This tends to penalize the performance of automatic storage reclamation strategies. To make matters more interesting, there are caches.

yeah it rly annoys me how the filesystem has its own caches and the kernel has its own caches but there's this assumption that persistence is always the final destiny of all writes

It follows that user-managed storage is a requirement, but perhaps not in fully general form.

that's exactly it!

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

The facts say otherwise. The annual cost to operate a large banking data center today is $150,000 per square foot. It is by far the most expensive real estate in the world, and more than one third of that cost is the cost of cooling the data center.

2006!!!!

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

wait shit he had a point:

The general rule of thumb is that power is proportional to V²F: the square of the voltage times the frequency. Most of this power is wasted as heat. To a system’s programmer, the cost of doubling the clock rate is $50,000 per square foot per machine room.

do i detect an IETF hater???

Raising the clock rate decidedly isn’t free, and walking into the network distribution closet at your business or school will quickly convince you that current power usage is excessive.
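
working the rule of thumb through, holding the voltage fixed (my assumption, the paper doesn't say):

```latex
P \propto V^{2}F
\qquad\Longrightarrow\qquad
\frac{P(V,\,2F)}{P(V,\,F)} = \frac{V^{2}\cdot 2F}{V^{2}\cdot F} = 2
```

so doubling the clock doubles the power and therefore the heat. one third of the $150,000 quoted earlier is $50,000, which is presumably where the per-square-foot figure comes from: twice the heat, roughly one more cooling bill. that last step is my reading, not something the paper spells out.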

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

oh oops he glazes up tcp/ip immediately after. this is "Programming Language Challenges in Systems Codes" by jonathan shapiro

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

on the internet:

Large block processing costs are dominated by memory bandwidth, not software overheads.

that makes sense. the difficulty with fitting network i/o into my beautiful symphony of data locality is that the network is "necessarily global" in some sense, and can't do multi-level queueing or w/e because you can't dictate to network resources how fast or slow to send data to you!

As Blackwell discusses [4], processing overhead on smaller packets is necessarily much higher.

hmmmm

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

vaguely interesting microsoft research paper https://research.cs.wisc.edu/areas/os/Seminar/schedules/papers/Deconstructing_Process_Isolation_final.pdf

A software isolated process is a collection of memory pages and a language safety mechanism that ensures that code in a process cannot access another process’s pages. A SIP replaces hardware memory protection with static verification of program safety.

DEEPLY suspicious to hear "replaces hardware memory protection" coming from microsoft lmao

They rely on verifying code’s safe behavior to prevent it from accessing another process’s (or the kernel’s) instructions or data.

LMAO

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

However, language safety offers important benefits not provided by hardware process protection, for example, detecting in-process errors such buffer overruns.

literally nothing in this paper makes any sense

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

just read a liedtke paper for the first time https://cgi.cse.unsw.edu.au/~cs9242/19/papers/Liedtke_93.pdf i think this guy is crazy for still trying to make ipc faster but this was actually cool to read. should have thought to learn that context first before hating on all the modern microkernel stuff =\

and he completely blew my fucking mind with this lmao:

5.3.5 Direct Process Switch
For a remote procedure call it is natural to switch the flow of control directly to the called thread, donating the current timeslice to it (as also LRPC does).
This is also the most efficient method, since it only involves changing stack pointer and address space.

i don't think i would ever have thought of that myself and i can see why all-consuming focus on a hopeless task can actually get you places sometimes if you don't half-ass it
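
a userspace analogy only, nothing like the real L4 code: with `ucontext` the caller can hand control straight to the callee and get it handed straight back, with no run queue in between. this assumes a libc that still ships `ucontext.h` (glibc does); `call_direct`, `callee` and the globals are hypothetical names:

```c
#include <assert.h>
#include <ucontext.h>

/* The caller switches directly to the callee with swapcontext(), and the
 * callee switches directly back once the reply is ready. */
static ucontext_t caller_ctx, callee_ctx;
static char callee_stack[64 * 1024];
static int request, reply;

static void callee(void) {
    reply = request + 1;                   /* the "remote procedure" */
    swapcontext(&callee_ctx, &caller_ctx); /* switch directly back */
}

int call_direct(int arg) {
    request = arg;
    getcontext(&callee_ctx);
    callee_ctx.uc_stack.ss_sp = callee_stack;
    callee_ctx.uc_stack.ss_size = sizeof callee_stack;
    callee_ctx.uc_link = &caller_ctx;
    makecontext(&callee_ctx, callee, 0);
    swapcontext(&caller_ctx, &callee_ctx); /* donate control to the callee */
    return reply;
}
```

the kernel version in the paper swaps the stack pointer and the address space; this sketch only swaps stacks and registers inside one process, so it shows the control flow and nothing about the cost.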

guy seems cool
