Post · bonfire.cafe

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

having learned slightly more of russian history, i reassert my initial response to:

Россія to Россия

you literally closed down the discotheque and put bars over its doors!!!!
we were all chilling around the tree together, ссuddling together: now it becomes brother against brother (ссия)

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

i also think the font i'm reading this in spent so much love around kerning the сія

LOOK!!!! IT'S SO CUTE

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

so it's important to understand that is not representative of the way those letters looked and felt to the subjects of the orthographic reform

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 3 hours ago

i guess korn didn't use the и at all. oh 😞 omg 😞 i was gonna say "why does и have fascist vibes" and i realized that's the fucking putinoid russian Z sigil

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 2 hours ago

god wait so ok if "Z" is actually и which is the letter that closed the doors of the discotheque and turned brother against brother i feel like at some level that's also invoking the bolshevik orthographic reforms (cause putinoids are like yeah!!!! conquest!!!!!) like it's a fucking noose sign. like "you won't last long" some shit like that

idk what it being sideways would mean (besides swastika) but it strikes me of the manner of a lynch mob

~24 more replies

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 2 hours ago

so recall this classic text of mine https://codeberg.org/cosmicexplorer/corporeal

Consider the following examples:
"THE TRAGEDY OF HAMLET, PRINCE OF DENMARK" = 40 unicode chars, 40 bytes"Que trata de la condición" = 25 unicode chars, 26 bytes"ЧАСТЬ ПЕРВАЯ" = 12 unicode chars, 23 bytes"源氏物語" = 4 unicode chars, 12 bytes

the reason for hamlet was because i fucking love hamlet and because it begins a tragedy. the rest are all also in the repo. i absolutely did just scan for shit that looks wack all in a row like that. the kanji were obviously chosen because they are the least likely to scan like sounds to us english speakers. so we begin with the sound and fury. i leave you with a noise you cannot speak

As we can see from the above examples, the further one gets from the latinate caricature of US english, the more space is taken to represent it.

i admit "latinate caricature" just sounds silly. but the romans were not silly. i think i hadn't yet decided upon the IETF being richard nixon so it's funny that i noticed ASCII and made some basic assumptions

Codeberg.org

corporeal

String library that uses corpus dictionaries to produce a more efficient encoding than UTF-8.

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 2 hours ago

it remains the case that representing languages outside of US English requires significantly more memory during usage, more disk space to store, and more bandwidth to transmit

yeah. yeah. yeah. yeah