@Taweret finally did the "vibe code instructions from audio of a cat meowing" challenge and we are cooking. it doesn't have a builtin audio transcription feature, so it started copy-pasting some first-year grad student's parametric spectrum-to-phoneme code, and this is where we're at.
@happyborg
i have been doing this, poking at language models, for a long time, unfortunately
@jonny thank you for your sacrifice 🙇
@happyborg it used to be fun before global capital bet its life on them performing all labor
@Taweret that's it pawning off the task of building the thing on some subagent, so the LLM is telling another LLM to build an app based on a voice memo of ow-aw-ow-ow-aw-ow-ow-ow, which it has decided is a work order management app. to be fair it did say "idk what this audio is," and it took some coaxing to make it not just give up, but this is where it got to eventually.
@Taweret so it's sort of a bust because the thing doesn't have a builtin audio feature, but i am going to see if i can coax it into thinking it's a real language that it can understand
@happyborg @Taweret we got human rights for code before we got human rights for humans
@Taweret also just to be super clear and not give credit where credit is not due, the phoneme detector does not fucking work at all, this is the transcript for this video: https://neuromatch.social/@jonny/116196912161623726
its phoneme detector was specialized to detecting only vowels in 4x slowed speech because it decided that a) the cat's meows were normal speech just 4x pitch shifted and b) that vowels and frication are the most informative phonetic features
@Taweret i actually do know a thing or two about parametrically detecting phonemes from acoustics and i can indeed conclude that this is a hallucinatory nightmare that is like tiny pieces of working techniques slapped together and terminating in a final "if all else fails, the phoneme is a 'c'"
@Taweret to a language model, everything is a temporarily embarrassed kanban board flask app