Discussion
Hacker News
@h4ckernews@mastodon.social · 4 days ago

Bag of words, have mercy on us

https://www.experimental-history.com/p/bag-of-words-have-mercy-on-us

#HackerNews #BagOfWords #NaturalLanguageProcessing #AIResearch #TechCommunity

Esther Payne :bisexual_flag: boosted
Aaron
@hosford42@techhub.social · 2 weeks ago

If you want a specific example of why many researchers in machine learning and natural language processing find the idea that LLMs like ChatGPT or Claude are "intelligent" or "conscious" laughable, this article describes one:

https://news.mit.edu/2025/shortcoming-makes-llms-less-reliable-1126

#LLM
#ChatGPT
#Claude
#MachineLearning
#NaturalLanguageProcessing
#ML
#AI
#NLP

Hacker News
@h4ckernews@mastodon.social · 3 weeks ago

Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in LLMs

https://arxiv.org/abs/2511.15304

#HackerNews #AdversarialPoetry #LLMs #Jailbreak #AIResearch #NaturalLanguageProcessing

Hacker News
@h4ckernews@mastodon.social · 4 weeks ago

Continuous Autoregressive Language Models

https://arxiv.org/abs/2510.27688

#HackerNews #Continuous #Autoregressive #Language #Models #NaturalLanguageProcessing #AI #Research #MachineLearning #TransformerModels

The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis for LLM scaling: increasing the semantic bandwidth of each generative step. To this end, we introduce Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next-vector prediction. CALM uses a high-fidelity autoencoder to compress a chunk of K tokens into a single continuous vector, from which the original tokens can be reconstructed with over 99.9% accuracy. This allows us to model language as a sequence of continuous vectors instead of discrete tokens, which reduces the number of generative steps by a factor of K. The paradigm shift necessitates a new modeling toolkit; therefore, we develop a comprehensive likelihood-free framework that enables robust training, evaluation, and controllable sampling in the continuous domain. Experiments show that CALM significantly improves the performance-compute trade-off, achieving the performance of strong discrete baselines at a significantly lower computational cost. More importantly, these findings establish next-vector prediction as a powerful and scalable pathway towards ultra-efficient language models. Code: https://github.com/shaochenze/calm. Project: https://shaochenze.github.io/blog/2025/CALM.
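To make the next-vector idea concrete, here is a minimal PyTorch sketch of the shape of the computation, not the authors' implementation (see their repo above): a toy autoencoder packs K tokens into one continuous vector, and an autoregressive backbone steps once per vector instead of once per token. All module choices, sizes, and the plain MSE objective are hypothetical simplifications; the paper uses a likelihood-free training framework.

import torch
import torch.nn as nn

K, VOCAB, D_TOK, D_VEC = 4, 32000, 256, 512  # chunk size and dims (assumed)

class ChunkAutoencoder(nn.Module):
    """Compress K tokens <-> one continuous vector (toy stand-in)."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, D_TOK)
        self.enc = nn.Linear(K * D_TOK, D_VEC)   # K tokens -> 1 vector
        self.dec = nn.Linear(D_VEC, K * VOCAB)   # 1 vector -> K token logits

    def encode(self, tokens):                    # tokens: (B, K)
        return self.enc(self.embed(tokens).flatten(1))

    def decode(self, vec):                       # vec: (B, D_VEC)
        return self.dec(vec).view(-1, K, VOCAB)  # (B, K, VOCAB) logits

# The backbone now takes one generative step per *vector*, cutting the
# number of autoregressive steps by a factor of K.
backbone = nn.GRU(D_VEC, D_VEC, batch_first=True)  # stand-in for a transformer
head = nn.Linear(D_VEC, D_VEC)                     # predicts the next vector

ae = ChunkAutoencoder()
tokens = torch.randint(0, VOCAB, (2, 8 * K))       # toy batch: 8 chunks of K tokens
vecs = torch.stack([ae.encode(c) for c in tokens.split(K, dim=1)], dim=1)
hidden, _ = backbone(vecs[:, :-1])                 # predict vector t+1 from vectors 1..t
pred = head(hidden)
# MSE here only keeps the sketch self-contained; CALM trains likelihood-free.
loss = nn.functional.mse_loss(pred, vecs[:, 1:])
print(loss.item())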
Hacker News
@h4ckernews@mastodon.social · last month

Language Models Are Injective and Hence Invertible

https://arxiv.org/abs/2510.15511

#HackerNews #LanguageModels #Invertibility #AIResearch #NaturalLanguageProcessing #MachineLearning

Transformer components such as non-linear activations and normalization are inherently non-injective, suggesting that different inputs could map to the same output and prevent exact recovery of the input from a model's representations. In this paper, we challenge this view. First, we prove mathematically that transformer language models mapping discrete input sequences to their corresponding sequence of continuous representations are injective and therefore lossless, a property established at initialization and preserved during training. Second, we confirm this result empirically through billions of collision tests on six state-of-the-art language models, and observe no collisions. Third, we operationalize injectivity: we introduce SipIt, the first algorithm that provably and efficiently reconstructs the exact input text from hidden activations, establishing linear-time guarantees and demonstrating exact invertibility in practice. Overall, our work establishes injectivity as a fundamental and exploitable property of language models, with direct implications for transparency, interpretability, and safe deployment.
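The inversion claim is easy to illustrate on a toy model. The sketch below is not the authors' SipIt algorithm, only a hedged illustration of the principle it relies on: if the map from token sequences to hidden states is injective, the input can be recovered greedily, one position at a time, by testing which candidate token reproduces the observed activation (linear in sequence length, up to the vocabulary scan). The tiny GRU model and all sizes are hypothetical stand-ins.

import torch
import torch.nn as nn

torch.manual_seed(0)
VOCAB, D = 50, 16

class TinyLM(nn.Module):
    """Stand-in causal model exposing per-token hidden states."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, D)
        self.rnn = nn.GRU(D, D, batch_first=True)

    def hidden_states(self, tokens):          # tokens: (B, T)
        out, _ = self.rnn(self.embed(tokens))
        return out                             # (B, T, D)

model = TinyLM().eval()
secret = torch.randint(0, VOCAB, (1, 6))       # the "unknown" input text
with torch.no_grad():
    target = model.hidden_states(secret)       # the observed activations

# Greedy recovery: extend the recovered prefix with whichever token best
# matches the next observed hidden state. If the model is injective, the
# match is unique, so the argmin recovers the true token at every step.
recovered = []
with torch.no_grad():
    for t in range(secret.size(1)):
        best_tok, best_err = None, float("inf")
        for cand in range(VOCAB):
            trial = torch.tensor([recovered + [cand]])
            err = (model.hidden_states(trial)[0, t] - target[0, t]).norm().item()
            if err < best_err:
                best_tok, best_err = cand, err
        recovered.append(best_tok)

print("exact recovery:", recovered == secret[0].tolist())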