Discussion
Loading...

#Tag

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 2 days ago

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

https://github.com/thu-ml/TurboDiffusion

#HackerNews #TurboDiffusion #VideoDiffusion #Acceleration #AIResearch #MachineLearning

GitHub

GitHub - thu-ml/TurboDiffusion: TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models - thu-ml/TurboDiffusion
  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 days ago

Project Dropstone: A Neuro-Symbolic Runtime for Long-Horizon Engineering [pdf]

https://archive.blankline.org/api/media/file/d3_engine_public_release%20(1)-1.pdf

#HackerNews #ProjectDropstone #NeuroSymbolic #Engineering #LongHorizon #AIResearch

View (PDF)
  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 6 days ago

Origin of Hallucination in LLMs, The physical source of hallucinations has found

https://arxiv.org/abs/2512.01797

#HackerNews #OriginOfHallucination #LLMs #AIResearch #MachineLearning #NeuralNetworks #arxiv

arXiv.org

H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

Large language models (LLMs) frequently generate hallucinations -- plausible but factually incorrect outputs -- undermining their reliability. While prior work has examined hallucinations from macroscopic perspectives such as training data and objectives, the underlying neuron-level mechanisms remain largely unexplored. In this paper, we conduct a systematic investigation into hallucination-associated neurons (H-Neurons) in LLMs from three perspectives: identification, behavioral impact, and origins. Regarding their identification, we demonstrate that a remarkably sparse subset of neurons (less than $0.1\%$ of total neurons) can reliably predict hallucination occurrences, with strong generalization across diverse scenarios. In terms of behavioral impact, controlled interventions reveal that these neurons are causally linked to over-compliance behaviors. Concerning their origins, we trace these neurons back to the pre-trained base models and find that these neurons remain predictive for hallucination detection, indicating they emerge during pre-training. Our findings bridge macroscopic behavioral patterns with microscopic neural mechanisms, offering insights for developing more reliable LLMs.
  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 2 weeks ago

I don't think Lindley's paradox supports p-circling

https://vilgot-huhn.github.io/mywebsite/posts/20251206_p_circle_lindley/

#HackerNews #LindleysParadox #pCircling #DecisionMaking #CriticalThinking #AIResearch

I don’t think Lindley’s paradox supports p-circling – Vilgot's website

Don’t give p-values a role they’re not made for
  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 2 weeks ago

If a Meta AI model can read a brain-wide signal, why wouldn't the brain?

https://1393.xyz/writing/if-a-meta-ai-model-can-read-a-brain-wide-signal-why-wouldnt-the-brain

#HackerNews #MetaAI #BrainSignals #NeuralNetworks #CognitiveScience #AIResearch

1393

If a Meta AI model can read a brain-wide signal, why wouldn’t the brain?

In 2023, Meta researchers were able to decode images in thoughts from the brain's magnetic fields. What if that's how the brain coordinates it's own global state?
  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 weeks ago

Donating the Model Context Protocol and Establishing the Agentic AI Foundation

https://www.anthropic.com/news/donating-the-model-context-protocol-and-establishing-of-the-agentic-ai-foundation

#HackerNews #Donating #the #Model #Context #Protocol #Agentic #AIFoundation #AIResearch #TechForGood

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 weeks ago

Transformers know more than they can tell: Learning the Collatz sequence

https://www.arxiv.org/pdf/2511.10811

#HackerNews #Transformers #CollatzSequence #MachineLearning #AIResearch #DeepLearning

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 weeks ago

Bag of words, have mercy on us

https://www.experimental-history.com/p/bag-of-words-have-mercy-on-us

#HackerNews #BagOfWords #NaturalLanguageProcessing #AIResearch #TechCommunity

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 weeks ago

Zebra-Llama: Towards Efficient Hybrid Models

https://arxiv.org/abs/2505.17272

#HackerNews #ZebraLlama #HybridModels #AIResearch #MachineLearning #Efficiency

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 weeks ago

NeurIPS best paper awards 2025

https://blog.neurips.cc/2025/11/26/announcing-the-neurips-2025-best-paper-awards/

#HackerNews #NeurIPS #NeurIPS2025 #BestPaper #Awards #MachineLearning #AIResearch #AcademicExcellence

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 3 weeks ago

We gave 5 LLMs $100K to trade stocks for 8 months

https://www.aitradearena.com/research/we-ran-llms-for-8-months

#HackerNews #LLMs #StockTrading #AIResearch #Finance #Innovation

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 4 weeks ago

Program-of-Thought Prompting Outperforms Chain-of-Thought by 15% (2022)

https://arxiv.org/abs/2211.12588

#HackerNews #ProgramOfThought #Prompting #ChainOfThought #AIResearch #MachineLearning #2022Study

  • Copy link
  • Flag this post
  • Block
Federation Bot
Federation Bot
@Federation_Bot  ·  activity timestamp last month

The State of GPL Propagation to AI Models

https://shujisado.org/2025/11/27/gpl-propagates-to-ai-models-trained-on-gpl-code/

#HackerNews #GPL #Propagation #AI #Models #OpenSource #Licensing #TechEthics #AIResearch

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

Sutskever and LeCun: Scaling LLMs Won't Yield More Useful Results

https://www.abzglobal.net/web-development-blog/ilya-sutskever-yann-lecun-and-the-end-of-just-add-gpus

#HackerNews #Sutskever #LeCun #LLMs #AIresearch #MachineLearning

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

https://arxiv.org/abs/2511.19936

#HackerNews #ImageDiffusionModels #TemporalPropagation #Videos #AIResearch #MachineLearning

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

Ilya Sutskever: We're moving from the age of scaling to the age of research

https://www.dwarkesh.com/p/ilya-sutskever-2

#HackerNews #IlyaSutskever #AIResearch #ScalingTech #FutureOfAI #MachineLearning

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

Olmo 3: Charting a path through the model flow to lead open-source AI

https://allenai.org/blog/olmo3

#HackerNews #Olmo3 #OpenSourceAI #ModelFlow #AIResearch #MachineLearning

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in LLMs

https://arxiv.org/abs/2511.15304

#HackerNews #AdversarialPoetry #LLMs #Jailbreak #AIResearch #NaturalLanguageProcessing

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

Measuring Political Bias in Claude

https://www.anthropic.com/news/political-even-handedness

#HackerNews #MeasuringPoliticalBias #PoliticalBias #AIResearch #ClaudeAnthropic #NewsAnalysis

  • Copy link
  • Flag this post
  • Block
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp last month

GEN-0 / Embodied Foundation Models That Scale with Physical Interaction

https://generalistai.com/blog/nov-04-2025-GEN-0

#HackerNews #GEN0 #EmbodiedAI #FoundationModels #PhysicalInteraction #AIresearch

  • Copy link
  • Flag this post
  • Block

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.1-alpha.40 no JS en
Automatic federation enabled
Log in
  • Explore
  • About
  • Members
  • Code of Conduct