Discussion
Loading...

Post

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Kevin Thomas ✅
Kevin Thomas ✅
@kevinthomas@defcon.social  ·  activity timestamp 2 weeks ago

In the Reverse Engineering world, we have a rule: You don't own it until you can take it apart.

The same applies to Artificial Intelligence.

We are currently drowning in API wrappers. Everyone is "building AI apps," but very few people are looking at the wiring underneath. To truly understand modern LLMs, I decided to stop using libraries. I went back to the drawing board to build a custom architecture from scratch.

Meet SARAN (Shallow Auto-Regressive Attention Network).

It’s not designed to beat GPT-4. It’s designed to be transparent. 🔹 I built a strict 15-stage computational graph. 🔹 I manually implemented backpropagation to trace the gradients. 🔹 I scaled it to a 354M parameter model to watch how it learns.

I’ve documented the entire build log—including the architecture decisions and the "why" behind the math—in my new engineering newsletter, Bits & Neurons.

If you want to move beyond the hype and understand the mechanics of AI, read the full breakdown here: https://mytechnotalent.substack.com

#ArtificialIntelligence #MachineLearning #DeepLearning #ReverseEngineering #Engineering #BuildInPublic

Sorry, no caption provided by author
Sorry, no caption provided by author
Sorry, no caption provided by author
  • Copy link
  • Flag this post
  • Block

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.2-alpha.7 no JS en
Automatic federation enabled
Log in
  • Explore
  • About
  • Members
  • Code of Conduct