Discussion
Loading...

#Tag

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Hacker News
Hacker News
@h4ckernews@mastodon.social  ·  activity timestamp 6 days ago

Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

https://github.com/xaskasdf/ntransformer

#HackerNews #Llama3.1 #RTX3090 #NVMe #GPU #bypass #CPU #AItechnology

GitHub

GitHub - xaskasdf/ntransformer: High-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090.

High-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090. - xaskasdf/ntransformer
  • Copy link
  • Flag this post
  • Block
vermaden
vermaden
@vermaden@mastodon.bsd.cafe  ·  activity timestamp 9 months ago

Added 𝗨𝗣𝗗𝗔𝗧𝗘 𝟭 - 𝗛𝗶𝗴𝗵 𝗛𝗼𝗽𝗲𝘀 to 𝗙𝗮𝗶𝗹𝗲𝗱 𝗕𝗮𝗰𝗸𝘂𝗽 𝗦𝗲𝗿𝘃𝗲𝗿 𝗕𝘂𝗶𝗹𝗱 article.

https://vermaden.wordpress.com/2025/05/28/failed-backup-server-build/#high-hopes

#freebsd #nas #zfs #lowpower #nvme #ssd #tiny #small #backup #fail

Sorry, no caption provided by author
Sorry, no caption provided by author
Sorry, no caption provided by author
  • Copy link
  • Flag this post
  • Block
vermaden
vermaden
@vermaden@mastodon.bsd.cafe  ·  activity timestamp 9 months ago

New 𝗙𝗮𝗶𝗹𝗲𝗱 𝗕𝗮𝗰𝗸𝘂𝗽 𝗦𝗲𝗿𝘃𝗲𝗿 𝗕𝘂𝗶𝗹𝗱 [Failed Backup Server Build] article on my https://vermaden.wordpress.com/ blog.

https://vermaden.wordpress.com/2025/05/28/failed-backup-server-build/

#backup #data #freebsd #hardware #nvme #server #small #ssd #storage #tiny #unix #zfs #zpool

Sorry, no caption provided by author
Sorry, no caption provided by author
Sorry, no caption provided by author
  • Copy link
  • Flag this post
  • Block

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.2-alpha.34 no JS en
Automatic federation enabled
Log in
Instance logo
  • Explore
  • About
  • Members
  • Code of Conduct