Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU
https://github.com/xaskasdf/ntransformer
#HackerNews #Llama3.1 #RTX3090 #NVMe #GPU #bypass #CPU #AItechnology
#Tag
Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU
https://github.com/xaskasdf/ntransformer
#HackerNews #Llama3.1 #RTX3090 #NVMe #GPU #bypass #CPU #AItechnology
LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch
#HackerNews #LLMfromScratch #RTX3090 #BaseModel #AITraining #MachineLearning