Running DeepSeek-V4 locally with 4x legacy RTX 2080 Ti ($2k budget setup). Custom Turing kernels, W8A8 quantization, and 255 prefill tok/s!
We weren't able to fetch a full version of this story.
The publisher didn't expose a readable body and our fallback extraction came back empty. You can still read it at the source below — and our editorial angle / reactions remain attached.
Read at r/LocalLLaMAWhat people are saying
Discussion
Hot takes
Loading takes…
Comments
Discussion · 0
Sign in to comment, like, and save articles.
Sign inLoading comments…

