RTX 5080 16GB: Qwen3.6 35B MoE at 128k context — 56 tok/s, and why MTP doesn't help
We weren't able to fetch a full version of this story.
The publisher didn't expose a readable body and our fallback extraction came back empty. You can still read it at the source below — and our editorial angle / reactions remain attached.
Read at r/LocalLLaMAWhat people are saying
Discussion
Hot takes
Loading takes…
Comments
Discussion · 0
Sign in to comment, like, and save articles.
Sign inLoading comments…

