Running DeepSeek-V4 locally with 4x legacy RTX 2080 Ti ($2k budget setup). Custom Turing kernels, W8A8 quantization, and 255 prefill tok/s!

r/LocalLLaMA · May 20
