Avieshek@lemmy.world to Technology@lemmy.world (English) · 1 month ago
Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity (www.tomshardware.com)
The Hobbyist@lemmy.zip · 1 month ago: You can. I’m running a 14B deepseek model on mine. It achieves 28 t/s.
Viri4thus@feddit.org · 1 month ago: I also have a 3060. Can you detail which framework (sglang, ollama, etc.) you are using and how you got that speed? I’m having trouble reaching that level of performance. Thanks!
levzzz@lemmy.world · 1 month ago: You need a pretty large context window to fit all the reasoning; ollama forces 2048 tokens by default, and a larger window uses more memory.
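For anyone hitting that limit: ollama's real `num_ctx` parameter controls the context window and can be raised with a custom Modelfile. A minimal sketch (the `deepseek-r1:14b` tag and the 8192 value are illustrative assumptions, not from the thread; pick what fits your VRAM):

```
# Illustrative Modelfile: raise the context window above ollama's 2048-token default.
# FROM tag and num_ctx value are assumptions; tune them to your model and VRAM.
FROM deepseek-r1:14b
PARAMETER num_ctx 8192
```

Build and run it with `ollama create deepseek-14b-8k -f Modelfile` followed by `ollama run deepseek-14b-8k`. Note that a larger window costs more VRAM, so a 3060's 12 GB bounds how far you can push it.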
Jeena@piefed.jeena.net · 1 month ago: Oh nice, that’s faster than I imagined.