• Eager Eagle@lemmy.world · 1 month ago

    I bet he just wants a card to self-host models and not give companies his data, but the amount of VRAM is indeed ridiculous.

    • Jeena@piefed.jeena.net · 1 month ago

      Exactly, I’m in the same situation now, and the 8 GB in those cheaper cards doesn’t even let you run a 13B model. I’m trying to figure out whether I can run a 13B one on a 3060 with 12 GB.
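
      For a rough sense of whether it fits, here’s a back-of-envelope estimate (a minimal sketch assuming a llama.cpp-style 4-bit GGUF quant and Llama-2-13B-ish shapes; real usage varies with quant type, context length, and framework overhead):

      ```python
      # Back-of-envelope VRAM estimate for a quantized 13B model.
      # Assumes ~4.5 bits/weight (roughly a Q4_K_M quant) and an fp16 KV cache;
      # actual usage also includes compute buffers and framework overhead.
      params = 13e9                 # 13B parameters
      bits_per_weight = 4.5         # rough average for a 4-bit quant
      weights_gb = params * bits_per_weight / 8 / 1e9

      ctx = 2048                    # context window in tokens
      layers, kv_heads, head_dim = 40, 40, 128   # Llama-2-13B-ish shapes
      kv_cache_gb = 2 * layers * kv_heads * head_dim * ctx * 2 / 1e9  # K+V, 2 bytes each

      print(f"weights ~{weights_gb:.1f} GB, KV cache ~{kv_cache_gb:.1f} GB")
      # roughly 7.3 GB + 1.7 GB, so a 4-bit 13B should fit in 12 GB;
      # the KV cache grows linearly if you raise the context window.
      ```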

        • Viri4thus@feddit.org · 1 month ago

          I also have a 3060. Can you detail which framework (sglang, ollama, etc.) you are using and how you got that speed? I’m having trouble reaching that level of performance. Thanks
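
          For comparing numbers, here’s a minimal sketch of how one might measure generation speed against a local Ollama server (assumes the default port 11434 and a model you’ve already pulled; the tag below is just a placeholder):

          ```python
          # Rough tokens/sec measurement via Ollama's /api/generate endpoint.
          # eval_count is the number of generated tokens, eval_duration is in nanoseconds.
          import json
          import urllib.request

          payload = {
              "model": "llama2:13b",  # placeholder tag; use whatever `ollama list` shows
              "prompt": "Explain the difference between VRAM and system RAM.",
              "stream": False,
          }
          req = urllib.request.Request(
              "http://localhost:11434/api/generate",
              data=json.dumps(payload).encode(),
              headers={"Content-Type": "application/json"},
          )
          with urllib.request.urlopen(req) as resp:
              result = json.load(resp)

          tokens_per_sec = result["eval_count"] / (result["eval_duration"] / 1e9)
          print(f"{tokens_per_sec:.1f} tokens/sec")
          ```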

        • levzzz@lemmy.world · 1 month ago

          You need a pretty large context window to fit all the reasoning; Ollama forces 2048 tokens by default, and a larger window uses more memory.
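
          If it helps, here’s a minimal sketch of raising that limit per request through Ollama’s HTTP API with the num_ctx option (the model tag is a placeholder; keep in mind a bigger window also means a bigger KV cache in VRAM):

          ```python
          # Request a larger context window for one generation via Ollama's num_ctx option.
          import json
          import urllib.request

          payload = {
              "model": "deepseek-r1:14b",    # placeholder reasoning-model tag; substitute your own
              "prompt": "Think step by step: what is 17 * 23?",
              "stream": False,
              "options": {"num_ctx": 8192},  # raise the 2048-token default to fit long reasoning chains
          }
          req = urllib.request.Request(
              "http://localhost:11434/api/generate",
              data=json.dumps(payload).encode(),
              headers={"Content-Type": "application/json"},
          )
          with urllib.request.urlopen(req) as resp:
              print(json.load(resp)["response"])
          ```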