Bob
@Bob88488488
This is the craziest part: NVIDIA uses the B200 (2.5× compute and 2.4× bandwidth) with FP4, yet its throughput is only barely better than DeepSeek using the H800 with FP8 (21k vs. 14.8k). That’s insane.
Rafael Dominguez
@somoscode
·
2 mar.
Imagine what these guys could do with the latest chips! 🤦🏻♂️