nondistilled

1 Articles
SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips
Tech

SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips

SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips The SN40L RDU chip is reportedly 3X faster, 5X more efficient than GPUs...