SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips

  • SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
  • The SN40L RDU chip is reportedly 3X faster and 5X more efficient than GPUs
  • 5X speed boost is promised soon, with 100X capacity by year-end on cloud

Chinese AI upstart DeepSeek quickly made a name for itself in 2025 with R1, its large-scale open source language model built for advanced reasoning tasks, which performs on par with the industry’s top models while being more cost-efficient.

SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.

