Categories: Business

Cerebras becomes the fastest host in the world for Deepseek R1, going beyond Nvidia GPUs of 57x

Join our daily and weekly newsletters for the latest updates and the exclusive content on AI coverage. Learn more


Cerebras Systems has announced today that it will welcome the Deepseek Breakthrough Artificial R1 model on American servers, promising speeds up to 57 times faster than GPU -based solutions while keeping data sensitive to the ‘interior of American borders. This decision comes in the midst of increasing concerns concerning the rapid AI progress and data confidentiality.

The Puce Start-up AI will deploy a version of 70 billion Deepseek-R1 parameters operating on its owner equipment on a brochure scale, offering 1,600 tokens per second-a spectacular improvement compared to traditional GPU implementations that have Fight with more recent “reasoning” models.

The response times of the main AI platforms, measured in seconds. Cerebras obtains the fastest answer to just over a second, while the Novita system takes almost 38 seconds to generate its first outing – a critical metric for real world applications. (Source: artificial analysis)

Why the Deepseek reasoning models reshape the company AI

“These reasoning models affect the economy,” said James Wang, a Cerebras senior manager in an exclusive interview with Venturebeat. “Any knowledge worker must mainly perform a kind of cognitive tasks in several stages. And these reasoning models will be the tools that enter their workflow. »»

The announcement follows a tumultuous week during which the emergence of Deepseek sparked the greatest loss of market value in Nvidia, nearly $ 600 billion, which raises questions about the supremacy of the AI ​​of the AI ​​of giant chips. The Cerebras solution responds directly to two key concerns that have emerged: the requirements for calculating advanced AI models and data sovereignty.

“If you are using the Deepseek API, which is very popular at the moment, this data is sent directly to China,” said Wang. “It is a strong warning that (fact) many American companies and companies … not willing to consider it (this one).”

CEREBRAS demonstrates advantages of spectacular performance in the output speed, dealing 1,508 tokens per second – almost six times faster than its nearest competitor, GROQ and about 100 times faster than traditional solutions based on GPU like Novita. (Source: artificial analysis)

How the technology of the Cerebras brochure beats traditional GPUs at AI speed

Cerebras obtains its advantage at speed thanks to a new chip architecture which retains whole AI models on a processor the size of a single brochure, eliminating the bottlenecks of memory that afflict GPU -based systems. The company claims that its implementation of Deepseek-R1 matches or exceeds the performance of OPENAI proprietary models, while fully operating on American soil.

Development represents a significant change in the AI ​​landscape. Deepseek, founded by the former director of hedge funds, Liang Wenfeng, shocked the industry by obtaining the reasoning capacities of sophisticated AI which would have been only 1% of the cost of American competitors. Cerebras’ accommodation solution now offers American companies a way to take advantage of these advances while maintaining data control.

“It is actually a great story that American research laboratories have offered this gift to the world. The Chinese have taken it and improved it, but it has limits because it works in China, has censorship problems, and now we take them back and execute them on American data centers, without censorship, without retention of retention data, “said Wang.

Performance references showing Deepseek-R1 operating on Cerebras surpassing both the GPT-4O and the O1-min of Openai through the answers to questions, mathematical reasoning and coding tasks. The results suggest that the development of Chinese AI can approach or exceed American capacities in certain regions. (Credit: Cerebras)

American technological leadership faces new questions as IA innovation becomes global

The service will be available via an overview of the developer from today. Although it is initially free, Cerebras plans to implement API access controls due to high early demand.

This decision comes as American legislators are faced with the implications of the rise of Deepseek, which has exposed potential limits in American commercial restrictions designed to maintain technological advantages compared to China. The capacity of Chinese companies to achieve the power -pierced capacities of AI despite the chip export controls has caused calls to new regulatory approaches.

Industry analysts suggest that this development could accelerate the discrepancy of the IA infrastructure dependent on the GPU. “Nvidia is no longer the leader in inference performance,” noted Wang, pointing benchmarks showing higher performance of various specialized AI chips. “These other IA flea companies are really faster than GPUs to manage these latest models.”

The impact extends beyond technical measures. As the AI ​​models are increasingly incorporating sophisticated reasoning capacities, their calculation requests have skyrocketed. Cerebras maintains that its architecture is better suited to these emerging workloads, potentially reshaping the competitive landscape of the deployment of corporate AI.

remon Buul

Recent Posts

Brutal, “courageous” and relentless: the North Korean troops fighting Ukraine

North KoreaThe soldiers are implacable, almost fanatical, faced with death. They are determined and capable…

3 days ago

Dogecoin Whale Dayt, spark 17% crash: are the bears here for Doge?

The Dogecoin whales have sold another important part of their assets in the last 24…

3 days ago

What Ryan Day said about Chip Kelly leaving Ohio State Football after a season

Columbus, Ohio - The news from Chip Kelly on Sunday leave Ohio State Football to…

3 days ago

Lip reader decodes what Kanye West said to his wife Bianca Censori during the Grammys red carpet appearance 2025

Kanye West and his wife Bianca Censori the exchange during their scandalous appearance on the…

3 days ago

Faced with Trump’s threats to Greenland, the chief of Denmark asks for the support of his EU partners

Brussels (AP) - The Prime Minister of Denmark insisted on Monday that Greenland is not…

3 days ago

The crews recover more victims as efforts continue after the deadly collision of helicopter

Washington (7news) - The United States crews and rescuers have recovered more victims of the…

3 days ago