DeepSeek has gone viral.
Chinese AI lab DeepSeek broke into the mainstream consciousness this week afterย its chatbot app rose to the top of the Apple App Store charts (and Google Play, as well). DeepSeekโs AI models, which were trained using compute-efficient techniques,ย have led Wall Street analystsย โย and technologistsย โ to question whether the U.S. can maintain its lead in the AI race and whether the demand for AI chips will sustain.
But where did DeepSeek come from, and how did it rise to international fame so quickly?
DeepSeekโs trader origins
DeepSeek is backed by High-Flyer Capital Management, a Chinese quantitative hedge fund that uses AI to inform its trading decisions.
AI enthusiast Liang Wenfeng co-founded High-Flyer in 2015. Wenfeng, who reportedly began dabbling in trading while a student at Zhejiang University, launched High-Flyer Capital Management as a hedge fund in 2019 focused on developing and deploying AI algorithms.
In 2023, High-Flyer started DeepSeek as a lab dedicated to researching AI tools separate from its financial business. With High-Flyer as one of its investors, the lab spun off into its own company, also called DeepSeek.
From day one, DeepSeek built its own data center clusters for model training. But like other AI companies in China, DeepSeek has been affected by U.S. export bans on hardware. To train one of its more recent models, the company was forced to use Nvidia H800 chips, a less-powerful version of a chip, the H100, available to U.S. companies.
DeepSeekโs technical team is said to skew young. The company reportedly aggressively recruits doctorate AI researchers from top Chinese universities. DeepSeek also hires people without any computer science background to help its tech better understand a wide range of subjects, per The New York Times.
DeepSeekโs strong models
DeepSeek unveiled its first set of models โ DeepSeek Coder, DeepSeek LLM, and DeepSeek Chat โ in November 2023. But it wasnโt until last spring, when the startup released its next-gen DeepSeek-V2 family of models, that the AI industry started to take notice.
DeepSeek-V2, a general-purpose text- and image-analyzing system, performed well in various AI benchmarks โ and was far cheaper to run than comparable models at the time. It forced DeepSeekโs domestic competition, including ByteDance and Alibaba, to cut the usage prices for some of their models, and make others completely free.
DeepSeek-V3, launched in December 2024, only added to DeepSeekโs notoriety.
According to DeepSeekโs internal benchmark testing, DeepSeek V3 outperforms both downloadable, openly available models like Metaโsย Llama and โclosedโ models that can only be accessed through an API, like OpenAIโs GPT-4o.
Equally impressive is DeepSeekโs R1 โreasoningโ model. Released in January, DeepSeek claims R1 performs as well as OpenAIโsย o1ย model on key benchmarks.
Being a reasoning model, R1 effectively fact-checks itself, whichย helps it to avoid some of theย pitfallsย that normally trip up models. Reasoning models take a little longer โ usually seconds to minutes longer โ to arrive at solutions compared to a typical non-reasoning model. The upside is that they tend to be more reliable in domains such as physics, science, and math.
There is a downside to R1, DeepSeek V3, and DeepSeekโs other models, however. Being Chinese-developed AI, theyโre subject toย benchmarkingย by Chinaโs internet regulator to ensure that its responses โembody core socialist values.โ In DeepSeekโs chatbot app, for example, R1 wonโt answer questions about Tiananmen Square or Taiwanโs autonomy.
A disruptive approach
If DeepSeek has a business model, itโs not clear what that model is, exactly. The company prices its products and services well below market value โ and gives others away for free.
The way DeepSeek tells it, efficiency breakthroughs have enabled it to maintain extreme cost competitiveness. Some experts dispute the figures the company has supplied, however.
Whatever the case may be, developers have taken to DeepSeekโs models, which arenโt open source as the phrase is commonly understood but are available under permissive licenses that allow for commercial use. According to Clem Delangue, the CEO of Hugging Face, one of the platforms hosting DeepSeekโs models, developers on Hugging Face have created over 500 โderivativeโ models of R1 that have racked up 2.5 million downloads combined.
DeepSeekโs success against larger and more established rivals has been described as โupending AIโ and โover-hyped.โ The companyโs success was at least in part responsible for causing Nvidiaโs stock price to drop by 18% in January, and for eliciting a public response from OpenAI CEO Sam Altman.
Microsoft announced that DeepSeek is available on its Azure AI Foundry service, Microsoftโs platform that brings together AI services for enterprises under a single banner. When asked about DeepSeekโs impact on Metaโs AI spending during its first-quarter earnings call, CEO Mark Zuckerberg said spending on AI infrastructure will continue to be a โstrategic advantageโ for Meta.
During Nvidiaโs fourth-quarter earnings call, CEO Jensen Huang emphasized DeepSeekโs โexcellent innovation,โ saying that it and other โreasoningโ models are great for Nvidia because they need so much more compute.
At the same time, some companies are banning DeepSeek, and so are entire countries and governments, including South Korea. New York state also banned DeepSeek from being used on government devices.
As for what DeepSeekโs future might hold, itโs not clear. Improved models are a given. But the U.S. government appears to be growing wary of what it perceives as harmful foreign influence.
TechCrunch has an AI-focused newsletter!ย Sign up hereย to get it in your inbox every Wednesday.
This story was originally published January 28, 2025, and will be updated regulary.


