- by
- 01 30, 2025
Loading
With the AIAIAIAGIAIAVAIAIAIAGIRAIGPTAIRRAIAI release of its latest artificial-intelligence () model, , an obscure Chinese firm, has laid waste to several years of American policy meant to hold back Chinese innovation—and, in the process, blown a hole in the valuations of companies from , America’s chip champion, to Siemens Energy, a manufacturer of electrical equipment used in data centres. In demonstrating its ability to innovate around American export restrictions, DeepSeek has raised doubts as to whether access to piles of cutting-edge semiconductors and related equipment is as important as previously thought when it comes to training models.The man at the centre of it all is Liang Wenfeng, DeepSeek’s 40-year-old founder. It is unclear how much he has relished the global market turmoil he has unleashed. A high-school classmate who recently spoke to local media said Mr Liang is hiding out in his home town for the lunar new year, which started on January 29th. Playfully mocked on Chinese social media for his skinny, pale appearance, Mr Liang remains a mystery to most people. Those who have had professional dealings with DeepSeek say he is obsessed with human-like artificial general intelligence () and the impact it could have on the world. In his pursuit of it, DeepSeek’s founder is upending ideas about technological progress both in the West and China.Public information on Mr Liang is scant. Born into a family of teachers in an impoverished village near the southern city of Zhanjiang in 1985, he was a gifted student. A former instructor claimed he mastered university-level maths in middle school. In 2002 he gained entry into an electronic-information degree at Zhejiang University, a prestigious school in the eastern Chinese city of Hangzhou. A master’s degree at the same university, under a well-known machine-vision scientist, exposed him to the field of .At the time, Hangzhou was a bustling hub for internet technology and home to rising groups such as Alibaba, an e-commerce firm. Mr Liang and several classmates remained in the city and began experimenting with quantitative investing models, which do not rely on company fundamentals but on crunching reams of data. In 2013 Mr Liang and three classmates launched an investment group called Yakebi in an attempt to monetise the trading models they had built.Two years later Mr Liang co-founded High-Flyer, a quantitative hedge-fund that grew rapidly alongside dozens of similar firms during a period of deregulation and market volatility in China. In 2021 it claimed to be managing as much as 100bn yuan ($14bn), though it appears to have rapidly shrunk in size in the latter half of that year. Quant funds have routinely tussled with Chinese regulators, who view them as profiting from market routs. Industry insiders say High-Flyer made a name for itself as one of the most aggressive quant funds, regularly drawing the ire of securities regulators.DeepSeek’s origins lie in an effort to improve High-Flyer’s algorithms. In 2019 the group invested 200m yuan to set up a separate unit to develop its own deep-learning platform, called “Fire-Flyer 1”. The fund poured 1bn yuan into the effort in 2021 in order to launch a second iteration armed with 10,000 of Nvidia’s 100 graphics processing units. This made High-Flyer an outlier: at the time just four other firms in China held such large arsenals of powerful chips, all of which were tech giants such as Alibaba. DeepSeek was made a standalone company in 2023.It delivered its first jolt to the market in May last year, when it released an ultra-cheap chatbot based on its 2 model. That kicked off a price war in China’s industry, forcing the country’s biggest tech firms—Alibaba, Baidu, ByteDance and Tencent—to lower their own prices.By Mr Liang’s own telling, this was not a ploy to capture more users. In July he said costs had fallen as DeepSeek explored new model structures, something that set it apart from others. Although rival Chinese groups have been conducting research into models, their disadvantage in computing power, owing to American export restrictions, has led them to focus more on creating clever applications that use the technology. Many Chinese groups have used Llama, the family of large language models developed by Meta, America’s social-media titan, as a basis for their applications.For Mr Liang, developing models using less computing power is an essential step in pursuit of his longer-term objective. “Our goal is , which requires us to explore new model structures to achieve superior capabilities within limited resources,” he has told local media.DeepSeek’s new 1 model, which has shocked the West, suggests it is making progress. The company says it cost less than $6m to train, a tiny fraction of comparable models from the likes of Open, maker of Chat. Sam Altman, Open’s boss, has called 1 “impressive” (though he has also promised to produce “much better models”, adding that it is “legit invigorating to have a new competitor”).DeepSeek certainly has its doubters. Early testing seems to confirm that 1 is as powerful as its maker says it is. But some have questioned whether the firm has underplayed the number of high-end chips it used to develop the model, even if others argue its claims are plausible. There is also speculation that DeepSeek has trained its models by studying the results of American ones, a process known as “distillation”. Open has said it has evidence that points to DeepSeek distilling its models, in violation of its terms of service.If, however, DeepSeek’s is truly efficient, as many believe it is, then it is an affront both to American tech prowess and China’s state-led innovation model. Chinese state media has been quick to champion DeepSeek as a national asset in the country’s fight for supremacy. Mr Liang was invited to meet with Li Qiang, China’s premier, on January 20th, alongside a handful of other entrepreneurs.Yet as Zhang Zhiwei of Pinpoint Asset Management, an investment firm, points out, DeepSeek’s achievements did not emerge from one of China’s myriad government-backed research institutes or state-controlled companies. Mr Liang appears to control most of the shares in DeepSeek, and has steered clear of China’s state-dominated venture-capital industry.Mr Liang views China’s role over the past 30 years as that of a technological “follower”, building on foundations developed in the West. The gap between America and China is between “originality and imitation”, Mr Liang said in an interview with local media in July. Nvidia’s success, he argues, has not relied solely on its own performance, but also on technological collaboration among Western companies. China’s efforts to imitate Western computing power have fallen short, in his view, because it lacks this type of collaboration, despite a capital-intensive state-led effort to create one. DeepSeek may not be a wake-up call only for the West, but also for China’s leaders in Beijing.