Skip to main content

China's DeepSeek releases long-awaited new AI model

Chinese startup DeepSeek released a new artificial intelligence model with “drastically reduced” costs on Friday, more than a year after it stunned the world with a low-cost reasoning model that matched the capabilities of US rivals.

The AI race has intensified the rivalry between China and the United States, and the White House on Thursday accused Chinese entities of a massive effort to steal artificial intelligence technology.

Hangzhou-based DeepSeek burst onto the scene in January last year with a generative AI chatbot, powered by its R1 reasoning model, that upended assumptions of US dominance in the strategic sector.

The new version, DeepSeek-V4, “features an ultra-long context of one million words”, the company said in a statement on social media platform WeChat, hailing it as “world-leading… with drastically reduced compute (and) memory costs” in a separate announcement on X.

The model’s context length, which determines how much input a model is able to absorb to help it complete tasks, “(achieves) leadership in both domestic and open-source fields across agent capabilities, world knowledge, and reasoning performance”, the WeChat statement said.

A “preview version” of the open source model is now available, the company said.

Experts say V4’s release marks an “inflection point” in terms of hardware and cost.

“This addresses the long-standing issues of slower performance and higher costs associated with long context lengths, marking a genuine inflection point for the industry,” Zhang Yi, the founder of tech research firm iiMedia, told AFP.

“For end users, this will bring widespread, accessible benefits. For instance, if ultra-long context support becomes a standard feature, long-text processing is expected to move beyond high-end research labs and enter mainstream commercial applications,” he said.

The new V4 is released as two versions, DeepSeek-V4-Pro and DeepSeek-V4-Flash, with the latter being “a more efficient and economical choice” because it has smaller parameters.

‘Sputnik moment’

V4-Pro has 1.6 trillion parameters while the V4-Flash has 284 billion parameters, which refine models’ decision-making ability.

The model has also been “optimised” for popular AI Agent products such as Claude Code, OpenClaw, OpenCode, and CodeBuddy, the DeepSeek statement said.

“In world knowledge benchmarks, DeepSeek-V4-Pro significantly leads other open-source models and is only slightly outperformed by the top-tier closed-source model, (Google’s) Gemini-Pro-3.1,” the statement added.

Last year’s so-called “DeepSeek shock” sparked a sell-off of AI-related shares and a reckoning on business strategy in what was also described as a “Sputnik moment” for the industry.

The chatbot performed at a similar level to ChatGPT and other top American offerings, but the company said it had taken significantly less computing power to develop.

However, its sudden popularity raised questions over data privacy and censorship, with the chatbot often refusing to answer questions on sensitive topics such as the 1989 Tiananmen crackdown.

At home, DeepSeek’s AI tools have been widely adopted by Chinese municipalities and healthcare institutions as well as the financial sector and other businesses.

This has been partly driven by DeepSeek’s decision to make its systems open source, with their inner workings public — in contrast to the proprietary models sold by OpenAI and other Western rivals.

But the White House has accused Chinese firms of vying to “steal” American technology, ahead of an expected summit between Donald Trump and Xi Jinping in Beijing next month.

“The US has evidence that foreign entities, primarily in China, are running industrial-scale distillation campaigns to steal American AI,” Trump’s science and technology chief advisor Michael Kratsios said in a post on X.

Distillation is a common practice within AI development, often used by companies to create cheaper, smaller versions of their own models.

DeepSeek’s Friday announcement also came as Meta said it planned to cut a tenth of its staff as it looks for productivity gains from the rest of the workforce while investing heavily in artificial intelligence. Reports said Microsoft was also looking to trim its ranks.



from Dawn - Home https://ift.tt/agcL8up

Comments

Popular posts from this blog

Ailing Pope Francis to embark on Asia trip, his longest ever, in September

Pope Francis will travel to Indonesia, Papua New Guinea, Timor-Leste and Singapore from September 2-13, the Vatican said on Friday, announcing his first overseas trip of the year and the longest of his 11-year papacy. The Asia trip has been on the papal agenda for some time, but there had been doubts on whether the 87-year-old pontiff would embark on it given his increasing frailty, with a record of skipping engagements due to health problems. His last international journey was a two-day stay in Marseille, France in September. In November, he pulled out of a trip to the COP28 climate conference in Dubai because of a lung inflammation . Francis is now scheduled to be in Jakarta between Sept 3-6, Port Moresby and Vanimo between Sept 6-9, Dili September. 9-11 and Singapore Sept 11-13, his spokesman said in a statement. Vietnam, which had been suggested by the pope and Vatican officials as a possible further destination during the nearly two-week long Asia trip, was not mentioned. In ...

‘A war out there’: Maple Leafs survive shootout thriller in Utah

SALT LAKE CITY — Whew. They needed this one, even if they didn’t wholly deserve it. For a Monday night in Salt Lake City, the stakes felt unusually high for the sagging, road-weary Toronto Maple Leafs .  Heading into their inaugural game at Delta Center, the Leafs had dropped three straight, blown a couple leads, slipped out of first place, and  distracted  the fan base by propositioning their best player with a trade.  Worse: Their process hasn’t been tight for a couple weeks. Mistakes have crept in. Speed is giving their defence issues. And their razor-sharp goaltenders have begun to look human. Head coach Craig Berube held an intense team meeting Sunday, following Saturday’s 7-4 outclassing in Denver. Multiple players spoke up. Captain Auston Matthews said they’d reached look-in-the-mirror time. “The really bad games have a good way of being the biggest learning experiences,” thoughtful goaltender Joseph Woll said, following Monday’s slump-snuffing, nail-b...

A diary of (near) default - 2023 was a year of economic uncertainty in Pakistan

Despite having little in common, even our political parties could agree on one thing: Pakistan’s economic situation was dire in 2023. The year saw Pakistan go through a long and rocky road to finding some semblance of economic stability — if it can even be called that — while weathering political and social turmoil. Pakistanis also experienced a double whammy this year: the one-two punches of the worst economic crisis in decades and all-time high inflation. Add to that the gut punch of the aftermath of the catastrophic floods of 2022 began to settle in. Flood victims receive boiled rice from relief workers, after taking refuge on a motorway, following rains and floods during the monsoon season in Charsadda, Pakistan on August 27, 2022 — Reuters In 2023, according to the World Bank , over 39.4 per cent of the population fell below the poverty line, which means over 12.5 million people are living in meagre conditions. Additionally, 8.5 million people face acute food insecurity due ...