DeepSeek rushes to launch new AI model as China goes all in
BEIJING - DeepSeek, the Chinese startup, triggered a $1 trillion-plus sell-off in global equities markets last month with a cut-price AI reasoning model that outperformed many Western competitors.
"The launch of DeepSeek's R2 model could be a pivotal moment in the AI industry," said Vijayasimha Alilughatta, chief operating officer of Indian tech services provider Zensar. DeepSeek's success at creating cost-effective AI models "would likely spur companies worldwide to accelerate their own efforts ... breaking the stranglehold of the few dominant players in the field," he said.
The initiative enlists children to collect used cooking oil from their homes, which is then filtered and turned into biofuel.They told a story of a company that functioned more like a research lab than a for-profit enterprise and was unencumbered by the hierarchical traditions of China's high-pressure tech industry, even as it became responsible for what many investors see as the latest breakthrough in AI.Liang was born in 1985 in a rural village in the southern province of Guangdong.
"Liang gave us control and treated us as experts. He constantly asked questions and learned alongside us," said 26-year-old researcher Benjamin Liu, who left the company in September. "DeepSeek allowed me to take ownership of critical parts of the pipeline, which was very exciting.
High-Flyer spent 1.2 billion yuan on two supercomputing AI clusters in 2020 and 2021. The second cluster, Fire-Flyer II, was made up of around 10,000 Nvidia A100 chips, used for training AI models. Beijing now celebrates DeepSeek, but has instructed it not to engage with the media without approval, according to a person familiar with Chinese official thinking.
"The key advantage of vast resources is that it allows for large-scale experimentation," said Liu, the former employee. The MoE technique divides an AI model into different areas of expertise and activates only those related to a query, as opposed to more common architectures that use the entire model.
For now, Western and Chinese tech giants have signaled plans to continue heavy AI spending, but DeepSeek's success with R1 and its earlier V3 model has prompted some to alter strategies.
پاکستان تازہ ترین خبریں, پاکستان عنوانات
Similar News:آپ اس سے ملتی جلتی خبریں بھی پڑھ سکتے ہیں جو ہم نے دوسرے خبروں کے ذرائع سے جمع کی ہیں۔
Chinese universities launch DeepSeek courses to capitalise on AI boomChinese universities launch DeepSeek courses to capitalise on AI boom
مزید پڑھ »
DeepSeek to share some AI model code, doubling down on open sourceDeepSeek to share some AI model code, doubling down on open source
مزید پڑھ »
American AI firms try to poke holes in disruptive DeepSeekSoftware maker Snowflake decided on Monday to add DeepSeek models to its AI model marketplace
مزید پڑھ »
Chinese startup launches DeepSeek R1 to compete with OpenAIChinese startup DeepSeek has unveiled its latest innovation, DeepSeek R1, an advanced AI model positioned as a strong rival to OpenAI's
مزید پڑھ »
Chinese AI startup DeepSeek overtakes ChatGPT on Apple App StoreChinese AI startup DeepSeek overtakes ChatGPT on Apple App Store
مزید پڑھ »
Trump and Nvidia CEO discuss DeepSeek, AI chip exports during meetingTrump and Nvidia CEO discuss DeepSeek, AI chip exports during meeting
مزید پڑھ »