DeepSeek’s R1 AI model competes with OpenAI’s o1 reasoning model across math, coding, and science on an even playing field at 3% of the cost.
Some AI researchers hailed DeepSeek’s R1 as a breakthrough on the same level as DeepMind’s AlphaZero, a 2017 model that became superhuman at the board games Chess and Go by purely playing against itself and improving, rather than observing any human games.
The Chinese startup DeepSeek released an AI reasoning model that appears to rival the abilities of a frontier model from OpenAI, the maker of ChatGPT.
The announcement confirms one of two rumors that circled the internet this week. The other was about superintelligence.
DeepSeek's friendly whale hearkens back to a more playful era of tech branding—and it might just be the disruptor the AI industry needs.
Hedge fund manager and entrepreneur Liang Wenfeng built an AI model on a tight budget despite US attempts to halt China’s high-tech ambitions.
Chinese artificial intelligence group’s use of ‘reinforcement learning’ and ‘small language models’ leads to breakthroughs
Researchers are identifying current and future dangers within AI models away from the conflicts of interest they’d face in the industry
Mistral, the French AI lab, is working toward an initial public offering, co-founder and CEO Arthur Mensch said in an interview at Davos.
R1, sent shockwaves through Wall Street, with major tech firms—most notably Nvidia—experiencing sharp stock declines.
This story incorporates reporting fromPC Gamer, BGR, MIT Technology Review, Computerworld, TechRadar and newsbytesapp.com.OpenAI has released Operator, a largely autonomous AI tool designed to execute tasks on the internet based on simple text prompts.
This new approach, based on natural selection, dramatically improves the reliability of large language models for practical tasks like trip planning. Here's how it works.