FT Business School

DeepSeek’s ‘aha moment’ creates new way to build powerful AI with less money

Chinese artificial intelligence group’s use of ‘reinforcement learning’ and ‘small language models’ leads to breakthroughs

Chinese AI lab DeepSeek adopted innovative techniques to develop an AI model that was trained with limited human intervention, producing an “aha moment” that could transform what it costs developers to build killer applications based on the technology.

The research paper published on the workings of DeepSeek’s R1 “reasoning” model reveals how the group, led by hedge fund billionaire Liang Wenfeng, has achieved powerful results by removing bottlenecks in AI development. 

The paper shows how DeepSeek adopted a series of more efficient techniques to develop R1, which, like OpenAI’s rival o1 model, generates accurate answers by “thinking” step by step about its responses for longer than most large language models.
