Chinese AI lab DeepSeek adopted innovative techniques to develop an AI model that was trained with limited human intervention, producing an “aha moment” that could transform the cost for developers to build killer applications based on the technology.
The research paper published on the workings of DeepSeek’s R1 “reasoning” model reveals how the group, led by hedge fund billionaire Liang Wenfeng, has achieved powerful results by removing bottlenecks in AI development.
The paper shows how DeepSeek adopted a series of more efficient techniques to develop R1, which like OpenAI’s rival o1 model, generates accurate answers by “thinking” step by step about its responses for longer than most large language models.