AI development’s speed stretches risk assessments to breaking point

Artificial intelligence’s complexity exposes flaws in traditional methods used to evaluate its safety and accuracy
Google, Anthropic, Cohere and Mistral have each released AI models over the past two months as they seek to unseat OpenAI from the top of public rankings

The increasing power of the latest artificial intelligence systems is stretching traditional evaluation methods to breaking point, posing a challenge to businesses and public bodies over how best to work with the fast-evolving technology.

Flaws in the evaluation criteria commonly used to gauge performance, accuracy and safety are being exposed as more models come to market, according to people who build, test and invest in AI tools. The traditional tools are easy to manipulate and too narrow for the complexity of the latest models, they said.
