AI benchmarks are broken. Here’s what we need instead.

Source

technologyreview.com (full article)

Publisher summary (verbatim)

For decades, artificial intelligence has been evaluated through the question of whether machines outperform humans. From chess to advanced math, from coding to essay writing, the performance of AI models and applications is tested against that of individual humans completing tasks. This framing is s


