The Arc Prize Foundation has a new test for AGI that leading AI models from Anthropic, Google, and DeepSeek score poorly on.
A new test of AI capabilities consists of puzzles that humans are able to solve without too much trouble, but which all ...