DeepMind Proposes Cognitive Framework to Measure AGI Progress
- •Google DeepMind introduces 'A Cognitive Taxonomy' to empirically evaluate artificial general intelligence capabilities.
- •Framework identifies 10 core cognitive abilities including metacognition, social cognition, and executive functions.
- •Kaggle hackathon launched with $200,000 prize pool to develop benchmarks for the hardest-to-measure AI traits.
Google DeepMind is addressing one of the most elusive questions in the field: how do we actually know when we have achieved Artificial General Intelligence (AGI)? While current benchmarks often focus on specific tasks or narrow datasets, DeepMind’s new paper, "Measuring Progress Toward AGI: A Cognitive Taxonomy," suggests shifting the focus toward the fundamental cognitive processes that underpin human intelligence.
The framework outlines ten critical abilities—ranging from logical reasoning to social cognition and metacognition (thinking about one's own thinking)—to create a more holistic map of AI progress. By comparing machine performance against a representative sample of human baselines, researchers aim to move past anecdotal assessments of AI and toward a rigorous, scientific standard for general intelligence.
To turn this theoretical framework into a practical reality, DeepMind has partnered with Kaggle to launch a $200,000 hackathon. This initiative specifically targets five areas where current evaluation methods are weakest, such as executive functions and social understanding. This collaborative approach invites the broader community to build the next generation of benchmarks, ensuring that as models grow more complex, our ability to measure their true capabilities keeps pace.