The Abstraction and Reasoning Corpus (ARC) is a unique benchmark designed to measure AI skill acquisition and track progress toward human-level AI.
Created by François Chollet in 2019 and introduced in his paper “On the Measure of Intelligence”, ARC is designed to measure the gap between machine and human learning. The dataset consists of 1,000 image-based reasoning tasks.
ARC-AGI stands for “Abstraction and Reasoning Corpus for Artificial General Intelligence”, and it aims to measure the efficiency of AI skill acquisition on unknown tasks.
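To make the task format concrete, here is a minimal Python sketch of what an ARC task looks like. The “train”/“test” JSON layout with integer grids (colors 0–9) follows the public ARC repository; the toy task itself and the `solve` function are invented purely for illustration.

```python
import json

# Toy example in the ARC task format: a handful of "train" demonstration
# pairs plus a "test" pair. Grids are lists of lists of integers 0-9,
# where each integer maps to a colour. (The task below is made up.)
toy_task = {
    "train": [
        {"input": [[0, 1], [1, 0]], "output": [[1, 0], [0, 1]]},
        {"input": [[2, 0], [0, 2]], "output": [[0, 2], [2, 0]]},
    ],
    "test": [
        {"input": [[3, 0], [0, 3]], "output": [[0, 3], [3, 0]]},
    ],
}

def solve(grid):
    """Hypothetical solver for this toy task: mirror each row left-right."""
    return [row[::-1] for row in grid]

# Check the rule against the demonstrations, then apply it to the test input.
for pair in toy_task["train"]:
    assert solve(pair["input"]) == pair["output"]
print(json.dumps(solve(toy_task["test"][0]["input"])))
```

The point of the benchmark is that the transformation rule changes from task to task, so a solver has to infer it from just a few demonstration pairs rather than memorize it.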
Now I know what you’re thinking: if AI can’t pass the test, this ARC thing must be pretty hard. Turns out, it isn’t. Most of its puzzles can be solved by a five-year-old.
The benchmark was explicitly designed to compare artificial intelligence with human intelligence. It doesn’t rely on acquired or cultural knowledge. Instead, the puzzles (for lack of a better word) require something that Chollet refers to as ‘core knowledge’. These are things that we as humans naturally understand about the world from a very young age.
Here are a few examples:
1. Objectness: Objects persist and cannot appear or disappear without reason. Objects can interact or not, depending on the circumstances.
2. Goal-directedness: Objects can be animate or inanimate. Some objects are “agents” that have intentions and pursue goals.
3. Numbers & counting: Objects can be counted or sorted by shape, appearance, or movement using basic mathematics like addition, subtraction, and comparison.
4. Basic geometry & topology: Objects can be shapes like rectangles, triangles, and circles, which can be mirrored, rotated, translated, deformed, combined, or repeated (see the sketch after this list). Differences in distance can be detected.
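As a rough illustration of why these priors map so naturally onto code, the snippet below shows how the geometric operations named in point 4 reduce to one-line array manipulations. It uses NumPy purely as an example and is not any official ARC tooling; the sample grid is made up.

```python
import numpy as np

# ARC grids are small integer matrices, so the transformations a solver
# typically needs (mirroring, rotation, repetition, translation) are
# each a single array operation.
grid = np.array([
    [0, 0, 5],
    [0, 5, 0],
    [5, 0, 0],
])

mirrored = np.fliplr(grid)                 # reflect left-right
rotated  = np.rot90(grid, k=-1)            # rotate 90 degrees clockwise
repeated = np.tile(grid, (2, 2))           # repeat the pattern in a 2x2 layout
shifted  = np.roll(grid, shift=1, axis=1)  # translate one cell right (wrapping)

print(mirrored)
print(rotated)
print(repeated.shape)  # (6, 6)
print(shifted)
```

The hard part of ARC is not performing these operations but figuring out, from a few examples, which combination of them a given task calls for.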
As children, we learn experimentally. We learn by interacting with the world, often through play, and that which we come to understand intuitively, we apply to novel situations.
The need arises from the desire to streamline the analysis of exams performed on patients who may present pathologies detectable through imaging studies, assisting the physician in making a faster and more accurate diagnosis.
Researchers at Sakana.AI, a Tokyo-based company, have worked on developing a large language model (LLM) designed specifically for scientific research.
Competing with OpenAI’s ChatGPT Enterprise, Anthropic has now released its own Claude for Enterprise.
Cerebras Systems, known for its innovative Wafer Scale Engine (WSE), has received a mix of feedback regarding its processors, particularly compared to traditional GPUs like those from Nvidia.
FruitNeRF: Revolutionizing Fruit Counting with Neural Radiance Fields
How can we accurately count different types of fruits in complex environments using 3D models derived from 2D images, without requiring fruit-specific adjustments?
Mistral AI has announced that you can now fully customize its models, such as Mistral Large 2 and Codestral.