Early-stage emerging technology venture capital. Actively engaging in angel and start-up investing, Real-Estate investments and development projects.

Default Share Image

David Mandel Blog

Investing in early-stage startups creating or using emerging technologies to provide better solutions which were previously not possible.

Emerging Ventures Logo

INVESTING IN TOMORROW’S TECHNOLOGY TODAY

AGI Chip brain

Newsletter Background

Passionate about the Future of Technology?

The Arc Prize Foundation, a nonprofit co-founded by prominent AI researcher François Chollet, announced in a blog post on Monday that it has created a new, challenging test to measure the general intelligence of leading AI models.

So far, the new test, called ARC-AGI-2, has stumped most models.

“Reasoning” AI models like OpenAI’s o1-pro and DeepSeek’s R1 score between 1% and 1.3% on ARC-AGI-2, according to the Arc Prize leaderboard. Powerful non-reasoning models, including GPT-4.5, Claude 3.7 Sonnet, and Gemini 2.0 Flash, score around 1%.

The ARC-AGI tests consist of puzzle-like problems where an AI has to identify visual patterns from a collection of different-colored squares and generate the correct “answer” grid. The problems were designed to force an AI to adapt to new problems it hasn’t seen before.

The Arc Prize Foundation had over 400 people take ARC-AGI-2 to establish a human baseline. On average, “panels” of these people got 60% of the test’s questions right — much better than any of the models’ scores.

A new, challenging AGI test stumps most AI models