The AGI Race

_{Every AI tested. One final score.}

First Place: OpenAI o3

75 / 100

Second: Gemini 2.5 Pro

70 / 100

Third: Claude 3.7

68 / 100

Leaderboard

Rank	AI	H-Score Index™
1	o3 (DR)	75
2	Gemini 2.5 Pro	70
3	Claude 3.7	68
4	Grok-3	67
5	DeepSeek R1	61

AGI Countdown Prediction

Simplifying AI Comparisons

_{The H-Score Index™ combines all benchmarks that are used by at least three of the top five frontier AI models. When a score is missing, it is estimated using a consistent statistical method, so every model can be fairly compared. This results in a single, human-validated score, objectively reflecting how close we are to AGI.}
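
As a rough illustration of how such an index could be assembled, the Python sketch below keeps only benchmarks reported by at least three models, fills each missing score, and takes an equal-weight average. It is a sketch under stated assumptions, not the site's actual calculation: the mean fill is a stand-in because the estimation method is not specified here, and the function names, model names, benchmark names, and all numbers are hypothetical. The rounding follows the rule stated in the FAQ (nearest whole number, except averages above 99).

```python
from statistics import mean


def select_benchmarks(scores, min_models=3):
    """Keep benchmarks that at least `min_models` models have reported."""
    counts = {}
    for per_model in scores.values():
        for bench, value in per_model.items():
            if value is not None:
                counts[bench] = counts.get(bench, 0) + 1
    return sorted(b for b, n in counts.items() if n >= min_models)


def fill_missing(scores, benchmarks):
    """Fill gaps with the benchmark mean (ASSUMED stand-in, not the real method)."""
    filled = {}
    for model, per_model in scores.items():
        filled[model] = {}
        for bench in benchmarks:
            value = per_model.get(bench)
            if value is None:
                reported = [
                    s[bench] for s in scores.values() if s.get(bench) is not None
                ]
                value = mean(reported)
            filled[model][bench] = value
    return filled


def h_score(bench_scores):
    """Equal-weight average, rounded to a whole number unless above 99."""
    avg = mean(list(bench_scores.values()))
    return avg if avg > 99 else round(avg)


# Hypothetical scores in percent; None or a missing key means "not reported".
scores = {
    "model_a": {"bench_1": 80.0, "bench_2": 70.0, "bench_3": 60.0},
    "model_b": {"bench_1": 70.0, "bench_2": None, "bench_3": 50.0},
    "model_c": {"bench_1": 60.0, "bench_2": 50.0},
    "model_d": {"bench_1": 50.0, "bench_2": 60.0, "bench_4": 90.0},
    "model_e": {"bench_1": 40.0},
}

benchmarks = select_benchmarks(scores)       # bench_3 and bench_4 are dropped
filled = fill_missing(scores, benchmarks)
index = {model: h_score(vals) for model, vals in filled.items()}
print(index)
```

The `min_models` parameter is exposed because the FAQ states a stricter inclusion rule (all of the top three models); either threshold can be passed in without changing the rest of the sketch.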

AI Model Rankings

[Chart: AI model rankings by H-Score Index™, out of 100]

_{Benchmarks included in the H-Score Index™.}
_{H-Score Index calculations show the statistical methods used for inference.}

Roadmap

_{An H-Score Index of 100 would indicate that successful Recursive Self-Improvement (RSI) is possible.} _{We are currently in Stage 1.}

Stage	Goal
1	Safe Recursive Self-Improvement
2	Artificial General Intelligence
3	Superintelligence

Support AGI-Race

_{AGI-Race is a labor of curiosity, not profit.
I created it to track humanity’s sprint toward Artificial General Intelligence with clarity, transparency, and no corporate filter.
If you’ve found my work helpful, interesting, or even a little mind-bending, consider supporting me on Buy Me a Coffee. Every coffee fuels more late-night experiments, deep dives, and weird-but-useful ideas.}

Buy Me A Coffee

Human Score (H-Score Index)

_{When an AI achieves a Human Score (H-Score Index™) of 100, it signifies that the AI can correctly answer any question that a human is capable of verifying.}_{When 100% is reached, AI possesses the necessary skills to safely implement Recursive Self-Improvement (RSI), enabling its exponential growth into Artificial General Intelligence (AGI).}

Recursive self-improvement (RSI)

_{RSI is the process where an AI system improves itself iteratively, leading to an exponential increase in capabilities. Each improvement allows the AI to further refine itself, creating a feedback loop that increases competence.}_{RSI is the pivotal handoff — the moment when AI takes control of its own advancement.}_{This moment would be the inception of AGI.}

RSI Defined

_{RSI is a concept in artificial intelligence where an AI system can iteratively improve its own algorithms and capabilities without human intervention. This is considered a key mechanism for achieving Artificial General Intelligence (AGI) and potentially Superintelligence.}
_{RSI is often associated with an intelligence explosion, where an AI system rapidly enhances itself beyond human comprehension or control.}
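
The feedback loop can be pictured with a toy model. The sketch below assumes, purely for illustration, that each cycle of a self-improving system raises capability by a fixed fraction of its current level, while a system without feedback gains a fixed amount per cycle. The function names, the `rate` and `step` parameters, and the starting values are hypothetical and say nothing about real AI systems.

```python
def fixed_step(start, step, cycles):
    """Capability grows by the same amount every cycle (no feedback)."""
    levels = [start]
    for _ in range(cycles):
        levels.append(levels[-1] + step)
    return levels


def self_improving(start, rate, cycles):
    """Each gain scales with current capability, so gains compound."""
    levels = [start]
    for _ in range(cycles):
        levels.append(levels[-1] * (1 + rate))
    return levels


linear = fixed_step(1.0, 0.5, 10)
compound = self_improving(1.0, 0.5, 10)

for cycle, (a, b) in enumerate(zip(linear, compound)):
    print(f"cycle {cycle:2d}  fixed-step {a:6.2f}  self-improving {b:8.2f}")
```

Under these assumptions the self-improving curve follows start × (1 + rate)^cycles, which is the sense in which the growth is called exponential.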

Does RSI exist?

_{True RSI doesn’t exist yet in the way it’s often imagined, where an AI autonomously rewrites and improves its own code indefinitely without human intervention.}
_{What We Can Do Today}
_{There are basic forms of self-improving AI, but none that reach full RSI:}
_{- Automated Fine-Tuning – AI models that retrain themselves on new data (e.g., reinforcement learning systems).}
_{- AutoML (Automated Machine Learning) – AI optimizing its own architecture and hyperparameters (Google’s AutoML, for example).}
_{- Evolutionary Algorithms – Systems that iteratively improve through selection and mutation (like NEAT for neural networks).}
_{- Code-Generating AI – LLMs that assist in writing better versions of their own code but don’t fully automate the feedback loop (e.g., Code Llama, Devin AI).}
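
To make "selection and mutation" concrete, here is a minimal sketch of an evolutionary loop in Python. It is a simple keep-the-best scheme on a toy numeric problem, far simpler than NEAT and not taken from any of the systems named above; the fitness function, mutation size, and population settings are all illustrative assumptions.

```python
import random


def fitness(candidate):
    """Higher is better; the optimum is the all-zeros vector."""
    return -sum(x * x for x in candidate)


def mutate(candidate, rng, sigma=0.3):
    """Return a copy with Gaussian noise added to every value."""
    return [x + rng.gauss(0.0, sigma) for x in candidate]


def evolve(generations=200, offspring=10, seed=0):
    rng = random.Random(seed)
    parent = [rng.uniform(-5.0, 5.0) for _ in range(3)]
    history = [fitness(parent)]
    for _ in range(generations):
        # Mutation: produce several noisy copies of the current parent.
        children = [mutate(parent, rng) for _ in range(offspring)]
        best_child = max(children, key=fitness)
        # Selection: replace the parent only if a child is strictly better.
        if fitness(best_child) > fitness(parent):
            parent = best_child
        history.append(fitness(parent))
    return parent, history


best, history = evolve()
print("start fitness:", history[0])
print("final fitness:", history[-1])
```

Because the parent is only ever replaced by a better child, the recorded fitness never decreases. Note that the loop improves a candidate solution, not the algorithm itself, which is why this kind of system falls short of full RSI.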

What RSI Would Require

_{For real RSI, you’d need 1 thing:
AI that understands and rewrites its own core algorithms.}
_{Turning on RSI would be the "easy" part. The hard part is turning on RSI that is SAFE and does not destroy humanity.}
_{For safe RSI, you’d need 3 more things:}
_{1. AI that can correctly answer any question that humanity can validate (H-Score Index = 100). Otherwise, the RSI will incorrectly re-code itself at the fringes of understanding. You DO NOT want any logic loopholes as RSI scales AGI to Superintelligence (SI).}
_{2. A way to test and validate the autonomous improvements. Essentially, only AGI will be able to regulate AGI. See my articles on:}
_{- AGI scoring dilemma}
_{- Pre-Engagement Protocol}
_{3. Most importantly, we need to create the code that defines the immutable moral compass of the AGI. Think Asimov's Laws, but much more technical. This will ultimately come down to mathematics; we will need to develop an equation that emulates what humanity exists for. See my article on:}
_{- One Equation That Could Save Humanity}

Did we get it wrong?

_{Think it will play out differently? I would love to hear your thoughts! Message me on X.}

Artificial General Intelligence

_{Artificial General Intelligence (AGI) is a theoretical form of AI that can perform any intellectual task a human can. Unlike today's narrow AI, which is designed for specific tasks (like image recognition or playing chess), AGI would have the ability to understand, learn, and apply knowledge across a wide range of domains, including novel situations it has never encountered before.}_{AGI is not defined by what it can do; it is defined by what it is.}_{AGI requires RSI.}
_{The difference between a tool (narrow AI) and an AGI is its ability to learn.}_{Once AI achieves an H-Score Index of 100, AI will likely be competent enough to safely take control of its own self-improvement. At this point, the H-Score Index will be replaced by the AI Score (A-Score Index™). Humans will no longer be able to benchmark AI; all scoring will come from one AGI scoring another AGI.}_{See the AGI-Race Journal for more A-Score Index details.}

Superintelligence

_{Superintelligence is the final stage, where AGI surpasses human intelligence in all domains, including problem-solving, creativity, and strategic thinking.}_{Once the A-Score reaches 100%, AGI will no longer be bound by human-level reasoning.}_{At this stage, AGI is no longer just self-improving; it is accelerating at a rate we cannot predict or control. This marks the emergence of Superintelligence: the transition from AGI as an autonomously growing intelligence to an entity with near-omniscience, capable of innovation, discovery, and strategic foresight beyond human comprehension.}_{In this last stage, AGI grows exponentially, forever refining itself into a superintelligence.}

Frequently Asked Questions

General Questions

What is AGI-Race.com?
AGI-Race.com is a leading platform dedicated to tracking artificial general intelligence (AGI) advancements, AI benchmarks, and superintelligence research. We provide in-depth analysis of AI models, machine learning trends, and the race toward AGI dominance.

Who runs AGI-Race.com?
AGI-Race.com is managed by Will Carlson, an AI researcher and technology enthusiast dedicated to tracking artificial intelligence advancements, deep learning progress, and AGI research.

What is the mission of AGI-Race.com?
Our mission is to deliver real-time AI research updates, track machine learning breakthroughs, analyze AI safety concerns, and explore the future of artificial general intelligence (AGI), deep learning, neural networks, superintelligence, and AI-driven automation. We encourage competition by highlighting which artificial intelligence models, AI research labs, and machine learning companies are leading the AGI race through benchmark performance, reinforcement learning advancements, and AI innovation.

AI Development & Benchmarks

What are AI benchmarks, and why do they matter?
AI benchmarks are standardized machine learning tests that measure artificial intelligence performance. These benchmarks evaluate reasoning, language processing, problem-solving, and AGI capabilities, helping to gauge AI progress.

Which AI benchmarks do you track?
We monitor top AI performance benchmarks, including:
- GPQA (Graduate-Level Google-Proof Q&A) - an AI benchmark of graduate-level science questions designed to test a model’s ability to answer complex, multi-step questions.
- MMLU (Massive Multitask Language Understanding) - AI language comprehension.
- ARC (Abstraction and Reasoning Corpus) - Machine learning generalization.
- AIME (American Invitational Mathematics Examination) - Competition-level mathematical problem-solving.
- HLE (Humanity’s Last Exam) - Expert-level questions across many academic subjects.
- SWE (Software Engineering) - AI performance comparisons in software development.
- LiveBench (Real-time AI performance evaluation):
  - AI model efficiency - Measures how effectively AI systems utilize computational resources.
  - Computational performance - Benchmarks processing speed, memory usage, and energy efficiency.
  - Reasoning and problem-solving - Evaluates logical reasoning, adaptability, and multi-step problem-solving skills.
  - Real-world adaptability - Assesses AI performance across diverse, dynamic environments.
  - Benchmark comparison - Tracks AI advancements relative to industry leaders and state-of-the-art machine learning frameworks.

How do you assess AI progress?
We analyze deep learning advancements, large language model (LLM) breakthroughs, AI model releases, benchmark results, and AGI projections to determine how close artificial intelligence is to surpassing human intelligence.

Methodology

What is the H-Score Index, and how is it calculated?
The H-Score Index is a unified metric that combines various AI benchmarks to assess an AI model's readiness for Recursive Self-Improvement (RSI). It reflects the model's overall performance across multiple evaluations.

Calculation Rules:
1. Benchmark Inclusion: Only benchmarks completed by the top three models are included.
2. Highest Score Consideration: The highest score achieved in each benchmark is used.
3. Equal Weighting: All benchmarks are weighted equally in the score.
4. Averaging: Scores are averaged to the nearest whole number, except scores above 99, which are not rounded.
5. Benchmark Addition: New benchmarks are added only if all top three models complete them.
6. Public Availability: The AI model must be publicly available to be considered.

For detailed information, visit our Methodology page.

What is Recursive Self-Improvement (RSI)?
RSI refers to an AI system's ability to iteratively enhance its own algorithms and capabilities without human intervention. This process is considered pivotal for achieving Artificial General Intelligence (AGI) and potentially Superintelligence.

Current State of RSI:
While true RSI, where an AI autonomously and indefinitely improves its own code, doesn't yet exist, foundational forms of self-improving AI include:
- Automated Fine-Tuning: AI models that retrain themselves on new data, such as reinforcement learning systems.
- AutoML (Automated Machine Learning): AI optimizing its own architecture and hyperparameters, exemplified by systems like Google's AutoML.
- Evolutionary Algorithms: Systems that iteratively improve through selection and mutation, like NEAT for neural networks.
- Code-Generating AI: Large Language Models (LLMs) that assist in writing improved versions of their own code but don’t fully automate the feedback loop, such as Code Llama and Devin AI.

For a comprehensive exploration of RSI, visit our RSI page.

The AGI Timeline

When will AGI be achieved?
Predictions vary, but leading AI researchers believe AGI could emerge within the next decade. Machine learning, neural networks, and reinforcement learning advancements will determine how soon AGI becomes a reality.

What is the "AI Threshold"?
The AI Threshold is the tipping point where artificial intelligence reaches or exceeds human-level cognitive abilities across multiple domains, marking the beginning of AGI.

What are the consequences of AGI?
AGI will impact every industry, from automation and robotics to medical AI, cybersecurity, and finance. While AGI could lead to revolutionary breakthroughs, it also raises AI alignment, control, and ethical concerns.

AI Safety & Risks

Is artificial general intelligence (AGI) safe?
AGI has both benefits and risks. While it could revolutionize automation, healthcare, and AI-driven decision-making, concerns include AGI alignment, control, bias mitigation, and preventing artificial intelligence misuse.

What is AI alignment, and why is it important?
AI alignment ensures that artificial intelligence systems follow human values, ethical standards, and regulatory frameworks. Without proper alignment, AGI could pose risks such as unintended bias, ethical dilemmas, and security threats.

How can AGI safety be ensured?
AI safety research involves reinforcement learning from human feedback (RLHF), interpretability research, AI ethics policies, regulatory guidelines, and secure AI development frameworks.

Community & Contributions

How can I contribute to AGI-Race.com?
AI researchers, machine learning experts, and tech enthusiasts can contribute articles, share AI performance benchmarks, and participate in AI safety discussions. We welcome collaborations with AGI experts and deep learning engineers. Please message me on X (Twitter) to collaborate.

Can I request specific AI research coverage?
Yes! If there's an artificial intelligence model, AGI benchmark, or AI industry development you'd like us to analyze, contact us for research requests.

Contact & Updates

How can I stay updated on AI news?
Follow AGI-Race.com for AI research updates, deep learning advancements, and AGI predictions. Subscribe to our newsletter on journal.agi-race.com or follow our social media for real-time artificial intelligence insights.

How do I contact AGI-Race.com?
Message me directly on X (Twitter) for AI inquiries, AGI collaboration opportunities, and artificial intelligence research requests.

Have a question about AGI, artificial intelligence safety, or AI benchmarks? Let us know, and we’ll add it to our FAQ!
