H2O.ai Sets New World Record for Most Accurate Agentic AI for Generalized Assistants

Achieving 79.7% accuracy, h2oGPTe sets a new benchmark for enterprise-grade, Sovereign AI across regulated industries.

MOUNTAIN VIEW, Calif.–(BUSINESS WIRE)–H2O.ai, the #1 Agentic AI company, announced that its h2oGPTe Agent has once again achieved the top ranking on the prestigious General AI Assistant (GAIA) benchmark, with a record-setting 79.7% accuracy. The result is fast approaching human-level performance, measured at 92%, and far surpasses general-purpose models from Google and Microsoft, which scored below 50%.

GAIA is a rigorous evaluation framework that measures how effectively AI agents perform across more than 300 real-world tasks, spanning research, data analysis, document handling, and advanced reasoning. It serves as a key indicator of enterprise readiness, assessing whether AI systems can handle the kinds of high-effort, skilled tasks traditionally done by humans. For business leaders, this milestone means h2oGPTe is capable of handling the kinds of nuanced decisions, regulations, and workflows that drive value in banking, telecom, healthcare, and government. H2O.ai first achieved the top GAIA ranking in December 2024 and continues to lead in production-grade, sovereign AI.

The improved performance of H2O.ai’s agent technology comes from key enhancements, including advanced browser navigation for precise information extraction, unified search across multiple sources like Google and Wikipedia, and the integration of Google’s Gemini 2.5 Pro and Anthropic’s Claude 3.7 Sonnet. Additionally, the platform now features GitHub integration for navigating codebases, as well as real-time source attribution that keeps research transparent.

“GAIA is fast becoming the barometer of enterprise intelligence, and at 79.7%, our agents aren’t just accurate, they’re adaptable,” said H2O.ai CEO and Founder Sri Ambati. “Gemini sharpened our vision and multimodal skills, Claude boosted our reasoning and code understanding, and now we’re building toward an auto-agentic future, a framework where planning agents coordinate a series of task-specific power tools. DeepResearch already gave hedge funds an edge in volatile markets, and in today’s shifting geopolitical landscape, scenario planning is not a luxury, it’s a necessity. Delivering all of this, on-prem, inside sovereign AI environments for governments and public institutions, that’s a game changer.”

H2O.ai’s agents are deployed in some of the world’s most highly regulated environments to support mission-critical, task-specific operations. Global banks use them to streamline regulatory reporting and detect fraud, telecom providers use them to optimize call centers, and public agencies use them to manage complex document workflows. H2O.ai offers a growing portfolio of vertical agents (prebuilt for industries like banking, telecom, and government) and a flexible agent builder framework for creating custom agents on private data and internal systems. Because the platform is built on a multi-agent architecture, planning agents can coordinate specialized sub-agents across departments, delivering structure, speed, and scale. With human-in-the-loop review, continuous learning, and auditability built in, H2O agents meet strict compliance needs while accelerating decision-making and ROI.

As enterprises move from AI pilots to production, H2O.ai continues to lead the way with modular, hardware-agnostic solutions that run securely on private clouds, on on-premises infrastructure, or in air-gapped environments.

With GAIA as a clear signal of applied intelligence, H2O.ai stands apart in an increasingly crowded field, proving what’s possible when agentic AI is purpose-built for the enterprise.

For more information about H2O.ai’s capabilities, visit h2o.ai.

About H2O.ai

Founded in 2012, H2O.ai is on a mission to democratize AI. As the world’s leading agentic AI company, H2O.ai converges Generative and Predictive AI to help enterprises and public sector agencies develop purpose-built GenAI applications on their private data. With a focus on Sovereign AI—secure, compliant, and infrastructure-flexible deployments—H2O.ai delivers solutions that align with the highest standards of data privacy and control.

Its open-source technology is trusted by over 20,000 organizations worldwide, including more than half of the Fortune 500. H2O.ai powers AI transformation for companies like AT&T, Commonwealth Bank of Australia, Singtel, Chipotle, Workday, Progressive Insurance, and NIH.

H2O.ai partners include Dell Technologies, Deloitte, Ernst & Young (EY), NVIDIA, Snowflake, AWS, Google Cloud Platform (GCP) and VAST. H2O.ai’s AI for Good program supports nonprofit groups, foundations, and communities in advancing education, healthcare, and environmental conservation. With a vibrant community of 2 million data scientists worldwide, H2O.ai aims to co-create valuable AI applications for all users.

H2O.ai has raised $256 million from investors, including Commonwealth Bank, NVIDIA, Goldman Sachs, Wells Fargo, Capital One, Nexus Ventures and New York Life.

Contacts

H2O.ai

Betty Candel

VP Marketing

betty.candel@h2o.ai
