-
'Competitive animal' Messi set for sixth World Cup
-
Spaun hopes grit and grinding brings US Open title repeat
-
Belgium fight back to draw with Egypt in World Cup group game
-
Fearsome France begin World Cup wary of over-confidence
-
Forget losing course: Fitzpatrick wants Shinnecock tough
-
No panic, says De la Fuente after Spain held by Cape Verde
-
Belgium and Egypt draw 1-1 in World Cup group game
-
Vilified Knicks owner Dolan gets some relief with NBA title
-
Clark seeks US Open redemption after smashing Oakmont locker
-
New York classical concerts adapt to growing population with dementia
-
Cape Verde hero Vozinha sheds 'tears of resilience' after stopping Spain
-
England ready to take final step at World Cup, says Saka
-
Trump says Hormuz to 'completely open' after US-Iran peace deal
-
Senegal aim to overcome 'regrettable' absence of fans denied World Cup visas
-
Spain held by tiny Cape Verde at World Cup as Iran make bow
-
US won't need 'much help' on Hormuz, Trump says at G7
-
Toothless Spain held by Cape Verde on World Cup debut
-
With visas denied, Senegal World Cup fans watch from afar
-
Crystal Palace appoint Sage as manager
-
Trump says Strait of Hormuz will be 'completely open' Friday
-
Brazil's Splitter to become new NBA Bulls coach: reports
-
Greed or player health? 'Damaging' World Cup drinks breaks under spotlight
-
Murdochs' Fox to acquire US streaming giant Roku
-
Argentine mining threatens scarce water resources in the Andes
-
Abdullah Ibrahim, world-renowned South African jazz pianist
-
Trump to hold political rally on July 4 to mark US 250th
-
Deschamps points to Spain as team to beat at World Cup
-
Tunisian football bosses mull firing Lamouchi after World Cup thrashing
-
Timeline of Trump-linked resort project in Albania
-
New Zealand need collective effort to replace Williamson: Ravindra
-
IMF chief warns energy recovery to take time after US-Iran ceasefire
-
Lebanese mourn destroyed homes, livelihoods in southern city
-
Amazonian tribal leader Raoni hospitalized in intensive care
-
Trump faces G7 as questions swirl on Iran accord
-
England to give debuts to Cox and Baker against New Zealand
-
France shuts down dozen Israeli stands at defence trade show
-
Launch 3 Telecom Secures New Lakeland Facility
-
England coach McCullum 'worried' about Stokes after curfew incident
-
Sevilla's Mir sentenced to 8.5 years in prison for sexual assault
-
'They want to destroy us': Shock and anger as Russian attack sets Kyiv cathedral ablaze
-
'Start your engines'? Shipping groups wary on Hormuz reopening
-
Deadly Russian strikes set landmark Kyiv monastery ablaze
-
WHO, Lula urge G7 action on finishing pandemic treaty
-
US-Iran deal met with hope, scepticism in Mideast
-
Trump threatens 100% tariff on French wines over digital tax
-
German working-age population to shrink dramatically: study
-
MSF warns of 'dangerous gaps' in Ebola response in DR Congo
-
Three things we learned from the Barcelona Grand Prix
-
Deadly Russian strikes leave landmark Kyiv cathedral in flames
-
Real Madrid confirm Cucurella signing from Chelsea
LLM Consensus Matches or Outperforms the Best AI Models in Expert Evaluation Without Performance Degradation
A multi-model consensus system matches or outperforms GPT-5.4, Claude Opus 4.6 and Gemini 3.1 Pro across 100 expert-level questions infinance, law, medicine and technology, with no performance degradation.
SHERIDAN, WY / ACCESS Newswire / April 2, 2026 / LLM Consensus has released the results of its Expert-Domain Evaluation Benchmark v1.0, an independent study analyzing the performance of its multi-model consensus technology across 100 high-complexity questions in areas such as financial regulation, law, clinical medicine and technical architecture.
According to the results, the system matches or outperforms the best individual AI model across all evaluated questions, achieving measurable improvement in 44.9% of cases and with no instances of performance loss.
Key findings
In nearly half of the questions (45%), responses generated by the consensus system clearly outperformed those of the best individual model. The system was able to identify regulatory details that other models missed, resolve contradictions across sources, and deliver more complete answers.
In the remaining 55%, performance matched that of the best available model, ensuring a consistent baseline of quality without requiring users to choose between different models.
Additionally, in none of the 100 questions analyzed did the system produce a worse result than an individual model.
Performance by domain
The analysis focused on complex questions typical of regulated industries:
Clinical medicine (59% improvement): stronger performance in complex drug interactions, comorbidities, and application of clinical guidelines.
Financial regulation (50% improvement): advantages in scenarios combining multiple European regulatory frameworks such as DORA, PSD2, GDPR, and NIS2.
Legal analysis (44% improvement): greater precision in multi-jurisdictional and cross-regulatory compliance questions.
Technical architecture (30% improvement, 70% match): consistent results in system design decisions under regulatory and technical constraints.
Why it matters
The use of artificial intelligence in regulated industries continues to grow, yet no single model consistently excels across all domains. A system may perform well in financial regulation but fall short in clinical medicine, or vice versa.
LLM Consensus addresses this challenge by combining multiple leading models into a single response. It integrates technologies from OpenAI, Anthropic, Google, Mistral, and Meta, applying a synthesis process with cross-verification that leverages each model's strengths while reducing their weaknesses.
"Reliability is the core value proposition," the company said. "Users no longer have to decide which model to use. They get a single answer that consistently matches or outperforms the best available model for each case."
Evaluation methodology
The benchmark was specifically designed to assess tasks that require combining multiple sources of knowledge. Each question was evaluated by three independent reviewers from different AI providers, who scored responses blindly based on accuracy and quality.
Responses - from both the consensus system and individual models - were presented anonymously and in random order. Cases where sufficient agreement was not reached were classified as inconclusive and excluded from the final results.
The full dataset has been published to enable independent verification.
About LLM Consensus
LLM Consensus is an AI orchestration API that combines multiple advanced models into a single optimized response using patent-pending consensus technology.
The solution is available via REST API with different operating modes and is designed for developers and organizations in regulated sectors such as finance, healthcare, legal, and technology.
Press contact
Francisco Javier Nunez
Email: [email protected]
Web: llmconsensus.io
Patent pending: US 19/215,933 | EU EP25176020.3
This press release contains forward-looking statements based on current benchmark results. The evaluation was conducted using specific model versions as of March 2026; performance may vary with model updates. LLM Consensus is a system benchmark evaluating multi-model orchestration on expert synthesis tasks and should not be interpreted as a general-purpose comparison of individual AI models.
SOURCE: LLM Consensus
View the original press release on ACCESS Newswire
P.Costa--AMWN