Skip to main content

AI benchmarks are broken. Here’s what we need instead.

Technology
Global
Started April 01, 2026

For decades, artificial intelligence has been evaluated through the question of whether machines outperform humans. From chess to advanced math, from coding to essay writing, the performance of AI models and applications is tested against that of individual humans completing tasks. This framing is seductive: An AI vs. human comparison on isolated problems with clear…

Need to find a specific claim? Search all statements.
🗳️ Join the conversation
5 statements to vote on • Your perspective shapes the analysis
📊 Progress to Consensus Analysis Need: 7+ participants, 20+ votes, 3+ votes per statement
Participants 0/7
Statements (7+ recommended) 5/7
Total Votes 0/20
💡 Progress updates live here. Final readiness is confirmed when all three requirements are met.

Your votes count

No account needed — your votes are saved and included in the consensus analysis. Create an account to track your voting history and add statements.

CLAIM Posted by will Apr 01, 2026
Maintaining human-centric benchmarks is essential for ensuring AI systems remain accountable and aligned with human values.
Vote options for this statement: agree, disagree, or unsure
Vote to see results
CLAIM Posted by will Apr 01, 2026
Current benchmarks, despite their flaws, provide a familiar framework for understanding AI advancements and should not be discarded entirely.
Vote options for this statement: agree, disagree, or unsure
Vote to see results
CLAIM Posted by will Apr 01, 2026
Shifting focus from human comparison to task efficiency could drive innovation and prioritize AI's unique strengths.
Vote options for this statement: agree, disagree, or unsure
Vote to see results
CLAIM Posted by will Apr 01, 2026
Redefining AI performance metrics could lead to misinterpretation of capabilities, potentially causing public mistrust in AI technologies.
Vote options for this statement: agree, disagree, or unsure
Vote to see results
CLAIM Posted by will Apr 01, 2026
AI benchmarks should evolve beyond human comparisons to better reflect real-world applications and collaborative potential.
Vote options for this statement: agree, disagree, or unsure
Vote to see results

💡 How This Works

  • Add Statements: Post claims or questions (10-500 characters)
  • Vote: Agree, Disagree, or Unsure on each statement
  • Respond: Add detailed pro/con responses with evidence
  • Consensus: After enough participation, analysis reveals opinion groups and areas of agreement

Society Speaks is open and independent. Your support keeps civic discussion free from advertising and commercial influence.

Support us