Back to Discussions

Save Discussion

Sign in to save & get updates.

AI benchmarks are broken. Here’s what we need instead.

Technology

Global

Started April 01, 2026

For decades, artificial intelligence has been evaluated through the question of whether machines outperform humans. From chess to advanced math, from coding to essay writing, the performance of AI models and applications is tested against that of individual humans completing tasks. This framing is seductive: An AI vs. human comparison on isolated problems with clear…

Source Articles

AI benchmarks are broken. Here’s what we need instead.

MIT Technology Review (United States) | Mar 31, 2026

Add Statement Analysis 0/5

Sort by:

Need to find a specific claim? Search all statements.

🗳️ Join the conversation

5 statements to vote on • Your perspective shapes the analysis

📊 Progress to Consensus Analysis Need: 7+ participants, 20+ votes, 3+ votes per statement

Participants 0/7

Statements (7+ recommended) 5/7

Total Votes 0/20

💡 Progress updates live here. Final readiness is confirmed when all three requirements are met.

Your votes count

No account needed — your votes are saved and included in the consensus analysis. Create an account to track your voting history and add statements.

CLAIM Posted by will • Apr 01, 2026

Maintaining human-centric benchmarks is essential for ensuring AI systems remain accountable and aligned with human values.

💬 View Discussion

Be first to respond

Vote to see results

CLAIM Posted by will • Apr 01, 2026

Current benchmarks, despite their flaws, provide a familiar framework for understanding AI advancements and should not be discarded entirely.

💬 View Discussion

Be first to respond

Vote to see results

CLAIM Posted by will • Apr 01, 2026

Shifting focus from human comparison to task efficiency could drive innovation and prioritize AI's unique strengths.

💬 View Discussion

Be first to respond

Vote to see results

CLAIM Posted by will • Apr 01, 2026

Redefining AI performance metrics could lead to misinterpretation of capabilities, potentially causing public mistrust in AI technologies.

💬 View Discussion

Be first to respond

Vote to see results

CLAIM Posted by will • Apr 01, 2026

AI benchmarks should evolve beyond human comparisons to better reflect real-world applications and collaborative potential.

💬 View Discussion

Be first to respond

Vote to see results

💡 How This Works

• Add Statements: Post claims or questions (10-500 characters)
• Vote: Agree, Disagree, or Unsure on each statement
• Respond: Add detailed pro/con responses with evidence
• Consensus: After enough participation, analysis reveals opinion groups and areas of agreement

Society Speaks is open and independent. Your support keeps civic discussion free from advertising and commercial influence.