Perspectives from the frontier
Updates, insights and stories from the people building Pareto.

LLM Metacognition: Shared and Shallow?
Across 19 frontier models, metacognitive confidence on question-and-answer tasks tracks a shared difficulty heuristic, with only a weak relationship to actual performance.

The bar exam was not designed for this
AI models pass the bar. Credentials weren't built for that, and the methodology to fix them is already being built in post-training.

Confidence needs calibration
Frontier labs spend billions on reasoning and accuracy. Almost nobody trains models to know when to say, "I'm not sure."

You can't prompt your way to safety
AI models are giving medical and mental health advice to millions of people. Can you prevent harmful advice by adding safety instructions to the prompt? The UK's AI Safety Institute (AISI) recently tested this.

90% of human expertise is not verifiable
RLVR's verification crisis exposes a fundamental gap in how AI measures expert judgment across professional domains.

Advancing AI alignment through human-judged LLM debates
Learn how Pareto helped MATS obtain high-quality data for their research.

A Community-Driven Vision for a New Knowledge Resource for AI
Insights from 50+ researchers at an AAAI workshop toward an open engineering framework for knowledge modules in AI.

Annotation fatigue: Why human data quality declines over time
Learn how prolonged annotation tasks lead to fatigue, reduced data quality, and slower output, and discover research-backed strategies Pareto AI uses to keep annotators engaged.

The micro-decisions made by AI trainers that define data quality
Discover how micro-decisions by AI trainers shape data quality, safety, and alignment in LLMs.

The false dichotomy of "synthetic data vs. human data"
We provide actionable strategies for how AI companies can effectively combine synthetic and human data to enhance model performance.