MANIFOLD
GPT 5.2 Pro METR time horizon
14
Ṁ1.4kṀ9.7k
Mar 31
0.4%
< 2h
0.4%
2h00 - 2h30
0.5%
2h30 - 3h00
0.6%
3h00 - 3h30
0.7%
3h30 - 4h00
0.7%
4h00 - 4h30
1.9%
4h30 - 5h00
3%
5h00 - 5h30
2%
5h30 - 6h00
14%
6h00 - 7h00
18%
7h00 - 8h00
21%
8h00 - 9h00
18%
9h00 - 10h00
19%
Other

This market will resolve to the highest 50% time horizon, as reported by METR, for GPT 5.2 Pro.

Context (roon works at OpenAI, david rein works at METR):

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Left bounds inclusive, right bounds exclusive.

Time horizon could vary based on the set of tasks used to measure it, so this market will be based on the time horizon for the most comprehensive set of tasks reported by METR (as of 2025, largely software and engineering tasks). This will be ambiguous if METR stops publishing time horizons across all of their autonomy tasks and only publishes separate results for different subsets; I might N/A in that scenario.

See also:

/jim/gpt-52-metr

/Bayesian/gpt-52-pro-metr-time-horizon (this market)

/jim/gpt-53-metr-time-horizon

/Bayesian/gemini-3-pro-metr-50-time-horizon

/Bayesian/claude-sonnet-46s-metr-50-time-hori

/Bayesian/claude-opus-46-metr-50-time-horizon

/Bayesian/claude-sonnet-5-metr-50-time-horizo

/Bayesian/claude-opus-5-metr-50-time-horizon

/Bayesian/grok-420s-metr-50-time-horizon

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr

/Bayesian/kimi-k3-thinkings-metr-50-time-hori

  • Update 2026-02-04 (PST) (AI summary of creator comment): This market is about GPT 5.2 Pro (not GPT 5.2 base model).

  • Update 2026-02-18 (PST) (AI summary of creator comment): METR may not evaluate GPT 5.2 Pro for a while. The market will wait for METR's evaluation whenever it occurs (before the close date).

Market context
Get
Ṁ1,000
to start trading!
Sort by:

is market supposed to be resolved?

@MaxLennartson no, hasnt been evaluated yet

sold Ṁ0 YES

@Bayesian Does METR usually evaluate pro models?

@MaxLennartson no and they probably wont for a while. Im just high on hopium

filled a Ṁ208 YES at 94% order

@Bayesian resolve

@jim or um add new buckets or something

@jim is pro

@Bayesian ye i didn't know that was a thing, anyway, new buckets?

© Manifold Markets, Inc.TermsPrivacy