Will mechanistic/transformer interpretability [eg Neel Nanda] end up affecting p(doom) more than 5%? | Manifold

36% chance

This question is managed and resolved by Manifold.

Related questions

Will mechanistic interpretability have more academic impact than representation engineering by the end of 2025?

Will agent foundations [eg Scott Garrabrant] end up affecting p(doom) more than 5%?

Will mechanistic interpretability be essentially solved for GPT-4 before 2030?

Will mechanistic interpretability be essentially solved for the human brain before 2040?

Will janus/@repligate meaningfully affect p(doom) by more than 5%?

Will manifold markets meaningfully affect p(doom) by more than 3%?

Will davidad meaningfully affect p(doom) by more than 3%?

Will MIRI meaningfully affect p(doom) by more than 5%?

Will mechanistic interpretability be essentially solved for GPT-3 before 2030?

Will mechanistic interpretability be essentially solved for GPT-2 before 2030?
