Skip to content

correct typo "manifolds markets" to "manifold markets" #2976

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 21, 2025
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion futurebench.md
Original file line number Diff line number Diff line change
Expand Up @@ -40,7 +40,7 @@ Forecasting makes **contamination impossible by design**, as you can't train on

Perhaps most importantly, predictions about the future are **inherently verifiable**. We can wait and see who was right, creating an objective, time-stamped measure of model performance.

We therefore propose evaluating agents on their ability to predict future events (Ye et al., 2024; Karger et al., 2025). **FutureBench** draws from real-world prediction markets and emerging news to create interesting prediction tasks grounded in actual future outcomes. We collect events from platforms and live news coverages and manifolds markets, filtering them to focus on emerging events worth predicting. Using an agent-based approach, we curate scenarios that require genuine reasoning rather than simple pattern matching. Think geopolitical developments, market movements, or technology adoption trends - events where informed analysis actually matters.
We therefore propose evaluating agents on their ability to predict future events (Ye et al., 2024; Karger et al., 2025). **FutureBench** draws from real-world prediction markets and emerging news to create interesting prediction tasks grounded in actual future outcomes. We collect events from platforms and live news coverages and manifold markets, filtering them to focus on emerging events worth predicting. Using an agent-based approach, we curate scenarios that require genuine reasoning rather than simple pattern matching. Think geopolitical developments, market movements, or technology adoption trends - events where informed analysis actually matters.

## Can Agents Predict Future Events?
This is the obvious question, and it's at the heart of what makes this benchmark interesting! We believe the answer cannot be a simple “yes” or a “no”, as it mostly depends on the actual questions; there are always important caveats to consider.
Expand Down