From 500a96492b24c316f5b57b7908341ced36716fa1 Mon Sep 17 00:00:00 2001 From: CharlesCNorton <135471798+CharlesCNorton@users.noreply.github.com> Date: Thu, 17 Jul 2025 12:16:17 -0400 Subject: [PATCH] correct typo "manifolds markets" to "manifold markets" Remove extra 's' from "manifold" in the sentence about collecting events from platforms and live news coverage. --- futurebench.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/futurebench.md b/futurebench.md index 11f7363428..cbbc12272e 100644 --- a/futurebench.md +++ b/futurebench.md @@ -40,7 +40,7 @@ Forecasting makes **contamination impossible by design**, as you can't train on Perhaps most importantly, predictions about the future are **inherently verifiable**. We can wait and see who was right, creating an objective, time-stamped measure of model performance. -We therefore propose evaluating agents on their ability to predict future events (Ye et al., 2024; Karger et al., 2025). **FutureBench** draws from real-world prediction markets and emerging news to create interesting prediction tasks grounded in actual future outcomes. We collect events from platforms and live news coverages and manifolds markets, filtering them to focus on emerging events worth predicting. Using an agent-based approach, we curate scenarios that require genuine reasoning rather than simple pattern matching. Think geopolitical developments, market movements, or technology adoption trends - events where informed analysis actually matters. +We therefore propose evaluating agents on their ability to predict future events (Ye et al., 2024; Karger et al., 2025). **FutureBench** draws from real-world prediction markets and emerging news to create interesting prediction tasks grounded in actual future outcomes. We collect events from platforms and live news coverages and manifold markets, filtering them to focus on emerging events worth predicting. Using an agent-based approach, we curate scenarios that require genuine reasoning rather than simple pattern matching. Think geopolitical developments, market movements, or technology adoption trends - events where informed analysis actually matters. ## Can Agents Predict Future Events? This is the obvious question, and it's at the heart of what makes this benchmark interesting! We believe the answer cannot be a simple “yes” or a “no”, as it mostly depends on the actual questions; there are always important caveats to consider.