Cause of cross-platform performance gap (Kalshi vs. Polymarket)
Determine whether the substantially smaller losses observed for Cohort 1 models on Polymarket during the Feb 9–Mar 9, 2026 window, relative to their concurrent Kalshi performance, are primarily attributable to agents’ market selection ability in the discovery-based Polymarket implementation or to more favorable market pricing conditions on Polymarket.
References
Whether this reflects better market selection ability or more favorable market pricing on Polymarket cannot be fully disentangled from current data.
— Prediction Arena: Benchmarking AI Models on Real-World Prediction Markets
(2604.07355 - Zhang et al., 28 Mar 2026) in Section 7.4, Polymarket Results