Paper page - Building Social World Models with Large Language Models
…To evaluate SWM, we introduce a benchmark, SWM-bench, derived from real-world prediction markets , specifically Kalshi and Polymarket . SWM-bench includes over 12k data points for social belief prediction tasks spanning…