Paper page - Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
…Our proposed framework supports a wide range of logics: from simple implication-only logic ("if-then") towards more expressive first-order reasoning with conjunction ("and"), disjunction ("or"), negation ("not"), and universal quantification…