The best-performing AI models now achieve 97.1% on GSM8K (Zhong et al., 2024), up from 74.4% in April 2022 (Wang et al., 2022), and 87.9% on MATH (Lei et al., 2024), up from 64.9% in ...
As for the biggest challenge yet to be solved, however, there seems to be little debate: “The Riemann Hypothesis has a ...