BlueMO: A comprehensive collection of challenging mathematical olympiad problems and solutions from the renowned 'Little Blue Book' (小蓝书) series, tailored for evaluating the reasoning capabilities of language models.
benchmark
math
mathematics
dataset
question-answering
reasoning
mathematical-olympiad
large-language-models
llm
reasoning-agent
-
Updated
Jan 16, 2024 - TeX