MR-Align: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models Paper • 2510.24794 • Published Oct 27, 2025 • 31
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating Paper • 2412.18424 • Published Dec 24, 2024 • 1
From System 1 to System 2: A Survey of Reasoning Large Language Models Paper • 2502.17419 • Published Feb 24, 2025 • 3
SOLIDGEO: Measuring Multimodal Spatial Math Reasoning in Solid Geometry Paper • 2505.21177 • Published May 27, 2025 • 1
TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression Paper • 2506.02678 • Published Jun 3, 2025 • 5
MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts Paper • 2502.20808 • Published Feb 28, 2025 • 1
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning Paper • 2506.08989 • Published Jun 10, 2025 • 14