24 30 9

yueliu1999

https://yueliu1999.github.io/

yueliu1999

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering

upvoted a paper about 2 months ago

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

upvoted a paper 3 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering

Paper • 2512.06915 • Published Dec 7, 2025 • 12

upvoted a paper about 2 months ago

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Paper • 2511.06209 • Published Nov 9, 2025 • 18

upvoted a paper 3 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 174

upvoted a paper 4 months ago

MAPO: Mixed Advantage Policy Optimization

Paper • 2509.18849 • Published Sep 23, 2025 • 26

upvoted a paper 5 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118

upvoted a paper 6 months ago

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published Jul 21, 2025 • 68

upvoted 3 papers 7 months ago

Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning

Paper • 2502.11962 • Published Feb 17, 2025 • 38

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published Jun 10, 2025 • 14

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Paper • 2506.02096 • Published Jun 2, 2025 • 52

upvoted 7 papers 8 months ago

upvoted a collection 8 months ago

GuardReasoner-VL

Collection

A reasoning-based VLM guard model • 6 items • Updated May 28, 2025 • 2

upvoted a paper 8 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120

upvoted 2 papers 9 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21, 2025 • 47

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Paper • 2504.13055 • Published Apr 17, 2025 • 19

yueliu1999

AI & ML interests

Recent Activity

Organizations

yueliu1999's activity