kk
zhangyikai
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training
upvoted
a
paper
5 days ago
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
updated
a Space
5 days ago
Now-Join-Us/Generalist-Value-Model-V0