sian cao
sonald
AI & ML interests
AI, big data, OS
Recent Activity
upvoted an article 7 days ago
SmolVLM Grows Smaller – Introducing the 256M & 500M Models!
liked a Space 13 days ago
HuggingFaceH4/blogpost-scaling-test-time-compute
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization