Pavlo Molchanov PRO
pmolchanov
AI & ML interests
Efficiency in Multi-Modal LLMs
Recent Activity
upvoted
a
collection
1 day ago
NVIDIA Nemotron v3
authored
a paper
12 days ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization