Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models Paper • 2602.04649 • Published 8 days ago • 10
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 8 days ago • 91
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss Paper • 2602.02493 • Published 10 days ago • 41
Shaping capabilities with token-level data filtering Paper • 2601.21571 • Published 14 days ago • 26
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated 29 days ago • 437
LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 10 days ago • 84
Project Atom Collection An ongoing research initiative into refining and scaling the Atom persona from 4B to 400B+ • 6 items • Updated 15 days ago • 2
End-to-End Test-Time Training for Long Context Paper • 2512.23675 • Published Dec 29, 2025 • 24
Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 296
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 119