Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? Paper • 2602.07055 • Published 7 days ago • 20
CellFlux: Simulating Cellular Morphology Changes via Flow Matching Paper • 2502.09775 • Published Feb 13, 2025
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks Paper • 2310.19677 • Published Oct 30, 2023
No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models Paper • 2510.03978 • Published Oct 4, 2025 • 4
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages Paper • 2003.07082 • Published Mar 16, 2020
Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning Paper • 2512.20934 • Published Dec 24, 2025
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR Paper • 2601.18207 • Published 17 days ago • 19
view post Post 1613 Are you familiar with reverse residual connections or looping in language models?Excited to share my Looped-GPT blog post and codebase 🚀https://github.com/sanyalsunny111/Looped-GPTTL;DR: looping during pre-training improves generalization.Plot shows GPT2 LMs pre-trained with 15.73B OWT tokensP.S. This is my first post here — I have ~4 followers and zero expectations for reach 😄 See translation 3 replies · 🧠 6 6 👍 3 3 + Reply
SkillFactory: Self-Distillation For Learning Cognitive Behaviors Paper • 2512.04072 • Published Dec 3, 2025 • 5
🍨 Gelato Collection From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents • 5 items • Updated Nov 15, 2025 • 1
🍨 Gelato Collection From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents • 5 items • Updated Nov 15, 2025 • 1