In a Training Loop 🔄

Daniel Fox

FlameF0X

https://flamefox.gattodev.tech/

AI & ML interests

Pre-training text generator. (Brother, I'm 17.) Please don't try to contact me.

Recent Activity

updated a model about 16 hours ago

FlameF0X/Qwen3-1.7b-Pro

published a model about 16 hours ago

FlameF0X/Qwen3-1.7b-Pro

liked a Space 1 day ago

ACE-Step/Ace-Step-v1.5

Organizations

upvoted a paper 2 days ago

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Paper • 2602.04649 • Published 8 days ago • 10

upvoted a paper 3 days ago

Reinforced Attention Learning

Paper • 2602.04884 • Published 8 days ago • 25

upvoted a paper 6 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 8 days ago • 91

upvoted a changelog 6 days ago

Changelog

Community Evals and Benchmark Repositories

7 days ago • 43

upvoted a paper 8 days ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published 10 days ago • 41

upvoted 2 papers 13 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 14 days ago • 26

How AI Impacts Skill Formation

Paper • 2601.20245 • Published 15 days ago • 8

upvoted a changelog 20 days ago

Changelog

Sort Datasets by Size

20 days ago • 82

upvoted 2 changelogs 21 days ago

Changelog

Sort Models by Parameter Size

21 days ago • 33

Changelog

MLX Hardware Compatibility

21 days ago • 44

upvoted a collection 21 days ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated 29 days ago • 437

upvoted a paper 26 days ago

Muon is Scalable for LLM Training

Paper • 2502.16982 • Published Feb 24, 2025 • 10

upvoted 2 collections about 1 month ago

💧 LFM2.5

Collection

Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 10 days ago • 84

Project Atom

Collection

An ongoing research initiative into refining and scaling the Atom persona from 4B to 400B+ • 6 items • Updated 15 days ago • 2

upvoted a changelog about 1 month ago

Changelog

HuggingChat for Papers

Jan 7 • 102

upvoted a paper about 1 month ago

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published Dec 29, 2025 • 24

upvoted an article about 2 months ago

Article

Continuous batching from first principles

Nov 25, 2025 • 324

upvoted a paper about 2 months ago

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 52

upvoted 2 articles about 2 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025 • 296

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025 • 119
