The Jagged AI Frontier is a Data Frontier • Why AI capabilities are shaped by data availability • Running • 16
The Smol Training Playbook • The secrets to building world-class LLMs • Runtime error • Featured • 2.95k
HuggingFaceTB/SmolLM2-1.7B-Instruct-16k • Text Generation • 2B • Updated Feb 21, 2025 • 155 • 9
HuggingFaceTB/SmolLM2-1.7B-Instruct • Text Generation • 2B • Updated Apr 21, 2025 • 50.7k • 708
FineWeb: decanting the web for the finest text data at scale • Generate high-quality text data for LLMs using FineWeb • Running • Featured • 1.28k