Frankentext: Stitching random text fragments into long-form narratives Paper ⢠2505.18128 ⢠Published May 23, 2025 ⢠4
Large-Scale Data Selection for Instruction Tuning Paper ⢠2503.01807 ⢠Published Mar 3, 2025 ⢠14
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Paper ⢠2502.18890 ⢠Published Feb 26, 2025 ⢠30
CLIPPER: Compression enables long-context synthetic data generation Paper ⢠2502.14854 ⢠Published Feb 20, 2025 ⢠11
Jamba 1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models ⢠2 items ⢠Updated Mar 6, 2025 ⢠87
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! ⢠30 items ⢠Updated Jun 12, 2024 ⢠250