NEO1_0 Collection From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated 16 days ago • 8
Chinese Open Source Heatmap 🔥 • Running • 80 • Explore open source AI projects and their release activity over time
Qwen/Qwen3-VL-235B-A22B-Thinking Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 2.46M • 375
DINOv3 Web 🦖 • Running • Featured • 155 • Visualize rich, dense image features locally in your browser
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published Dec 22, 2025 • 65
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 86
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published Dec 16, 2025 • 42
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published Dec 15, 2025 • 74
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-7B Image-Text-to-Text • 8B • Updated Dec 9, 2025 • 1.48k • 4
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated Dec 9, 2025 • 1.55k • 4
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated Dec 10, 2025 • 3.2k • 10
sensenova/SenseNova-SI-1.1-Qwen3-VL-8B Image-Text-to-Text • 9B • Updated Dec 9, 2025 • 2.11k • 5