Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking Paper • 2505.12667 • Published May 19, 2025 • 8
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper • 2503.10582 • Published Mar 13, 2025 • 24
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published Mar 11, 2025 • 32