Ai - a chethan62 Collection

Ai

updated 9 days ago

bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • 6B • Updated Jul 28, 2025 • 148k • 658
rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 275k • 1.23k
Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated Sep 8, 2025 • 48 • 114
OmniSVG/OmniSVG

Text Generation • Updated Jul 21, 2025 • 246 • 188
LiquidAI/LFM2-VL-450M

Image-Text-to-Text • 0.5B • Updated Jan 5 • 8.56k • 144
Running

19

onnx-asr demo

🐢

ASR demo using onnx-asr
Running on Zero

43

Canary 1B Flash

🐤

Canary 1B Flash demo
AIDC-AI/Ovis2.5-9B

Image-Text-to-Text • 9B • Updated Oct 24, 2025 • 4.04k • 302
Running on Zero

Featured

80

LIA-X

🐠

Interactive Portrait Animation and Editing
Runtime error

341

NSFW Face Swap

📈

Swap faces in images and enhance them if desired
Running

Featured

1.74k

Realistic Text To Speech Unlimited

🔥

Free Text-To-Speech generator with Emotion control (OpenAI)
Runtime error

109

Ovis2.5 2B

📚

Lightweight vision for efficient deployment
lodestones/Chroma1-HD

Text-to-Image • Updated Oct 23, 2025 • 17.7k • 326
silveroxides/Chroma-GGUF

Text-to-Image • 9B • Updated Sep 11, 2025 • 8.28k • 233
Clybius/Chroma-GGUF

9B • Updated Apr 29, 2025 • 260 • 28
QuantStack/Chroma1-Base-GGUF

Text-to-Image • 9B • Updated Aug 23, 2025 • 544 • 8
stepfun-ai/Step-Audio-2-mini

Any-to-Any • 8B • Updated Sep 5, 2025 • 1.56k • 247
QuantStack/Chroma1-Flash-GGUF

Text-to-Image • 9B • Updated Aug 24, 2025 • 628 • 9
mradermacher/NuMarkdown-8B-Thinking-GGUF

8B • Updated Aug 7, 2025 • 1.45k • 13
Running on Zero

Featured

267

granite-docling-258M demo

📝

Extract and query structured data from document images
openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 1.09k • 790
CypressYang/SongBloom

Text-to-Audio • Updated Oct 11, 2025 • 778 • 125
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated Oct 10, 2025 • 53.6k • 798
decart-ai/Lucy-Edit-Dev

Video-to-Video • Updated Nov 20, 2025 • 521 • 326
XiaomiMiMo/MiMo-Audio-7B-Instruct

Any-to-Any • 8B • Updated Sep 23, 2025 • 1.93k • 149
Running on CPU Upgrade

76

MiMo-Audio-Chat

💬

Chat with Xiaomi MiMo-Audio using voice
QuantStack/Qwen-Image-Edit-2509-GGUF

Image-to-Image • 20B • Updated Oct 18, 2025 • 57k • 322
Running on Zero

739

IndexTTS 2 Demo

🏢

Generate expressive speech from text with voice and emotion controls
tencent/HunyuanImage-3.0

Text-to-Image • 83B • Updated 14 days ago • 635k • 637
Kwai-Klear/Klear-46B-A2.5B-Instruct

Text Generation • 46B • Updated Sep 7, 2025 • 12 • 81
deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated Sep 29, 2025 • 6.88k • 360
Running

Featured

180

HunyuanImage-3.0

📊

Generate images from prompts (PRO users only)
neuphonic/neutts-air

Text-to-Speech • 0.7B • Updated Jan 9 • 11.4k • 846
Kwai-Klear/Klear-46B-A2.5B-Base

Text Generation • 46B • Updated Sep 7, 2025 • 7 • 30
nineninesix/kani-tts-370m

Text-to-Speech • 0.4B • Updated Nov 2, 2025 • 878 • 156
LiquidAI/LFM2-Audio-1.5B

Audio-to-Audio • 1B • Updated 19 days ago • 185 • 345
kyutai/stt-2.6b-en

Automatic Speech Recognition • Updated Jun 26, 2025 • 118
Running

18

Fathom DeepResearch

📊

DeepResearch with the fathom search and synthesizer models
LiquidAI/LFM2-VL-1.6B

Image-Text-to-Text • 2B • Updated 19 days ago • 2.93k • 221
bartowski/LiquidAI_LFM2-8B-A1B-GGUF

Text Generation • 8B • Updated Oct 8, 2025 • 1.34k • 8
prithivMLmods/DeepCaption-VLA-V2.0-7B

Image-Text-to-Text • 8B • Updated Oct 15, 2025 • 16 • 7
numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated Nov 13, 2025 • 1.09M • 438
microsoft/BiomedParse

Updated Oct 10, 2025 • 340 • 103
prithivMLmods/Perseus-Doc-VL-0712

Image-Text-to-Text • 8B • Updated Oct 10, 2025 • 7 • 3
Running on Zero

Featured

221

Ovi [local]

🎥

Generate Hollywood Style Actors on your Local Machine
microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 1.16k • 362
fka/prompts.chat

Viewer • Updated 5 days ago • 1.2k • 17.7k • 9.59k
prithivMLmods/Kepler-Qwen3-4B-Super-Thinking

Text Generation • 4B • Updated Sep 27, 2025 • 5 • 5
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 7 days ago • 16.6k • 1.55k
facebook/MobileLLM-Pro

Text Generation • 1B • Updated Nov 11, 2025 • 274 • 159
Open-Bee/Bee-8B-RL

Image-Text-to-Text • 9B • Updated 11 days ago • 44.3k • 77
Running on Zero

MCP

199

Qwen3-VL-Outpost

🔥

Qwen3-VL / Qwen2.5-VL Demo
lightonai/LightOnOCR-1B-1025

Image-to-Text • Updated about 9 hours ago • 104k • 230
internlm/JanusCoderV-8B

Image-Text-to-Text • 9B • Updated Oct 30, 2025 • 28 • 13
moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 33k • 539
nvidia/omnivinci

Feature Extraction • Updated 13 days ago • 1.32k • 169
nvidia/audio-flamingo-3-hf

Audio-Text-to-Text • 8B • Updated 15 days ago • 102k • 171
tencent/HunyuanOCR

Image-Text-to-Text • 1.0B • Updated 30 days ago • 1.44M • 553
Running on A100

233

Omnilingual ASR Media Transcription

🌍

Transcribe audio/video to text in multiple languages
zai-org/GLM-TTS

Text-to-Speech • Updated about 1 month ago • 437 • 314
mradermacher/Dolphin-v2-GGUF

3B • Updated Dec 12, 2025 • 220 • 4
LiquidAI/LFM2-2.6B-Exp-GGUF

Text Generation • 3B • Updated Dec 26, 2025 • 27.9k • 61
Running

53

Nemotron Speech Streaming

🎤

Real-time speech recognition with NVIDIA Triton
Running on Zero

Featured

94

LightOnOCR 2 1B Demo

🐨

Extract text and tables from images or PDFs
prithivMLmods/GutenOCR-3B-AIO-GGUF

Image-Text-to-Text • 3B • Updated 17 days ago • 1.57k • 3
zai-org/GLM-OCR

Image-to-Text • Updated 3 days ago • 373k • 958
