- DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
  Paper • 2401.14196 • Published • 68
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 434
- mHC: Manifold-Constrained Hyper-Connections
  Paper • 2512.24880 • Published • 208
蓋瑞王
gary109
AI & ML interests
GAN, Music, LLM
Recent Activity
Updated a collection 2 days ago: DeepSeek
Updated a model 7 days ago: gary109/qwen3-vl-4b-instruct-sft-seed3407-20251231-0201-merged_16bit
Published a model 7 days ago: gary109/qwen3-vl-4b-instruct-sft-seed3407-20251231-0201-merged_16bit
Organizations
None yet