Update README.md to emphasize use of an LLM inference engine.
#8
by QipengGuo - opened

README.md CHANGED
```diff
@@ -60,6 +60,9 @@ temperature = 0.8
 
 ### Serving
 
+> [!IMPORTANT]
+> Running a trillion-parameter model with the native Hugging Face forward method is challenging. We strongly recommend using an LLM inference engine (such as LMDeploy, vLLM, or sglang) to host Intern-S1-Pro and to access the model via its API.
+
 Intern-S1-Pro can be deployed using any of the following LLM inference frameworks:
 
 - LMDeploy
```
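For reference, the API-based access the new note recommends typically looks like the sketch below. It assumes an OpenAI-compatible endpoint has already been started, e.g. with `lmdeploy serve api_server internlm/Intern-S1-Pro`; the host, port, and served model name are illustrative assumptions, not part of this PR.

```python
# Minimal sketch: chat with a served Intern-S1-Pro through the
# OpenAI-compatible API that LMDeploy, vLLM, and sglang all expose.
# The base_url and model name below are assumptions; adjust them to
# match your own deployment.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:23333/v1",   # assumed LMDeploy default port
    api_key="not-needed-for-local-server",  # dummy key for a local server
)

response = client.chat.completions.create(
    model="internlm/Intern-S1-Pro",  # assumed served model id
    messages=[{"role": "user", "content": "Summarize what Intern-S1-Pro is."}],
    temperature=0.8,  # matches the sampling setting shown in the hunk header
)
print(response.choices[0].message.content)
```

Serving behind an OpenAI-compatible API keeps client code identical across LMDeploy, vLLM, and sglang, which is why the note recommends API access over calling the native Hugging Face forward method directly.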