Signed-off-by: Roger Wang <hey@rogerw.io>
This commit is contained in:
Roger Wang
2025-09-30 23:16:12 -07:00
parent 7d9af595f5
commit 1fda6f1ba7
+1 -1
View File
@@ -1028,7 +1028,7 @@ We highly recommend using vLLM for deployment and inference. All of our evaluati
```shell
# Launch vLLM model server
vllm serve rednote-hilab/dots.ocr --trust-remote-code --async-scheduling
vllm serve rednote-hilab/dots.ocr --trust-remote-code --async-scheduling --gpu-memory-utilization 0.95
# vLLM API Demo
# See demo/demo_vllm.py for details on parameter and prompt settings that help achieve the best output quality.