Skip to content

Commit 282efc1

Browse files
committed
Document Sync by Tina
1 parent 8c92942 commit 282efc1

File tree

3 files changed

+41
-189
lines changed

3 files changed

+41
-189
lines changed

docs/stable/cli/cli_api.md

Lines changed: 37 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,9 @@ After setting up the ServerlessLLM cluster, you can use the commands listed belo
4343
```
4444

4545
### sllm-cli deploy
46-
Deploy a model using a configuration file or model name.
46+
Deploy a model using a configuration file or model name, with options to overwrite default configurations. The configuration file requires minimal specifications, as sensible defaults are provided for advanced configuration options.
47+
48+
For more details on the advanced configuration options and their default values, please refer to the [Example Configuration File](#example-configuration-file-configjson) section.
4749

4850
##### Usage
4951
```bash
@@ -52,18 +54,49 @@ sllm-cli deploy [OPTIONS]
5254

5355
##### Options
5456
- `--model <model_name>`
55-
- Model name to deploy with default configuration. The model name must be a huggingface pretrained model name. You can find the list of available models [here](https://huggingface.co/models).
57+
- Model name to deploy with default configuration. The model name must be a Hugging Face pretrained model name. You can find the list of available models [here](https://huggingface.co/models).
5658

5759
- `--config <config_path>`
58-
- Path to the JSON configuration file.
60+
- Path to the JSON configuration file. The configuration file can be incomplete, and missing sections will be filled in by the default configuration.
5961

60-
##### Example
62+
- `--backend <backend_name>`
63+
- Overwrite the backend in the default configuration.
64+
65+
- `--num_gpus <number>`
66+
- Overwrite the number of GPUs in the default configuration.
67+
68+
- `--target <number>`
69+
- Overwrite the target concurrency in the default configuration.
70+
71+
- `--min_instances <number>`
72+
- Overwrite the minimum instances in the default configuration.
73+
74+
- `--max_instances <number>`
75+
- Overwrite the maximum instances in the default configuration.
76+
77+
##### Examples
78+
Deploy using a model name with default configuration:
6179
```bash
6280
sllm-cli deploy --model facebook/opt-1.3b
81+
```
82+
83+
Deploy using a configuration file:
84+
```bash
6385
sllm-cli deploy --config /path/to/config.json
6486
```
6587

88+
Deploy using a model name and overwrite the backend:
89+
```bash
90+
sllm-cli deploy --model facebook/opt-1.3b --backend transformers
91+
```
92+
93+
Deploy using a model name and overwrite multiple configurations:
94+
```bash
95+
sllm-cli deploy --model facebook/opt-1.3b --num_gpus 2 --target 5 --min_instances 1 --max_instances 5
96+
```
97+
6698
##### Example Configuration File (`config.json`)
99+
This file can be incomplete, and missing sections will be filled in by the default configuration:
67100
```json
68101
{
69102
"model": "facebook/opt-1.3b",

docs/stable/cli/sllm_cli_doc.md

Lines changed: 0 additions & 181 deletions
This file was deleted.

docs/stable/getting_started/docker_quickstart.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,9 +6,9 @@ sidebar_position: 2
66

77
This guide will help you get started with the basics of using ServerlessLLM with Docker. Please make sure you have Docker installed on your system and have installed ServerlessLLM CLI following the [installation guide](./installation.md).
88

9-
## Pre-requirements
9+
## Pre-requisites
1010

11-
Ensure you have the following pre-requirements installed:
11+
Ensure you have the following pre-requisites:
1212

1313
1. **GPUs**: Ensure you have at least 2 GPUs available. If more GPUs are provided, you can adjust the number of workers and the number of devices assigned to each worker.
1414
2. **NVIDIA Docker Toolkit**: This allows Docker to use NVIDIA GPUs. You can find the installation guide [here](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html).
@@ -109,8 +109,8 @@ sllm-cli deploy --model facebook/opt-1.3b
109109
Expected output:
110110

111111
```plaintext
112-
INFO xx-xx xx:xx:xx deploy.py:36] Deploying model facebook/opt-1.3b with default configuration.
113-
INFO xx-xx xx:xx:xx deploy.py:49] Model registered successfully.
112+
INFO 08-01 07:38:12 deploy.py:36] Deploying model facebook/opt-1.3b with default configuration.
113+
INFO 08-01 07:39:00 deploy.py:49] Model registered successfully.
114114
```
115115

116116
### Step 6: Query the Model

0 commit comments

Comments
 (0)