Update README.md
Browse files
README.md
CHANGED
|
@@ -26,8 +26,7 @@ from INT4 checkpoint from 36.98 to 43.13, while bfloat16 baseline accuracy is 46
|
|
| 26 |
# Inference with vLLM
|
| 27 |
Install vllm nightly and torchao nightly to get some recent changes:
|
| 28 |
```
|
| 29 |
-
pip install
|
| 30 |
-
pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cu126
|
| 31 |
```
|
| 32 |
|
| 33 |
## Serving
|
|
@@ -125,9 +124,8 @@ Install the required packages:
|
|
| 125 |
```Shell
|
| 126 |
# for compatibility with modeling file in checkpoint
|
| 127 |
pip install transformers==4.53.0
|
| 128 |
-
pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cu126
|
| 129 |
-
pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126
|
| 130 |
pip install accelerate
|
|
|
|
| 131 |
```
|
| 132 |
|
| 133 |
|
|
|
|
| 26 |
# Inference with vLLM
|
| 27 |
Install vllm nightly and torchao nightly to get some recent changes:
|
| 28 |
```
|
| 29 |
+
pip install --pre torchao torch vllm --index-url https://download.pytorch.org/whl/nightly/cu128
|
|
|
|
| 30 |
```
|
| 31 |
|
| 32 |
## Serving
|
|
|
|
| 124 |
```Shell
|
| 125 |
# for compatibility with modeling file in checkpoint
|
| 126 |
pip install transformers==4.53.0
|
|
|
|
|
|
|
| 127 |
pip install accelerate
|
| 128 |
+
pip install --pre torchao torch --index-url https://download.pytorch.org/whl/nightly/cu128
|
| 129 |
```
|
| 130 |
|
| 131 |
|