Update README.md
README.md CHANGED
@@ -17,16 +17,14 @@ base_model:
 pipeline_tag: text-generation
 ---
 
-#
+# Quantization Recipe
+
+First, install the required packages:
 ```
 pip install git+https://github.com/huggingface/transformers@main
 pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cu126
-pip install vllm --pre --extra-index-url https://wheels.vllm.ai/nightly
 ```
 
-Also need to install lm-eval from source: https://github.com/EleutherAI/lm-evaluation-harness#install
-
-# Quantization Recipe
 We used the following code to get the quantized model:
 
 ```
@@ -119,6 +117,8 @@ Hello! As an AI, I don't have consciousness in the way humans do, but I am fully
 
 We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) to evaluate the quality of the quantized model.
 
+lm-eval needs to be installed from source: https://github.com/EleutherAI/lm-evaluation-harness#install
+
 ## baseline
 ```
 lm_eval --model hf --model_args pretrained=microsoft/Phi-4-mini-instruct --tasks hellaswag --device cuda:0 --batch_size 64
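The quantization code itself falls outside this hunk, so it is not reproduced here. For readers new to the idea, the sketch below illustrates what weight-only quantization (the technique the torchao recipe applies) does in plain PyTorch: symmetric per-output-channel int8 rounding of a weight matrix. This is an illustration under stated assumptions, not the README's actual recipe, and the function names are hypothetical.

```python
import torch

def quantize_weight_int8(w: torch.Tensor):
    # Symmetric per-output-channel quantization: pick one scale per row
    # so that the row's largest-magnitude weight maps to 127.
    scale = w.abs().amax(dim=1, keepdim=True) / 127.0
    q = torch.clamp(torch.round(w / scale), -128, 127).to(torch.int8)
    return q, scale

def dequantize(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    # Recover an approximation of the original weights.
    return q.to(torch.float32) * scale

w = torch.randn(4, 16)
q, scale = quantize_weight_int8(w)
w_hat = dequantize(q, scale)
# Per-element reconstruction error is bounded by half of the row's scale,
# since only the rounding step loses information.
max_err = (w - w_hat).abs().max().item()
```

Production recipes (torchao's int4/int8 configs) add grouping, packed storage, and fused kernels on top of this basic round-to-scale idea.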