Model,Configuration,Attn (bits),FFN (bits),Emb (bits),Size (MB),Attn Layers,FFN Layers,Total Quantized
DeepSeek-R1-Distill-Qwen-1.5B,aggressive,4,8,8,6779.05,112,84,197
DeepSeek-R1-Distill-Qwen-1.5B,uniform_8bit,8,8,8,6779.05,112,84,197
DeepSeek-R1-Distill-Qwen-1.5B,very_aggressive,4,4,8,6779.05,112,84,197
Phi-3-mini-128k-instruct,aggressive,4,8,8,14576.26,64,64,129
Phi-3-mini-128k-instruct,uniform_8bit,8,8,8,14576.26,64,64,129
Phi-3-mini-128k-instruct,very_aggressive,4,4,8,14576.26,64,64,129
Falcon-E-3B-Base,aggressive,4,8,8,512.51,0,0,1
Falcon-E-3B-Base,uniform_8bit,8,8,8,512.51,0,0,1
Falcon-E-3B-Base,very_aggressive,4,4,8,512.51,0,0,1