File size: 646 Bytes
ee589d3 |
1 2 3 4 5 6 7 8 9 10 11 |
Model,Configuration,Attn (bits),FFN (bits),Emb (bits),Size (MB),Attn Layers,FFN Layers,Total Quantized
DeepSeek-R1-Distill-Qwen-1.5B,aggressive,4,8,8,6779.05,112,84,197
DeepSeek-R1-Distill-Qwen-1.5B,uniform_8bit,8,8,8,6779.05,112,84,197
DeepSeek-R1-Distill-Qwen-1.5B,very_aggressive,4,4,8,6779.05,112,84,197
Phi-3-mini-128k-instruct,aggressive,4,8,8,14576.26,64,64,129
Phi-3-mini-128k-instruct,uniform_8bit,8,8,8,14576.26,64,64,129
Phi-3-mini-128k-instruct,very_aggressive,4,4,8,14576.26,64,64,129
Falcon-E-3B-Base,aggressive,4,8,8,512.51,0,0,1
Falcon-E-3B-Base,uniform_8bit,8,8,8,512.51,0,0,1
Falcon-E-3B-Base,very_aggressive,4,4,8,512.51,0,0,1
|