Model,Configuration,Attn (bits),FFN (bits),Emb (bits),Size (MB),Attn Layers,FFN Layers,Total Quantized
DeepSeek-R1-Distill-Qwen-1.5B,aggressive,4,8,8,6779.05,112,84,197
DeepSeek-R1-Distill-Qwen-1.5B,uniform_8bit,8,8,8,6779.05,112,84,197
DeepSeek-R1-Distill-Qwen-1.5B,very_aggressive,4,4,8,6779.05,112,84,197
Phi-3-mini-128k-instruct,aggressive,4,8,8,14576.26,64,64,129
Phi-3-mini-128k-instruct,uniform_8bit,8,8,8,14576.26,64,64,129
Phi-3-mini-128k-instruct,very_aggressive,4,4,8,14576.26,64,64,129
Falcon-E-3B-Base,aggressive,4,8,8,512.51,0,0,1
Falcon-E-3B-Base,uniform_8bit,8,8,8,512.51,0,0,1
Falcon-E-3B-Base,very_aggressive,4,4,8,512.51,0,0,1