amd/Mixtral-8x7B-Instruct-v0.1-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8

Mixtral-8x7B-Instruct-v0.1-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8 / model-00006-of-00008.safetensors

Commit History

update Quark quantized Mixtral-8x7B-Instruct-v0.1 AMP model with better accuracies (#3)

d377d50
verified

XuebinWang committed on Sep 23

update the AMP model with better accuracy numbers and lower effective bitwidth (5.6) (#2)

1d1cfda
verified

XuebinWang committed on Sep 19