Commit History

update Quark quantized Mixtral-8x7B-Instruct-v0.1 AMP model with better accuracies (#3)
d377d50
verified

XuebinWang commited on

update the AMP model with better accuracy numbers and lower effective bitwidth (5.6) (#2)
1d1cfda
verified

XuebinWang commited on