mobiuslabsgmbh/CLIP-ViT-H-14-laion2B-2bit_g16_s128-HQQ Image Classification • Updated Aug 22 • 29 • 5
mobiuslabsgmbh/Llama-3.1-8B-Instruct_mxfp4_weights_calib_demo Text Generation • Updated Jun 26 • 53 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_nvfp4_weights_calib_demo Text Generation • Updated Jun 26 • 55 • 1
mobiuslabsgmbh/Qwen2.5-VL-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit Image-to-Text • Updated Jun 4 • 28 • 2
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_16bit Text Generation • Updated Jun 4 • 99 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_16bit Text Generation • Updated Jun 4 • 52 • 1
mobiuslabsgmbh/Qwen2.5-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit Text Generation • Updated Jun 4 • 52 • 2
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_32bit Text Generation • Updated Jun 4 • 100 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit Text Generation • Updated Jun 3 • 51 • 2
mobiuslabsgmbh/Meta-Llama-3-8B-Instruct_4bitgs64_hqq_hf Text Generation • 5B • Updated May 23 • 25 • 2
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1_4bitgs64_hqq_hf Text Generation • 25B • Updated Feb 10 • 38 • 1
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-2bitgs8-metaoffload-HQQ Text Generation • Updated Feb 5 • 25 • 19
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-2bit-metaoffload-HQQ Text Generation • Updated Feb 5 • 27 • 16