This model was quantized with llm-compressor and supports inference with vLLM.
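As a minimal sketch, the checkpoint could be served with vLLM's standard CLI; `<org>/<model-name>` is a placeholder for this repository's actual model id, which is not stated here:

```shell
# Install vLLM (llm-compressor checkpoints load natively in vLLM)
pip install vllm

# Serve the quantized model via vLLM's OpenAI-compatible server.
# Replace <org>/<model-name> with this repository's model id.
vllm serve <org>/<model-name>
```

Once the server is up, any OpenAI-compatible client can send requests to it.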

Safetensors
Model size: 41B params
Tensor types: F32, BF16, F8_E4M3, U8