nvidia/DeepSeek-R1-FP4

Text Generation

How to reproduce the accuracy result?

#15 opened 3 months ago by

quantize deepseek-r1-0528 please

#14 opened 5 months ago by

make model generate think tag

#13 opened 6 months ago by

Update config.json

#12 opened 6 months ago by

Can this model run on A800?

#10 opened 8 months ago by

FP4 in attention proj

#9 opened 8 months ago by

Can this model run on Hopper GPU?

#8 opened 8 months ago by

Can this model work with vLLM?

#7 opened 8 months ago by

Request for Detailed Benchmarking Setup with TensorRT-LLM on B200

#6 opened 8 months ago by

Benchmark results compared to orig fp8 / int4 quants etc?

#1 opened 8 months ago by