will's picture

12 21

will

willfalco

·

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

XiaomiMiMo/MiMo-V2-Flash:Great Model! - sglang mtp support for triton backend

new activity 10 days ago

QuantTrio/DeepSeek-V3.1-AWQ-Lite:[request] DeepSeek-V3.1-Terminus

new activity 11 days ago

lukealonso/MiniMax-M2-NVFP4:you know which nightly it worked with? because it does not with current one

View all activity

Organizations

None yet

New activity in XiaomiMiMo/MiMo-V2-Flash 1 day ago

Great Model! - sglang mtp support for triton backend

#19 opened 4 days ago by

chriswritescode

New activity in QuantTrio/DeepSeek-V3.1-AWQ-Lite 10 days ago

[request] DeepSeek-V3.1-Terminus

#3 opened 14 days ago by

New activity in lukealonso/MiniMax-M2-NVFP4 11 days ago

you know which nightly it worked with? because it does not with current one

#1 opened about 1 month ago by

New activity in QuantTrio/DeepSeek-V3.1-AWQ-Lite 13 days ago

random atrifacts on larger outputs

#4 opened 13 days ago by

liked 2 models 14 days ago

cyankiwi/Devstral-2-123B-Instruct-2512-AWQ-4bit

22B • Updated 14 days ago • 3.78k • 15

cerebras/DeepSeek-V3.2-REAP-345B-A37B

Text Generation • 345B • Updated 16 days ago • 1.8k • 28

New activity in Firworks/INTELLECT-3-nvfp4 16 days ago

is NVFP4 supported on sm120 (blackwell rtx pro 6000, rtx 5090 etc)?

#4 opened 25 days ago by

New activity in tencent/DeepSeek-V3.1-Terminus-W4AFP8 19 days ago

4 x RTX PRO 6000

#1 opened 24 days ago by

New activity in eousphoros/DeepSeek-V3.2-NVFP4 20 days ago

Is it possible to make smaller NVFP4 quant at 340-360GB to fit in 4x96gb?

#1 opened 23 days ago by

New activity in Intel/DeepSeek-V3.1-Terminus-int4-mixed-AutoRound 20 days ago

Question will it work in vllm or sglang with rtx 6000 blackwells? cuda arch sm120

#1 opened 2 months ago by

liked a model 20 days ago

Intel/DeepSeek-V3.1-Terminus-int4-mixed-AutoRound

Text Generation • 2B • Updated Sep 23 • 344 • 4

New activity in QuantTrio/DeepSeek-V3.1-AWQ-Lite 20 days ago

ooof this fits in 4x96gb can we get this for the new 3.2 Speciale ase well please :)

#2 opened 24 days ago by

New activity in QuantTrio/DeepSeek-V3.2-AWQ 21 days ago

Aww Man!

#1 opened 23 days ago by

liked a model 23 days ago

Kwaipilot/KAT-Dev-FP8

Text Generation • 33B • Updated Oct 10 • 12 • 4

New activity in miromind-ai/MiroThinker-v1.0-72B 26 days ago

slow by design?

#1 opened 26 days ago by

liked a model 26 days ago

Firworks/MiroThinker-v1.0-72B-nvfp4

42B • Updated Nov 19 • 10 • 1

liked a model 28 days ago

PrimeIntellect/INTELLECT-3-FP8

Text Generation • 107B • Updated 29 days ago • 2.36k • • 18

liked a model about 1 month ago

QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix

Text Generation • 69B • Updated Oct 3 • 1.11k • 4

New activity in QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite about 1 month ago

anyone ran this on blackwell?

#2 opened about 1 month ago by

liked a model about 1 month ago

QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite

Text Generation • 685B • Updated Oct 1 • 100 • 4