2 7 10

Kamal Ali

Kamali-Lab

AI & ML interests

None yet

Recent Activity

upvoted a collection 30 days ago

LLaDA 2.0

liked a model 30 days ago

inclusionAI/LLaDA2.0-flash

upvoted a paper about 1 month ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

View all activity

Organizations

upvoted a collection 30 days ago

LLaDA 2.0

Collection

7 items • Updated about 23 hours ago • 37

liked a model 30 days ago

inclusionAI/LLaDA2.0-flash

Text Generation • 103B • Updated 6 days ago • 428 • 58

upvoted 2 papers about 1 month ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17 • 136

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12 • 117

liked a dataset about 1 month ago

Open-Bee/Honey-Data-15M

Viewer • Updated Nov 5 • 14.8M • 38.3k • 102

liked a dataset about 2 months ago

sequelbox/Raiden-DeepSeek-R1

Viewer • Updated Mar 12 • 62.9k • 281 • 47

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.68k

The secrets to building world-class LLMs

liked a model about 2 months ago

KaraKaraWitch/GoldDiamondGold-L33-70b

Text Generation • 71B • Updated Oct 20 • 85 • 4

New activity in 12bitmisfit/OpenAI_GPT-OSS-120B_Pruned_REAP_58B-GGUF about 2 months ago

Q8 Quant

#1 opened about 2 months ago by

Kamali-Lab

upvoted a collection 2 months ago

Cerebras REAP

Collection

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 6 days ago • 64

liked a model 2 months ago

cerebras/GLM-4.5-Air-REAP-82B-A12B

Text Generation • 82B • Updated Oct 21 • 7.7k • 102

New activity in cerebras/GLM-4.5-Air-REAP-82B-A12B 2 months ago

Fixed Incorrect Parameter Count in README.md

#2 opened 2 months ago by

Kamali-Lab

liked a model 4 months ago

LatitudeGames/Wayfarer-2-12B

Text Generation • 12B • Updated Sep 3 • 130 • 60

liked 3 models 5 months ago

upvoted an article 5 months ago

Article

All LLMs Will Be Sparse BitNet Hybrids

May 14

•

upvoted a paper 6 months ago

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Paper • 2507.01957 • Published Jul 2 • 21

upvoted an article 8 months ago

Article

Bamba-9B-v2 - Fast and powerful!

Apr 29

•

Kamal Ali

AI & ML interests

Recent Activity

Organizations

Kamali-Lab's activity

The Smol Training Playbook

Q8 Quant

Fixed Incorrect Parameter Count in README.md

All LLMs Will Be Sparse BitNet Hybrids

Bamba-9B-v2 - Fast and powerful!