The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝 Space • Running on CPU Upgrade • 1.16k
⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch Article • By zamal • Jun 28 • 20
mmBERT: a modern multilingual encoder Collection • mmBERT is trained on 3T tokens from over 1,800 languages, achieving SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9 • 48
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning Paper • 2509.06888 • Published Sep 8 • 12
DINOv3 Collection • DINOv3: foundation models producing excellent dense features, outperforming SotA without fine-tuning (https://arxiv.org/abs/2508.10104) • 13 items • Updated Aug 21 • 367
MobileLLM-R1 Collection • MobileLLM-R1, a series of sub-billion-parameter reasoning models • 7 items • Updated 21 days ago • 21