Juan CM's picture

Juan CM

jucamohedano

·

AI & ML interests

AI Systems MSc at Trento 🚀🤖

Recent Activity

updated a collection 28 days ago

upvoted an article 28 days ago

Vision Language Model Alignment in TRL ⚡️

upvoted an article 29 days ago

KV Cache from scratch in nanoVLM

View all activity

Organizations

updated a collection 28 days ago

Model merging

1 item • Updated 28 days ago

upvoted an article 28 days ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7

• 97

upvoted an article 29 days ago

Article

KV Cache from scratch in nanoVLM

Jun 4

• 98

upvoted 2 articles about 1 month ago

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 555

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

• 224

liked a Space 4 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 4 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 248

updated a collection 5 months ago

Model search via model weights

2 items • Updated Jun 4

upvoted 2 papers 5 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 35

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28 • 46

upvoted a collection 9 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 166

upvoted an article 9 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 1.14k

upvoted a paper 9 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 243

upvoted 2 articles 9 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.31k

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 186

updated a model 12 months ago

jucamohedano/paligemma_a-okvqa

Updated Nov 15, 2024 • 1

updated a model about 1 year ago

jucamohedano/char-lstm-shakespeare

Updated Sep 22, 2024

liked a dataset about 1 year ago

karpathy/tiny_shakespeare

Updated Jan 18, 2024 • 6.77k • 65

updated a model about 1 year ago

jucamohedano/char-lstm-shakespeare_

Updated Sep 21, 2024