2 25 44

Juan CM

jucamohedano

AI & ML interests

AI Systems MSc at Trento 🚀🤖

Recent Activity

updated a collection 1 day ago

Model merging

updated a collection 1 day ago

Model search via model weights

liked a Space 3 days ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

upvoted 4 articles about 1 month ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7

• 98

Article

KV Cache from scratch in nanoVLM

Jun 4

• 98

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 558

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

• 225

upvoted a collection 4 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 247

upvoted 2 papers 5 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 35

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28 • 46

upvoted a collection 9 months ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 166

upvoted an article 9 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 1.14k

upvoted a paper 9 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 246

upvoted 2 articles 9 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.31k

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 186

upvoted 2 articles over 1 year ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 272

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

•

Jun 23, 2024

• 36

upvoted a collection over 1 year ago

[lecture artifacts] aligning open language models

Collection

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 57

upvoted 5 articles over 1 year ago

Article

Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks

•

Feb 21, 2024

• 18

Article

Design choices for Vision Language Models in 2024

•

Apr 16, 2024

• 33

Article

Vision Language Models Explained

Apr 11, 2024

• 479

Article

Mixture of Experts Explained

Dec 11, 2023

• 948

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 64

Juan CM

AI & ML interests

Recent Activity

Organizations

jucamohedano's activity

Vision Language Model Alignment in TRL ⚡️

KV Cache from scratch in nanoVLM

Vision Language Models (Better, Faster, Stronger)

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Introducing smolagents: simple agents that write actions in code.

Open-source DeepResearch – Freeing our search agents

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

PaliGemma – Google's Cutting-Edge Open Vision Language Model

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks

Design choices for Vision Language Models in 2024

Vision Language Models Explained

Mixture of Experts Explained

Fine-tune Llama 2 with DPO