30 61 203

Théo Gigant

gigant

https://giganttheo.github.io/

AI & ML interests

multimodal

Recent Activity

updated a dataset 14 days ago

gigant/pixelprose-bmpjpg

published a dataset 14 days ago

gigant/pixelprose-bmpjpg

updated a model about 1 month ago

gigant/bytes-tokenizer

View all activity

Organizations

upvoted a collection about 2 months ago

Hermes 4 Collection

Collection

11 items • Updated Sep 8 • 70

upvoted a paper 2 months ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published Aug 25 • 39

upvoted a paper 4 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1 • 78

upvoted an article 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 702

upvoted a collection 4 months ago

🧠 SmolLM3

Collection

Smol, multilingual, long-context reasoner • 14 items • Updated 21 days ago • 81

upvoted an article 4 months ago

Article

Efficient MultiModal Data Pipeline

Jul 8

• 58

upvoted an article 5 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

• 225

upvoted an article 6 months ago

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 557

upvoted 2 papers 7 months ago

Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published Apr 17 • 34

Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure

Paper • 2504.10049 • Published Apr 14 • 2

upvoted 2 articles 8 months ago

Article

Open R1: Update #3

and 9 others •

Mar 11

• 295

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 468

upvoted a paper 8 months ago

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7 • 79

upvoted 2 articles 8 months ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

and 3 others •

Mar 10

• 146

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 78

upvoted a paper 8 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 207

upvoted an article 8 months ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 186

upvoted 2 papers 9 months ago

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

Paper • 2502.06533 • Published Feb 10 • 17

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 243

upvoted an article 9 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 884

Théo Gigant

AI & ML interests

Recent Activity

Organizations

gigant's activity

SmolLM3: smol, multilingual, long-context reasoner

Efficient MultiModal Data Pipeline

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Vision Language Models (Better, Faster, Stronger)

Open R1: Update #3

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

SigLIP 2: A better multilingual vision language encoder

Open-R1: a fully open reproduction of DeepSeek-R1