Thierry Herrmann

thierryh

AI & ML interests

deep learning, machine learning

Organizations

None yet

upvoted 2 articles 2 months ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

Mar 18, 2024

• 13

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

• 102

upvoted 3 articles 6 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 216

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 257

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

• 167

upvoted an article 7 months ago

Article

Faster Text Generation with Self-Speculative Decoding

Nov 20, 2024

• 62

upvoted 3 articles 8 months ago

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Dec 23, 2024

• 49

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

• 248

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

Oct 21, 2022

• 40