Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Menan Velayuthan's picture
3 8 2

Menan Velayuthan

velmen
ramithuh's profile picture shayarigo's profile picture webxos's profile picture
·

AI & ML interests

Machine learning with graphs

Recent Activity

reacted to Jaward's post with ❤️ 26 days ago
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
upvoted an article 28 days ago
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
upvoted an article about 1 month ago
Gotchas in Tokenizer Behavior Every Developer Should Know
View all activity

Organizations

The National Languages Processing Centre's profile picture nanochat students's profile picture MVA+IASD LLM for code and proof's profile picture

liked a Space about 1 month ago
Running
Featured
44

Porting nanochat to Transformers: an AI modeling history lesson

📝
44

Learn about ML and Transformers through nanochat

liked a model over 2 years ago

facebook/mbart-large-50-many-to-many-mmt

Translation • 0.6B • Updated Sep 28, 2023 • 105k • • 402
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs