Menan Velayuthan

velmen

AI & ML interests

Machine learning with graphs

Recent Activity

reacted to Jaward's post with ❤️ 25 days ago
nanoBLT: A simplified, lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 layers deep (n_layers_encoder, n_layers_latent, n_layers_decoder) and is trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
upvoted an article 26 days ago
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
upvoted an article about 1 month ago
Gotchas in Tokenizer Behavior Every Developer Should Know

Organizations

The National Languages Processing Centre, nanochat students, MVA+IASD LLM for code and proof

New activity in nanochat-students/README 3 months ago

Let's Gooooo! Let us know if you're on board.

😎 1
14
#1 opened 3 months ago by burtenshaw
New activity in GAIR/MathPile about 2 years ago

Issue with TypeError in GAIR/MathPile Dataset Loading

👍 1
5
#2 opened about 2 years ago by BaiXue