Yadnyesh Chakane

ydnysh

AI & ML interests

RL and Reasoning, ML Systems and Inference, Mechanistic Interpretability, SciML, Flow and Diffusion Models, Robotics

Recent Activity

upvoted an article 11 days ago

AI for Food Allergies

upvoted an article 29 days ago

Putting RL back in RLHF

upvoted an article 29 days ago

There is no such thing as a tokenizer-free lunch

View all activity

Organizations

upvoted an article 11 days ago

Article

AI for Food Allergies

and 3 others •

11 days ago

• 27

upvoted 2 articles 29 days ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 105

Article

There is no such thing as a tokenizer-free lunch

•

Sep 25

• 84

upvoted an article about 1 month ago

Article

`LeRobotDataset`: Bringing large-scale datasets to lerobot

Sep 16

• 44

upvoted an article 6 months ago

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 553

upvoted 2 collections 7 months ago

Unsloth 4-bit Dynamic Quants

Collection

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 26 days ago • 87

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 172

upvoted 4 papers 7 months ago

Scaling Laws for Downstream Task Performance of Large Language Models

Paper • 2402.04177 • Published Feb 6, 2024 • 19

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

Paper • 2402.01739 • Published Jan 29, 2024 • 28

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 129

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 420

upvoted a collection 7 months ago

The Deepseek AI Collection

Collection

Papers and Models by Deepseek AI • 7 items • Updated Apr 4 • 1

upvoted a paper over 1 year ago

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9, 2024 • 56

Yadnyesh Chakane

AI & ML interests

Recent Activity

Organizations

ydnysh's activity

AI for Food Allergies

Putting RL back in RLHF

There is no such thing as a tokenizer-free lunch

`LeRobotDataset`: Bringing large-scale datasets to lerobot

Vision Language Models (Better, Faster, Stronger)