Irene Solaiman

irenesolaiman

AI & ML interests

None yet

Recent Activity

upvoted 2 papers about 2 hours ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 22

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 130

upvoted 3 articles about 2 hours ago

Creating custom kernels for the AMD MI300

Article • Jul 9 • 51

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Article • Aug 18 • 84

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

Article • Jun 12 • 148

upvoted 3 papers about 4 hours ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 421

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 72

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published 12 days ago • 66

upvoted 3 articles 4 days ago

On the Shifting Global Compute Landscape

Article • and 1 other • 4 days ago • 23

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

Article • and 2 others • Jun 2 • 14

5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub

Article • and 1 other • Jul 15 • 24

upvoted a paper 5 days ago

The Gradient of Generative AI Release: Methods and Considerations

Paper • 2302.04844 • Published Feb 5, 2023 • 8

upvoted an article 12 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 884

upvoted 4 articles about 2 months ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Article • Sep 11 • 161

SmolVLM - small yet mighty Vision Language Model

Article • Nov 26, 2024 • 378

Welcome EmbeddingGemma, Google's new efficient embedding model

Article • Sep 4 • 252

Welcome GPT OSS, the new open-source model family from OpenAI!

Article • Aug 5 • 503

upvoted 3 articles 3 months ago

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

Article • and 5 others • Aug 4 • 28

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Article • Jul 29 • 191

What is the Hugging Face Community Building?

Article • and 2 others • Jul 15 • 13
