Artem

LuLim

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Deriving the PPO Loss from First Principles

upvoted an article 25 days ago

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

upvoted a paper about 1 month ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

View all activity

Organizations

None yet

upvoted an article 4 days ago

Article

Deriving the PPO Loss from First Principles

7 days ago

•

upvoted an article 25 days ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

28 days ago

•

upvoted a paper about 1 month ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

upvoted a paper 7 months ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published May 28, 2025 • 24

upvoted a paper 11 months ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13, 2025 • 37

upvoted a paper over 1 year ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 89

upvoted a paper about 2 years ago

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

Paper • 2311.13073 • Published Nov 22, 2023 • 58

liked a Space over 2 years ago

MusicGen

🎵

5.07k

Generate music from text descriptions and optional melodies

Artem

AI & ML interests

Recent Activity

Organizations

LuLim's activity

Deriving the PPO Loss from First Principles

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

MusicGen