Dongseong Hwang's picture

3

Dongseong Hwang

dongseong

·

https://www.linkedin.com/in/dongseong-hwang-5ba79547/

AI & ML interests

Speech, RL, AGI

Recent Activity

upvoted an article 18 days ago

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

published a model 6 months ago

dongseong/Qwen2.5-1.5B-Open-R1-Distill

published a model 7 months ago

dongseong/qwen1.5-tiny

View all activity

Organizations

None yet

upvoted an article 18 days ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

By

•

Apr 16

• 50

published a model 6 months ago

dongseong/Qwen2.5-1.5B-Open-R1-Distill

published a model 7 months ago

dongseong/qwen1.5-tiny

published a model 8 months ago

dongseong/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

upvoted a paper over 1 year ago

Thermodynamic Natural Gradient Descent

Paper • 2405.13817 • Published May 22, 2024 • 16

authored a paper over 1 year ago

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14, 2024 • 43

upvoted a paper over 1 year ago

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14, 2024 • 43