Peng

pennlio

pennlio111

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

InstantX/CSGO

liked a model about 1 month ago

qth/DEADiff

upvoted a paper 4 months ago

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models

View all activity

Organizations

liked 2 models about 1 month ago

InstantX/CSGO

Text-to-Image • Updated Sep 18, 2024 • 182 • 38

qth/DEADiff

Updated Apr 3, 2024 • 9

upvoted a paper 4 months ago

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models

Paper • 2508.21365 • Published Aug 29, 2025 • 29

upvoted 2 papers 8 months ago

LLark: A Multimodal Foundation Model for Music

Paper • 2310.07160 • Published Oct 11, 2023 • 2

TALKPLAY: Multimodal Music Recommendation with Large Language Models

Paper • 2502.13713 • Published Feb 19, 2025 • 4

liked 2 models over 1 year ago

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 706 • 2.38k

gradientai/Llama-3-8B-Instruct-Gradient-1048k

Text Generation • 8B • Updated Oct 29, 2024 • 11.3k • 679

liked a dataset over 1 year ago

m-a-p/COIG-CQIA

Viewer • Updated Apr 18, 2024 • 44.7k • 5.12k • 691

upvoted an article over 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

•

390

liked 2 models over 1 year ago

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 1.8M • • 6.42k

unsloth/llama-3-8b-bnb-4bit

Text Generation • 8B • Updated Jan 7, 2025 • 59.4k • 202

liked 2 models about 2 years ago

stabilityai/stable-diffusion-x4-upscaler

Updated Jul 5, 2023 • 42.3k • 719

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 1.78M • • 7.32k

liked a model over 2 years ago

Vision-CAIR/MiniGPT-4

Updated Apr 19, 2023 • 428

liked a dataset over 2 years ago

fka/awesome-chatgpt-prompts

Viewer • Updated about 20 hours ago • 1k • 19.3k • 9.55k

updated a model over 2 years ago

pennlio/test

Updated May 22, 2023

Peng

AI & ML interests

Recent Activity

Organizations

pennlio's activity

Illustrating Reinforcement Learning from Human Feedback (RLHF)