Hugging Face
Puranjay Datta
Puranjay14
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
Upvoted an article 10 days ago: Deriving the DPO Loss from First Principles
Upvoted an article 15 days ago: Deriving the PPO Loss from First Principles
Updated a model about 2 years ago: Puranjay14/a2c-PandaReachDense-v3
Organizations
Puranjay14's activity
Upvoted an article 10 days ago
Article: Deriving the DPO Loss from First Principles
12 days ago • 6
Upvoted an article 15 days ago
Article: Deriving the PPO Loss from First Principles
16 days ago • 33
Updated 11 models about 2 years ago
Puranjay14/a2c-PandaReachDense-v3 • Updated Jan 7, 2024
Puranjay14/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated Jan 6, 2024
Puranjay14/pp01 • Reinforcement Learning • Updated Jan 6, 2024
Puranjay14/ppo-SnowballTarget • Reinforcement Learning • Updated Jan 6, 2024 • 3
Puranjay14/MLAgents-Pyramids • Reinforcement Learning • Updated Jan 6, 2024 • 8
Puranjay14/Reinforce-1 • Reinforcement Learning • Updated Jan 6, 2024
Puranjay14/Reinforce-0 • Reinforcement Learning • Updated Jan 6, 2024
Puranjay14/atari • Reinforcement Learning • Updated Jan 6, 2024 • 3
Puranjay14/ppo-LunarLander-v2 • Reinforcement Learning • Updated Jan 6, 2024 • 2
Puranjay14/Taxi • Reinforcement Learning • Updated Jan 2, 2024
Puranjay14/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Jan 2, 2024