Changhao (lichangh20)
1 follower · 3 following
https://lichangh20.github.io/
AI & ML interests
RL, Agent, Efficient ML
Recent Activity
upvoted an article about 1 month ago: Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
upvoted a paper about 2 months ago: Matryoshka: Learning to Drive Black-Box LLMs with LLMs
upvoted a paper 3 months ago: MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline
lichangh20's datasets (15)
lichangh20/s1K_initial_filtered_for_llama8b • Updated May 2 • 1k • 6
lichangh20/olympiadbench • Updated Apr 22 • 674 • 16
lichangh20/minervamath • Updated Apr 22 • 272 • 8
lichangh20/s1K_simplified_filtered_for_adapter • Updated Mar 24 • 927 • 14
lichangh20/s1K_initial_filtered_for_qwen7b_simplified_summarized • Updated Mar 15 • 997 • 7
lichangh20/s1K_initial_filtered_for_qwen7b_summarized • Updated Mar 12 • 997 • 11
lichangh20/s1K_filtered_for_qwen7b_sft • Updated Mar 11 • 899 • 14
lichangh20/s1k_eval_sampled_1of12 • Updated Mar 9 • 77 • 8
lichangh20/s1k_train_sampled_1of12 • Updated Mar 9 • 77 • 8
lichangh20/gpqa_sampled_1of3 • Updated Mar 9 • 66 • 12
lichangh20/openai_math_sampled_1of5 • Updated Mar 9 • 100 • 17
lichangh20/s1K_initial_filtered_for_adapter • Updated Mar 5 • 929 • 7
lichangh20/s1K_initial_filtered_for_qwen7b • Updated Feb 21 • 1k • 10
lichangh20/s1k_filtered_eval1K • Updated Feb 12 • 929 • 12
lichangh20/s1k_filtered_full59K • Updated Feb 12 • 54.2k • 8