Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Nguyen Vy
ntthuyvy73
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
16 days ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
published
a model
29 days ago
ntthuyvy73/Qwen3-4B_SFT-MCQ-v1
published
a model
about 1 month ago
ntthuyvy73/Qwen3-4B-RLHF-GRPO_v7_lora_merge
View all activity
Organizations
models
20
Sort: Recently updated
ntthuyvy73/Qwen3-4B_SFT-MCQ-v1
Updated
29 days ago
ntthuyvy73/Qwen3-4B-RLHF-GRPO_v7_lora_merge
Updated
Nov 14
ntthuyvy73/Qwen3-4B-RLHF-DPO_v7_lora_merge
Updated
Nov 14
ntthuyvy73/Qwen3-4B-RLHF-GRPO_v7
4B
•
Updated
Nov 13
•
21
ntthuyvy73/Qwen3-4B-RLHF-DPO_v7
Updated
Nov 13
ntthuyvy73/Qwen3-4B_RLHF-SFT-v7
Text Generation
•
4B
•
Updated
Nov 11
•
12
ntthuyvy73/Qwen3-4B-RLHF-SFT_v6
Text Generation
•
4B
•
Updated
Nov 10
•
5
ntthuyvy73/Qwen3-1.7B_RLHF_SFT_full
2B
•
Updated
Nov 10
•
4
ntthuyvy73/Qwen3-1.7B_RLHF_SFT
Updated
Nov 10
ntthuyvy73/Qwen3-4B-RLHF-SFT_v4
Text Generation
•
4B
•
Updated
Nov 9
•
4
View 20 models
datasets
1
ntthuyvy73/vlaw-train
Viewer
•
Updated
Jul 2
•
57.5k
•
22