Toward Autonomous and Faithful Claim Verification via Online Reinforcement Learning
H0key
H0key
·
AI & ML interests
None yet
Organizations
models
10
H0key/Veri-R1_Llama3.2-3B-Instruct-OfflineRL
4B
•
Updated
•
4
H0key/Veri-R1_Llama3.2-3B-Instruct-OnlineRL
4B
•
Updated
•
4
H0key/Veri-R1_Qwen-3B-Instruct-OfflineRL
3B
•
Updated
•
7
H0key/Veri-R1_Qwen2.5-3B-Instruct-OnlineRL
3B
•
Updated
•
3
H0key/qwen2.5-3b-max1step30max3
3B
•
Updated
•
7
H0key/qwen2.5-3b-correctmax1
3B
•
Updated
•
9
H0key/qwen2.5-1.5b-ins
2B
•
Updated
•
7
H0key/qwen2.5-1.5b-4kdata
Updated
H0key/qwen2.5-3b-nolength
3B
•
Updated
•
9
H0key/qwen2.5-3b-ins
3B
•
Updated
•
5