Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated
a dataset
about 4 hours ago
mehuldamani/medDataset
published
a dataset
about 4 hours ago
mehuldamani/medDataset
updated
a dataset
about 18 hours ago
mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis
Organizations
None yet