1 2

Mehul Damani PRO

mehuldamani

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

updated a dataset about 4 hours ago

mehuldamani/medDataset

published a dataset about 4 hours ago

mehuldamani/medDataset

updated a dataset about 18 hours ago

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

View all activity

Organizations

None yet

Collections 1

Papers 4

models 148

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromFirstModel_weighAccMore

Updated 1 day ago

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromBase_weighAccMore

Updated 1 day ago

datasets 46

mehuldamani/medDataset

Viewer • Updated about 4 hours ago • 1.29M

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

Viewer • Updated about 18 hours ago • 2k • 15

mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain

Viewer • Updated 2 days ago • 2k • 9

mehuldamani/ambigQA

Viewer • Updated 5 days ago • 12k • 92

mehuldamani/judge-new-sft-instruct

Viewer • Updated 17 days ago • 100 • 8

mehuldamani/judge-new-sft-base

Viewer • Updated 17 days ago • 100 • 8

mehuldamani/judge-new-instruct

Viewer • Updated 17 days ago • 100 • 15

mehuldamani/judge-new-sft

Viewer • Updated 17 days ago • 100 • 20

mehuldamani/judge-new-base

Viewer • Updated 17 days ago • 100 • 18

mehuldamani/gen-tranches-v1-Instruct

Viewer • Updated 17 days ago • 22k • 21

View 46 datasets

Mehul Damani PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

mehuldamani/big-math-digits-v2-correctness

mehuldamani/hotpot-v2-correctness-7b

mehuldamani/orm-big-math-digits-v2-correctness

mehuldamani/big-math-digits-v2-brier

mehuldamani/big-math-digits-v2-correctness

mehuldamani/hotpot-v2-correctness-7b

mehuldamani/orm-big-math-digits-v2-correctness

mehuldamani/big-math-digits-v2-brier

Papers 4

models 148

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromFirstModel_weighAccMore

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromBase_weighAccMore

mehuldamani/qwen3_14b_ambigQA_rlcr_single

mehuldamani/qwen3_14b_ambigQA_rlcr_multiple_ambigQASpecificPrompt

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_ambigQASpecificPrompt

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_ogPrompt

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_ogPromptHiHI

mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_ambigqaPrompt

mehuldamani/qwen3_8b_ambigQA_rlcr_single_tryShorterAnswer

mehuldamani/qwen3_8b_ambigQA_rlcr_single

datasets 46

mehuldamani/medDataset

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain

mehuldamani/ambigQA

mehuldamani/judge-new-sft-instruct

mehuldamani/judge-new-sft-base

mehuldamani/judge-new-instruct

mehuldamani/judge-new-sft

mehuldamani/judge-new-base

mehuldamani/gen-tranches-v1-Instruct

Mehul Damani PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 4

models 148 Sort: Recently updated

datasets 46 Sort: Recently updated

models 148

datasets 46